Smart error, Bios error messages – Dell PERC 4/SI User Manual
Page 66
SMART Error
describes issues related to the Self-Monitoring Analysis and Reporting Technology (SMART). SMART monitors the internal performance of all motors,
heads, and hard drive electronics and detects predictable hard drive failures.
Table 6-4. SMART Error
BIOS Error Messages
In PERC RAID controllers, the BIOS (option ROM) provides INT 13h functionality (disk I/O) for the logical drives connected to the controller, so that you can boot
from or access the drives without the need of a driver.
describes the error messages and warnings that display for the BIOS.
Table 6-5. BIOS Errors and Warnings
Rebuilding a hard disk
drive after a single drive
failure
If you have configured hot spares, the RAID controller automatically tries to use them to rebuild failed disks. Manual rebuild is
necessary if no hot spares with enough capacity to rebuild the failed drives are available.You must insert a drive with enough
storage into the subsystem before rebuilding the failed drive. You can use the BIOS Configuration Utility or Dell OpenManage
®
Array Manager to perform a manual rebuild of an individual drive.
Refer to
Rebuilding Failed Hard Drives
in
RAID Configuration and Management
for procedures for rebuilding a single hard disk
drive.
Rebuilding hard disk
drives after a multi-
drive failure
Multiple drive errors in a single array typically indicate a failure in cabling or connection and could involve the loss of data. It is
possible to recover the logical drive from a multiple drive failure. Perform the following steps to recover the logical drive:
1.
Shut down the system, check cable connections, and reset hard drives.
Be sure to follow safety precautions to prevent electrostatic discharge.
2.
If the system logs are available, try to identify the order in which the drives failed in the multiple drive failure scenario.
3.
Force the first drive online, then the second (if applicable), and continue till you reach the last disk.
4.
Perform a rebuild on the last disk.
You can use the BIOS Configuration Utility or Dell OpenManage
®
Array Manager to perform a manual rebuild of multiple drives.
See
Rebuilding Failed Hard Drives
in
RAID Configuration and Management
for procedures to rebuild a single hard disk drive.
A drive is taking longer
than expected to
rebuild.
An array may take longer to rebuild when under high stress; for example, when there is one rebuild I/O operation for every five
host I/O operations.
A node in a clustering
environment fails during
a rebuild.
In a clustering environment, if a node fails during a rebuild, the rebuild is re-started by another node. The rebuild on the second
mode starts at zero percent.
Problem
Suggested Solution
A SMART error is detected in a fault-tolerant
RAID array.
Perform the following steps:
1.
Force the hard disk drive offline.
2.
Replace it with a new drive.
3.
Perform a rebuild.
See
Rebuilding Failed Hard Drives
in
RAID Configuration and Management
for rebuild procedures.
A SMART error is detected in non-fault-
tolerant RAID array.
Perform the following steps:
1.
Back up your data.
2.
Delete the logical drive.
See
Deleting Logical Drives
in
RAID Configuration and Management
for the procedure for deleting a
logical drive.
3.
Replace the affected hard disk drive with a new drive.
4.
Recreate the logical drive.
See
Simple Array Setup
or
Advanced Array Setup
in
RAID Configuration and Management
for
procedures for creating logical drives.
5.
Restore the backup.
Message
Meaning
This warning displays after you disable the option ROM in the configuration utility so that the BIOS will
not hook Int13h and thus will not provide any I/O functionality to the logical drives.