beautypg.com

Statistics – HP Insight Management Agents User Manual

Page 54

background image

Hardware Errors—The Hardware Errors counter records the number of times that a physical
SCSI drive returned a Hardware Error status when a SCSI command was attempted. This error
status indicates unsuccessful termination of the SCSI command. The controller typically retries
this command several times before failing the drive.

If the number of hardware errors is not zero and the drive has failed, replace the drive. If the
counter is not zero and the drive is OK (has not failed), there may be an intermittent problem
that requires drive replacement. If you observe that the count is increasing over time, replace
the drive.

Not Ready Errors—When a physical drive returns a not ready status when it should be ready,
a Drive Not Ready Error occurs. This error could occur if a drive spins down unexpectedly or
if the drive never becomes ready after the spin up command is issued.

If the number of not ready errors is not zero and the drive has failed, replace the drive. If the
counter is not zero and the drive is OK (has not failed), there may be an intermittent problem
that requires drive replacement. If you observe that the count is increasing over time, replace
the drive.

Bad Target Errors—When a physical drive performs an action that does not conform to the
SCSI-2 port protocol, the SCSI port is reset.

If the number of bad target errors is not zero and the drive has failed, replace the drive. If the
counter is not zero and the drive is OK (has not failed), there may be an intermittent problem
that requires drive replacement. If you observe that the count is increasing over time, replace
the drive.

Failed Recovery Writes—indicates whether write errors occurred while Automatic Data Recovery
was being performed to this physical drive. If a write error occurs, Automatic Data Recovery
stops. These errors indicate that the physical drive has failed.

If the number of fail recov writes is not zero and the drive has failed, replace the drive. If the
counter is not zero and the drive is OK (has not failed), there may be an intermittent problem
that requires drive replacement. If you observe that the count is increasing over time, replace
the drive.

Self-Test Errors—Indicates if a physical drive failed its self test. The physical drive does a self
test each time the system is turned on.

If the number of self-test errors is not zero and the drive has failed, replace the drive. If the
counter is not zero and the drive is OK (has not failed), there may be an intermittent problem
that requires drive replacement. If you observe that the count is increasing over time, replace
the drive.

The above information is available for those drives that have been stamped with monitoring and
performance data enabled. The drive was stamped when it left the factory.

Statistics

This section displays statistics about a specific drive array controller physical drive. You can use
the run-time statistics to monitor the health of a specific drive.

The following information displays:

Sectors Read—Shows the total number of sectors read from the physical drive since the drive
was stamped. The drive was stamped when it left the factory.

Hard Read Errors—Displays the number of read errors that could not be recovered by a
physical drive's Error Correction Code (ECC) algorithm or through retries. Over time, a drive
may produce these errors. If you receive these errors, a problem may exist with your drive.

The severity of these errors depends on whether the managed system is running in a fault
tolerant mode. With fault tolerance, the controller can remap data to eliminate the problems
caused by these errors.

54

Agent information