Statistics – HP Insight Management Agents User Manual

Page 47

Agent information 47

If the number of hardware errors is not zero and the drive has failed, replace the drive. If the counter is not zero

and the drive is OK (has not failed), there may be an intermittent problem that requires drive replacement. If you

observe that the count is increasing over time, replace the drive.

•

Not Ready Errors—When a physical drive returns a not ready status when it should be ready, a Drive Not

Ready Error occurs. This error could occur if a drive spins down unexpectedly or if the drive never becomes

ready after the spin up command is issued.
If the number of not ready errors is not zero and the drive has failed, replace the drive. If the counter is not zero

and the drive is OK (has not failed), there may be an intermittent problem that requires drive replacement. If you

observe that the count is increasing over time, replace the drive.

•

Bad Target Errors—When a physical drive performs an action that does not conform to the SCSI-2 port protocol,

the SCSI port is reset.
If the number of bad target errors is not zero and the drive has failed, replace the drive. If the counter is not

zero and the drive is OK (has not failed), there may be an intermittent problem that requires drive replacement.

If you observe that the count is increasing over time, replace the drive.

•

Failed Recovery Writes—indicates whether write errors occurred while Automatic Data Recovery was being

performed to this physical drive. If a write error occurs, Automatic Data Recovery stops. These errors indicate

that the physical drive has failed.
If the number of fail recov writes is not zero and the drive has failed, replace the drive. If the counter is not zero

and the drive is OK (has not failed), there may be an intermittent problem that requires drive replacement. If you

observe that the count is increasing over time, replace the drive.

•

Self-Test Errors—Indicates if a physical drive failed its self test. The physical drive does a self test each time the

system is turned on.
If the number of self-test errors is not zero and the drive has failed, replace the drive. If the counter is not zero

and the drive is OK (has not failed), there may be an intermittent problem that requires drive replacement. If you

observe that the count is increasing over time, replace the drive.

The above information is available for those drives that have been stamped with monitoring and performance data

enabled. The drive was stamped when it left the factory.

Statistics

This section displays statistics about a specific drive array controller physical drive. You can use the run-time statistics

to monitor the health of a specific drive.
The following information displays:

•

Sectors Read—Shows the total number of sectors read from the physical drive since the drive was stamped. The

drive was stamped when it left the factory.

•

Hard Read Errors—Displays the number of read errors that could not be recovered by a physical drive's Error

Correction Code (ECC) algorithm or through retries. Over time, a drive may produce these errors. If you receive

these errors, a problem may exist with your drive.
The severity of these errors depends on whether the managed system is running in a fault tolerant mode. With

fault tolerance, the controller can remap data to eliminate the problems caused by these errors.

•

Recovered Read Errors—Displays the number of read errors corrected through physical drive retries. Over time,

all drives produce these errors. If you notice a rapid increase in the value for Recovered Read Errors or Hard

Read Errors, a problem may exist with the drive. Expect more errors for this monitored item than for Hard Read

Errors.

•

Total Seeks—Displays the total number of seek operations during seek tests performed by the physical drive

since the drive was stamped. The drive was stamped when it left the factory.
During normal reads and writes to the drive, the drive implied seeks to the location where data resides. These

are not included in this count.

•

Seek Errors—Displays the number of seek errors that a physical drive detects. A seek error is a seek that failed.

Over time, a drive usually produces these errors. If you notice a rapid increase in the value shown for Seek

Errors, this physical drive may be failing. Only an unusually rapid increase in these errors indicates a problem.

•

Sectors Written—Displays the total number of sectors written to the physical drive since the drive was stamped.

The drive was stamped when it left the factory.

•

Hard Write Errors—Displays the number of write errors that could not be recovered by a physical drive. Over

time, a drive may produce these errors. If you notice an increase in the value shown for Hard Write Errors or

Recovered Write Errors, a problem may exist with the drive. The counter value increases every time the physical

drive detects another error. On average, these errors should occur less frequently than read errors.