Troubleshooting, Event monitor tools, 5 troubleshooting – HP StorageWorks 2500 Disk System User Manual
Page 45
5 Troubleshooting
This chapter provides information to help you troubleshoot DS2500 error conditions. Topics include:
Event monitor tools
The EMS Disk Monitor (disk_em) and the HP High Availability Storage System Monitor (ds_ses_enclosure)
monitor the disks in the DS2500. For information about these monitoring tools, see HP EMS Hardware
Monitors User’s Guide at the following web site:
The monitoring tools report changes in disk status by sending an e-mail to the root account. These
messages provide early notification of a disk problem. Events reported include media errors, failed read
and write attempts, invalid commands, changed operating parameters, and failed diagnostics. Event are
assigned to one of the following severity levels:
•
Critical—An event that causes data loss, host system downtime, or other loss of service. Continued
use the enclosure will negatively impact host system operation. Immediate action is required.
•
Serious—An event that can cause data loss, host system downtime, or other loss of service if left
uncorrected. Host system and hardware operation may be adversely affected. The problem needs
repair as soon as possible.
•
Warning—An event that could escalate to a serious condition if not corrected. Host system operation
should not be affected and normal use of the enclosure can continue. Repair is needed but at a
convenient time.
•
Information—An event that is expected as part of the normal operation of the hardware. No action
is required.
shows an example of a hardware event notification. The notification provides the following
types of information:
•
Message data—Date and time the message was sent, the source and destination of the message,
and the severity level of the event.
•
Event data—Date and time of the event, the host, event ID, name of the monitor, event number, event
class, severity level, hardware path,and associated OS error log entry ID.
•
Description of Error—Narrative information indicating the component that experienced the event
and the nature of the event.
•
Probable Cause / Recommended Action—The cause of the event and suggested steps toward a
solution. This information should be the first step in troubleshooting.
2500 Disk System user guide
45