
Correctable errors, Power-on messages, Integrated management log – HP Insight Management Agents User Manual


NMI-System Concurrency Error—A potential error condition was detected within the Data
Flow Manager, resulting in a system failure.

NMI-Uncorrectable Memory Error—The device experienced an uncorrectable memory parity
error resulting in a device failure.

NMI-Unknown Error Type—The device driver does not recognize this NMI. The health driver
might need to be updated.

Processor Failure—The processor failed during the POST.

Server Manager Failure—An error occurred in the server interface with the Server Manager.

UPS A/C Line Failure/Shutdown or Battery Low—The device has initiated a UPS or operating
system shutdown, or the battery is almost depleted after an AC line failure.

The Last Failure Message field in this window displays the most recent failure message associated
with a critical error.

Correctable errors

This alarm indicates that a block of memory has failed or is failing and might need to be replaced.
The condition is generally non-critical because the memory controller can correct the problem, and
the system continues to correct any errors it can. However, it still means that a memory component
in the system issuing the alarm is failing or has failed.

Memory errors are corrected by the ECC memory subsystem when they occur. If these errors
increase, correct the problems as soon as possible. The memory components might degrade
further, after which the errors might no longer be correctable.
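
As a rough illustration of what "errors increase" can mean in practice, the following Python sketch compares successive readings of a correctable-error count. It is not part of the Insight Management Agents. The function names, the idea of sampling a cumulative count at intervals, and the threshold parameter are all assumptions made for this example.

```python
def new_errors_per_interval(cumulative_counts):
    """Return the number of new correctable errors between consecutive samples.

    cumulative_counts is a hypothetical list of running totals, oldest first,
    for example values noted down from the agent display at regular intervals.
    """
    return [later - earlier
            for earlier, later in zip(cumulative_counts, cumulative_counts[1:])]


def errors_are_increasing(cumulative_counts, threshold=0):
    """Return True if any interval added more than `threshold` new errors.

    The default threshold of 0 (any new error counts) is an assumption,
    not a value taken from the manual.
    """
    return any(delta > threshold
               for delta in new_errors_per_interval(cumulative_counts))
```

With readings of 3, 3, 5, and 9, the per-interval values are 0, 2, and 4, so the check reports an increase. A series that stays at 3 does not.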

Power-On Messages

The Power-On Messages section displays the Power-On messages logged when the device was
turned on. For a listing of possible Power-On error messages and their meanings, see the device
documentation. Click the Clear Power-On Message button to clear the Power-On message log.
This button is available only if there are messages to clear.

Integrated Management Log

The Integrated Management Log records system events, critical errors, power-on message errors,
and memory errors. The log also records catastrophic hardware and software errors that typically
cause a system to fail. This information helps you quickly identify and correct the problem and
minimize downtime.

Each event log entry has a status to identify the severity of the event:

Informational—General information about a system event.

Repaired—An entry has been repaired. Users must mark entries as repaired.

Caution—A non-fatal error condition has occurred.

Critical—A component of the system has failed.

If any events in the log have a condition of Caution, the overall log condition is marked as degraded.
If Critical events exist in the log, the overall log condition is marked as failed.

To clear a degraded or failed event log, repair the condition that caused a log entry to be
generated, and then mark the log entry as repaired. Perform the following steps:

1. Highlight the log entries in the Integrated Management Log.

2. Click the Mark Repaired button. This button is located at the bottom of the Integrated
Management Log section of the Web browser.
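
The severity rules and the repair steps above can be modeled with a short Python sketch. It only illustrates the logic described in this section and is not the agents' own code. The `Severity` enum, both function names, and the condition name "ok" for a log with no Caution or Critical entries are assumptions made for this example.

```python
from enum import Enum


class Severity(Enum):
    """The four entry statuses listed in this section."""
    INFORMATIONAL = "Informational"
    REPAIRED = "Repaired"
    CAUTION = "Caution"
    CRITICAL = "Critical"


def overall_log_condition(severities):
    """Derive the overall log condition from the entry severities.

    Critical entries take precedence over Caution entries.
    """
    if any(s is Severity.CRITICAL for s in severities):
        return "failed"
    if any(s is Severity.CAUTION for s in severities):
        return "degraded"
    return "ok"  # assumed name; the manual does not name the healthy state


def mark_repaired(severities, indexes):
    """Return a copy of the log with the selected entries set to Repaired."""
    chosen = set(indexes)
    return [Severity.REPAIRED if i in chosen else s
            for i, s in enumerate(severities)]
```

In this model, a log holding one Caution and one Critical entry is failed. Marking the Critical entry as repaired leaves it degraded, and marking both leaves no Caution or Critical entries.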
