beautypg.com

HP Insight Control Software for Linux User Manual

Page 222

background image

Figure 35 Sample Nagios messages

Messages are categorized in the Status column as OK, Unknown, Pending, Warning, and
Critical

and are color-coded.

The messages described in this section are indexed by the Service and Status Information columns.
The messages in this section are arranged alphabetically by the Service column entry. If there is
a warning or critical message, find the information for that service in the Status Information column
and apply it to the specified Nagios host.

NOTE:

The following messages are based on the default values. The messages differ if the

messages in the /opt/hptc/nagios/etc/misccommands.cfg file were changed.

Service: Apache HTTPS Server

Status Information: HTTPS performance information

Displays the status of the Apache

HTTPS

web server on the CMS.

Service: Configuration Monitor

Status Information: Node information

This message reports the total number of managed systems, the number of managed systems enabled,
the number of managed systems disabled, and the number of managed systems imaged.

No action is required.

Service: Environment

Status Information: Node sensor status

A warning or critical message indicates that one or more monitored sensors reported that a threshold
was exceeded.

Correct the condition.

Service: Load Average

Status Information: Node Load Ave: x/y/z QueLen: n

A warning or critical message indicates that load average thresholds for the specific managed system
were exceeded.

Thresholds can be set on a per-managed system, per-class, or per-system basis in the nagios_vars.ini
file. These values are specific to the site and depend on site load.

If the load average thresholds are reasonable, monitor for excessive activity on the managed system.

Service: Nagios Monitor

Status Information: Nagios status information

Typically, the status of Nagios, the number of Nagios services located, and the last time the Nagios
status log was updated.

A warning or critical message indicates that one or more of the Nagios monitor processes either failed
or reported error conditions that can degrade monitoring.

Ensure that the managed system can communicate with the CMS.

Service: Nodeinfo

Status Information: Node process status total/user/zombie , uptime

Displays the total number of processes, the number of user processes, the number of Zombie processes,
and the uptime for the Nagios host.

222 Troubleshooting