beautypg.com

2 limitations, 4 stopping hp insight cmu monitoring, 5 tuning hp insight cmu monitoring – HP Insight Cluster Management Utility User Manual

Page 77: 1 action and alert files, Valid archived user group parameters

background image

5.3.8.2 Limitations

To display an archived user group, the following conditions must be satisfied:

Time must not exceed 24 hours.

The number of nodes must not exceed 4096.

The number of metrics must not exceed 100.

The product of the three parameters above must not exceed 409600.

Table 2 (page 77)

displays examples of valid combinations of these three parameters.

Table 2 Valid archived user group parameters

Nodes*Metrics*Hours

Hours

Metrics

Nodes

409600

10

10

4096

409600

20

5

4096

409600

1

100

4096

307200

12

100

256

393216

24

8

2048

393216

24

16

1024

IMPORTANT:

If the above criteria is not met, display fails with a warning message.

5.4 Stopping HP Insight CMU monitoring

To stop the HP Insight CMU Monitoring GUI, click the X in the upper right corner of the main HP
Insight CMU Monitoring window.

When the Monitoring GUI is stopped, the monitoring engine is not automatically stopped. To stop
the monitoring engine on the cluster, on the toolbar, click the Monitoring tab, and then select Stop
Monitoring Engine.

5.5 Tuning HP Insight CMU monitoring

5.5.1 Action and alert files

Sensors, alerts, and alert reactions are described in the /opt/cmu/etc/
ActionAndAlertsFile.txt

file.

Following is an example of the contents of the file:

#This is a CMU action and alerts description file
#=============================================================
#
#
ACTIONS
#
#
#
#-------------KERNEL VERSION, RELEASE, BIOS VERSIONS---------#
kernel_version "kernel version" 9999999 string Instantaneous release uname -r
#-------------CPU--------------------------------------------#
#
#- Native
cpuload "% cpu load (raw)" 1 numerical MeanOverTime 100 % awk '/cpu /
{printf"%d\n",$2+$3+$4}' /proc/stat
#- Collectl
#cpuload "% cpu load (normalized)" 1 numerical Instantaneous 100 % COLLECTL (cputotals.user) + (cputotals.nice)
+ (cputotals.sys)
#cpuload "% cpu load (normalized)" 1 numerical Instantaneous 100 % COLLECTL 100 - (cputotals.idle)
#
#-------------MEMORY-----------------------------------------#
#
#- Native
#memory_used "% memory used" 1 numerical Instantaneous 100 % free | awk '
BEGIN { freemem=0; totalmemory=0; } /cache:/ { freemem=$4; } /Mem:/ { totalmemory=$2; } END { printf "%d\n",
(((totalmemory-freemem)*100)/totalmemory); }'

5.4 Stopping HP Insight CMU monitoring

77