6 Managing a cluster with HP Insight CMU, 7 Advanced topics – HP Insight Cluster Management Utility User Manual
Page 6
5.5.2 Actions....................................................................................................................78
5.5.3 Alerts.......................................................................................................................79
5.5.4 Alert reactions..........................................................................................................79
5.5.5 Modifying the sensors, alerts, and alert reactions monitored by HP Insight CMU................80
5.5.6 Using collectl for gathering monitoring data..................................................................81
5.5.6.1 Installing and starting collectl on compute nodes....................................................81
5.5.6.2 Modifying the ActionAndAlerts.txt file...................................................................81
5.5.6.3 Installing and configuring colplot for plotting collectl data.......................................83
5.5.7.1 Monitoring NVIDIA GPUs....................................................................................85
5.5.7.2 Monitoring AMD GPUs.......................................................................................86
5.5.7.3 Monitoring Intel coprocessors..............................................................................87
5.5.8 Monitoring HP Insight CMU alerts in HP Systems Insight Manager...................................88
5.5.9 Extended metric support.............................................................................................89
6 Managing a cluster with HP Insight CMU....................................................91
6.1 Unprivileged user menu......................................................................................................91
6.2 Administrator menu...........................................................................................................91
6.3 SSH connection................................................................................................................91
6.4 Management card connection............................................................................................92
6.5 Virtual serial port connection..............................................................................................92
6.6 Shutdown........................................................................................................................92
6.7 Power off.........................................................................................................................92
6.8 Boot................................................................................................................................93
6.9 Reboot............................................................................................................................93
6.10 Change UID LED status.....................................................................................................93
6.11 Multiple windows broadcast..............................................................................................94
6.12 Single window pdsh........................................................................................................94
6.13 Parallel distributed copy (pdcp)..........................................................................................97
6.14 User group management..................................................................................................98
6.14.1 Adding user groups..................................................................................................98
6.14.2 Deleting user groups.................................................................................................99
6.14.3 Renaming user groups..............................................................................................99
6.15.1 Viewing and analyzing BIOS settings..........................................................................99
6.15.2 Checking BIOS versions..........................................................................................100
6.15.3 Installing and upgrading firmware............................................................................100
6.17.1 Starting a CLI interactive session................................................................................101
6.17.2 Basic commands....................................................................................................101
6.17.3 Specifying nodes....................................................................................................103
6.17.4 Administration and cloning commands......................................................................105
6.17.5 Administration utilities pdcp and pdsh.......................................................................111
6.17.6 HP Insight CMU Linux shell commands......................................................................111
7.1.1 Custom menu options for non-root users.......................................................................113
7.1.2 Configuring sudo support..........................................................................................113
7.1.3 Examples.................................................................................................................114