6 debugging applications, 7 monitoring node activity, 8 tuning applications – HP XC System 4.x Software User Manual
Page 5: 9 using slurm
5.2.1 Submitting a Serial Job with the LSF bsub Command............................................................49
5.2.2 Submitting a Serial Job Through SLURM Only......................................................................50
5.3.1 Submitting a Non-MPI Parallel Job.........................................................................................51
5.3.2 Submitting a Parallel Job That Uses the HP-MPI Message Passing Interface.........................52
5.3.3 Submitting a Parallel Job Using the SLURM External Scheduler...........................................53
5.4 Submitting a Batch Job or Job Script...............................................................................................56
5.5 Submitting Multiple MPI Jobs Across the Same Set of Nodes........................................................58
6.2.1.1 SSH and TotalView..........................................................................................................64
6.2.1.2 Setting Up TotalView......................................................................................................64
6.2.1.3 Using TotalView with SLURM........................................................................................65
6.2.1.4 Using TotalView with LSF...............................................................................................65
6.2.1.5 Setting TotalView Preferences.........................................................................................65
6.2.1.6 Debugging an Application..............................................................................................66
6.2.1.7 Debugging Running Applications..................................................................................67
6.2.1.8 Exiting TotalView............................................................................................................67
7.1 The Xtools Utilities..........................................................................................................................69
7.2 Running Performance Health Tests.................................................................................................70
8.1.1 Building a Program — Intel Trace Collector and HP-MPI......................................................75
8.1.2 Running a Program – Intel Trace Collector and HP-MPI.......................................................76
9.4 Monitoring Jobs with the squeue Command..................................................................................82
9.5 Terminating Jobs with the scancel Command.................................................................................83
9.6 Getting System Information with the sinfo Command...................................................................83
Table of Contents
5