6 debugging applications, 7 tuning applications, 8 using slurm – HP XC System 3.x Software User Manual
Page 5: 9 using lsf
SSH and TotalView.....................................................................................................................54
Setting Up TotalView...................................................................................................................54
Using TotalView with SLURM........................................................................................................54
Using TotalView with LSF-HPC......................................................................................................55
Setting TotalView Preferences.......................................................................................................55
Debugging an Application..........................................................................................................55
Debugging Running Applications..................................................................................................56
Exiting TotalView........................................................................................................................57
Building a Program — Intel Trace Collector and HP-MPI........................................................................59
Running a Program – Intel Trace Collector and HP-MPI..........................................................................60
Visualizing Data – Intel Trace Analyzer and HP-MPI..............................................................................60
Monitoring Jobs with the squeue Command..............................................................................................64
Terminating Jobs with the scancel Command............................................................................................65
Getting System Information with the sinfo Command..................................................................................65
Job Accounting.....................................................................................................................................65
Fault Tolerance.....................................................................................................................................66
Security...............................................................................................................................................66
Overview of LSF-HPC..................................................................................................................68
Differences Between LSF-HPC and Standard LSF..............................................................................69
Job Terminology.........................................................................................................................70
HP XCCompute Node Resource Support........................................................................................71
Notes on LSF-HPC.......................................................................................................................72
How LSF-HPC and SLURM Launch and Manage a Job.....................................................................73
Notes About Using LSF-HPC in the HP XC Environment....................................................................74
Job Startup and Job Control....................................................................................................74
Preemption...........................................................................................................................75
Getting the Status of LSF-HPC.......................................................................................................75
Getting Information About LSF Execution Host Node.......................................................................75
Getting Host Load Information......................................................................................................76
Examining LSF-HPC System Queues...............................................................................................76
Table of Contents
5