beautypg.com

Getting information about system partitions, Launching jobs, Getting information about your jobs – HP XC System 3.x Software User Manual

Page 29: Stopping and suspending jobs, Resuming suspended jobs, Performing other common user tasks

background image

$ lsload

For more information about using this command and a sample of its output, see

Getting Host Load

Information (page 76)

.

Getting Information About System Partitions

You can view information about system partitions with the SLURM sinfo command. The sinfo command
reports the state of all partitions and nodes managed by SLURM and provides a wide variety of filtering,
sorting, and formatting options. sinfo displays a summary of available partition and node (not job)
information, such as partition names, nodes per partition, and cores per node).

$ sinfo

For more information about using the sinfo command and a sample of its output, see

Getting Information

About the lsf Partition (page 76)

.

Launching Jobs

To launch a job on an HP XC system, use the LSF bsub command. The bsub command submits batch jobs
or interactive batch jobs to an LSF-HPC queue for execution.

See

Submitting Jobs (page 45)

for full information about launching jobs.

Getting Information About Your Jobs

Use the LSF bjobs and bhist commands to obtain information about your running or completed jobs:

Use the bjobs command to examine the status of a running job (

Examining the Status of a Job (page

81)

).

Use the bhist command to obtain brief or full information about finished jobs (

Viewing the Historical

Information for a Job (page 82)

)

You can view the components of the actual SLURM allocation command with the LSF bjobs -l and bhist
-l

commands.

Stopping and Suspending Jobs

You can suspend or stop your jobs with the bstop and bkill commands:

Use the bstop command to stop or suspend an LSF-HPC job.

Use the bkill command to kill an LSF-HPC job.

Resuming Suspended Jobs

Use the LSF bresume command to resume a stopped or suspended job.

Performing Other Common User Tasks

This section contains general information about using the HP XC system.

Determining the LSF Cluster Name and the LSF Execution Host

The lsid command returns the LSF cluster name, the LSF-HPC version, and the name of the LSF execution
host:

$ lsid

Platform LSF HPC 6.1 for SLURM, date and time stamp

Copyright 1992-2005 Platform Computing Corporation

My cluster name is hptclsf

My master name is lsfhost.localdomain

In this example, hptclsf is the LSF cluster name, and lsfhost.localdomain is the name of the virtual
IP address used by the node where LSF-HPC is installed and running (LSF execution host).

Performing Other Common User Tasks

29