beautypg.com

4 examining system queues, 5 getting information about the lsf partition, 11 getting information about jobs – HP XC System 3.x Software User Manual

Page 92: 1 getting job allocation information, Examining system, Getting information

background image

In the previous example output, the LSF execution host (lsfhost.localdomain) is listed under the
HOST_NAME

column. The status is listed as ok, indicating that it can accept remote jobs. The ls column

shows the number of current login users on this host.

See the OUTPUT section of the lsload manpage for further information about the output of this example.
In addition, see the Platform Computing Corporation LSF documentation and the lsload(1) manpage for
more information about the features of this command.

For individual compute node load information, see the discussion on metrics in shownode(1).

10.10.4 Examining System Queues

All jobs on the HP XC system that are submitted to LSF-HPC are placed into an LSF-HPC job queue.HP
recommends that you examine the status and availability of LSF-HPC system queues before launching a
job so that you can select the most appropriate queue for your job.

You can easily verify the status, limits, and configurations of LSF-HPC queues with the bqueues command.
This command is fully described in Platform Computing Corporation's LSF documentation and manpages.

See bsub(1) for more information on submitting jobs to specific queues.

For more information on the bqueues command, see bqueues(1).

10.10.5 Getting Information About the lsf Partition

Information about the SLURM lsf compute node partition can be viewed with the SLURM sinfo
command. A partition is one or more compute nodes that have been grouped together. A SLURM lsf
partition is created when the HP XC system is installed. This partition contains the resources that will be
managed by LSF-HPC and available for jobs submitted to LSF-HPC.

The sinfo command reports the state of the lsf partition and all other partitions on the system. The
sinfo

command displays a summary of available partition and node information such as partition names,

nodes/partition, and cores/node). It has a wide variety of filtering, sorting, and formatting options.

The following example shows the use of the sinfo command to obtain lsf partition information:

$ sinfo -p lsf

PARTITION AVAIL TIMELIMIT NODES STATE NODELIST

lsf up infinite 128 idle n[1-128]

Use the following command to obtain more information on the nodes in the lsf partition:

$ sinfo -p lsf -lNe

NODELIST NODES PARTITION STATE CPUS MEMORY TMP_DISK WEIGHT FEATURES REASON

n[1-128] 128 lsf idle 2 3456 1 1 (null) none

See

“Getting System Information with the sinfo Command”

and the sinfo(1) manpage and for further

information about using the sinfo command.

10.11 Getting Information About Jobs

There are several ways you can get information about a specific job after it has been submitted to LSF-HPC
integrated with SLURM. This section briefly describes some of the commands that are available under
LSF-HPC integrated with SLURM to gather information about a job. This section is only intended to give
you an idea of the commonly used commands and to describe any differences there may be in their
operation in the HP XC environment, not as a complete reference on this topic. See the LSF manpages for
full information about the commands described in this section.

The following LSF commands are described in this section:

bjobs

“Examining the Status of a Job”

bhist

“Viewing the Historical Information for a Job”

10.11.1 Getting Job Allocation Information

Before a job runs, LSF-HPC integrated with SLURM allocates SLURM compute nodes based on job resource
requirements.

92

Using LSF-HPC