10.10.1 Examining System Core Status

The bhosts command displays LSF resource usage information. This command is useful to examine the status of the system cores. The bhosts command provides a summary of the jobs on the system and information about the current state of LSF. For example, it can be used to determine if LSF is ready to start accepting batch jobs.

LSF daemons run on only one node in the HP XC system, so the bhosts command will list one host, which represents all the resources of the HP XC system. The total number of cores for that host should be equal to the total number of cores assigned to the SLURM lsf partition.

By default, this command returns the host name, host status, and job state statistics.

The following example shows the output from the bhosts command:

$ bhosts

 

 

 

 

 

 

 

 

HOST_NAME

STATUS JL/U

MAX

NJOBS RUN SSUSP USUSP RSV

lsfhost.localdomain

ok

-

16

0

0

0

0

0

Of note in the bhosts output:The HOST_NAME column displays the name of the LSF execution host.

The MAX column displays the total core count (usable cores) of all available computer nodes in the lsf partition.

The STATUS column shows the state of LSF and displays a status of either ok or closed.

The NJOBS column displays the number of jobs. Note that in LSF terminology, a parallel job with 10 tasks counts as 10 jobs.

10.10.2Getting Information About the LSF Execution Host Node

The lshosts command displays resource information about the LSF cluster. This command is useful for verifying machine-specific information.

LSF daemons run on only one node in the HP XC system, so the lshosts command will list one host — which represents all the resources assigned to it by the HP XC system. The total number of cores for that host should be equal to the total number of cores assigned to the SLURM lsf partition.

By default, lshosts returns the following information: host name, host type, host model, core factor, number of cores, total memory, total swap space, server information, and static resources.

The following example shows the output from the lshosts command:

$ lshosts

 

 

 

 

 

 

 

 

HOST_NAME

type

model

cpuf ncpus maxmem maxswp server

RESOURCES

lsfhost.loc

SLINUX6

Itanium2

16.0

12

3456M

-

Yes

(slurm)

n7

UNKNOWN

UNKNOWN_

1.0

-

-

-

No

()

n8

UNKNOWN

UNKNOWN_

1.0

-

-

-

No

()

n2

UNKNOWN

UNKNOWN_

1.0

-

-

-

No

()

Of note in the lshosts output:

The HOST_NAME column displays the name of the LSF execution host, lsfhost.localdomain and any other HP XC nodes that have been granted a floating client license because LSF commands were executed on them. LSF does not know about these floating client hosts, so they are listed as UNKNOWN types and models.

The type column displays the type of resource. This value is SLINUX64 for all HP XC systems.

The ncpus column displays the total core count (usable cores) of all available computer nodes in the lsf partition.

The maxmem column displays minimum maxmem over all available computer nodes in the lsf partition.

The maxtmp column (not shown) displays minimum maxtmp over all available computer nodes in the lsf partition. Use the lshosts -lcommand to display this column.

10.10 Determining Available System Resources

95