HP XC System 2.x Software manual Getting Information About the System’s Partitions, Launching Jobs

Models: XC System 2.x Software

1 154
Download 154 pages 20.06 Kb
Page 34
Image 34

The LSF lshosts command displays machine-specific information for the LSF execution host node.

$ lshosts

Refer to Section 7.3.2 for more information about using this command and a sample of its output.

The LSF lsload command displays load information for the LSF execution host node.

$ lsload

Refer to Section 7.3.3 for more information about using this command and a sample of its output.

2.3.4Getting Information About the System’s Partitions

Information about the system’s partitions can be viewed with the SLURM sinfo command. The sinfo command reports the state of all partitions and nodes managed by SLURM and provides a wide variety of filtering, sorting, and formatting options. sinfo displays a summary of available partition and node (not job) information such as partition names, nodes/partition, and CPUs/node).

$ sinfo

Refer to Section 7.3.5 for more information about using the sinfo command and a sample of its output.

2.3.5 Launching Jobs

To launch a job on an HP XC system, use the LSF bsub command. The bsub command submits batch jobs or interactive batch jobs to an LSF queue for execution.

This section provides some brief examples of how to launch some typical serial and parallel jobs. Refer to Section 7.4 for full information about launching jobs with the bsub command.

2.3.5.1 Submitting a Serial Job

Submitting serial jobs is discussed in detail in Section 7.4.3. The command format to submit a serial job is:

bsub [bsub-options] [srun [srun-options] ] executable [executable-options]

Use the LSF bsub command to submit a job on the LSF-HPC execution host. The SLURM srun job launch command is only needed if the LSF-HPC JOB_STARTER script is

not configured for the intended queue (but can be used regardless of whether or not the script is configured). You can use the bqueues command to confirm whether or not the JOB_STARTER script exists.The executable parameter is the name of an executable file or command.

The use of srun is discussed in detail in Chapter 6.

Consider an HP XC configuration where lsfhost.localdomain is the LSF execution host, and nodes n[1-10]are compute nodes in the LSF partition. All nodes contain two processors, providing 20 processors for use by LSF jobs. The following example shows one way to submit a serial job on this system:

Example 2-1: Submitting a Serial Job

$ bsub -I srun hostname

Job <20> is submitted to default queue <normal>. <<Waiting for dispatch ...>>

<<Starting on lsfhost.localdomain>> n1

2-8Using the System

Page 34
Image 34
HP XC System 2.x Software manual Getting Information About the System’s Partitions, Launching Jobs, Submitting a Serial Job