$ sinfo -p lsf

PARTITION AVAIL TIMELIMIT NODES STATE NODELIST

lsf

up infinite

128

idle n[1-128]

Use the following command to obtain more information on the nodes in the lsf partition:

$ sinfo -p lsf -lNe

 

 

 

 

 

 

 

NODELIST

NODES

PARTITION STATE CPUS

MEMORY

TMP_DISK

WEIGHT

FEATURES

REASON

n[1-128]

128

lsf idle

2

3456

 

1

1

(null)

none

Refer to "Getting System Information with the sinfo Command"

and the sinfo(1) manpage and for further

information about using the sinfo command.

 

 

 

 

 

 

 

Submitting Jobs

The bsub command submits jobs to the LSF-HPC system. This section focuses on enhancements to the bsub command from the LSF-HPC integration with SLURM on the HP XC system; this section does not discuss standard bsub functionality or flexibility. See the Platform LSF documentation and the bsub(1) manpage for more information on this important command.

This topic of submitting jobs is explored in detail in Chapter Chapter 5. Submitting Jobs and this section refers to some of these examples.

The bsub command and its options, including the external SLURM scheduler, is used to request a set of resources on which to launch a job. See "HP XCCompute Node Resource Support" for an introduction and "Summary of the LSF bsub Command Format" for additional information. The arguments to the bsub command consist of the user job and its arguments. The bsub options allow you to provide information on the amount and type of resources needed by the job.

The basic synopsis of the bsub command is:

bsub [bsub-options] jobname jobname [job-options]

The HP XC system has several features that make it optimal for running parallel applications, particularly (but not exclusively) MPI applications. You can use the bsub command's -nto request more than one core for a job. This option, coupled with the external SLURM scheduler, discussed in "LSF-SLURM External Scheduler" , gives you much flexibility in selecting resources and shaping how the job is executed on those resources.

LSF-HPC, like standard LSF, reserves the requested number of nodes and executes one instance of the job on the first reserved node, when you request multiple nodes. Use the srun command or the mpirun command with the -srunoption in your jobs to launch parallel applications. The -sruncan be set implicitly for the mpirun command; see "Submitting a Parallel Job That Uses the HP-MPI Message Passing Interface" for more information on using the mpirun -sruncommand.

Most parallel applications rely on rsh or ssh to "launch" remote tasks. The ssh utility is installed on the HP XC system by default. If you configured the ssh keys to allow unprompted access to other nodes in the HP XC system, the parallel applications can use ssh. See "Enabling Remote Execution with OpenSSH" for more information on ssh.

Summary of the LSF bsub Command Format

This section provides a summary of the format LSF bsub command on the HP XC system. The bsub command can have the following formats:

bsub

When you invoke the bsub command without any arguments, you are prompted for a command from standard input.

bsub [bsub-options]jobname [job-arguments]

This is the bsub command format to submit a serial job. The srun command is required to run parallel jobs on the allocated compute node. Refer to "Submitting a Serial Job Using LSF-HPC" .

bsub -nnum-procs [bsub-options]jobname [job-arguments]

This is the standard bsub command format to submit a parallel job to the LSF execution host. The jobname parameter can be name of an executable or a batch script. If jobname is executable, job is launched on LSF execution host node. If jobname is batch script (containing srun commands), job is launched on

Using LSF-HPC 77

Page 77
Image 77
HP XC System 3.x Software manual Summary of the LSF bsub Command Format, $ sinfo -p lsf -lNe