Sample UNIX installation directories

Daemon error log files

Daemon error log files are stored in the directory defined by LSF_LOGDIR in lsf.conf.

LSF base system daemon log files

LSF batch system daemon log files

 

 

pim.log.host_namembatchd.log.host_name

res.log.host_namesbatchd.log.host_name

lim.log.host_namembschd.log.host_name

If EGO_LOGDIR is defined in ego.conf, The file lim.log.host_name is stored in the directory defined by EGO_LOGDIR.

Configuration files

lsf.conf, lsf.shared, and lsf.cluster.cluster_name are located in LSF_CONFDIR.

lsb.params, lsb.queues, lsb.modules, and lsb.resources are located in LSB_CONFDIR/ cluster_name/configdir/.

File

Description

install.config

Options for Platform LSF installation and configuration

 

 

lsf.conf

Generic environment configuration file describing the

 

configuration and operation of the cluster

lsf.shared

Definition file shared by all clusters. Used to define cluster

 

name, host types, host models and site-defined resources

lsf.cluster.cluster_name Cluster configuration files used to define hosts, administrators, and locality of site-defined shared resources

lsf.licensescheduler

Configures Platform LSF License Scheduler

 

 

lsb.applications

defines application profiles to define common parameters for

 

the same type of jobs

lsb.params

Configures LSF batch parameters

 

 

File

Description

lsb.queues

Batch queue configuration file

lsb.resources

Configures resource allocation limits, exports, and resource

 

usage limits

 

 

lsb.serviceclasses

Defines service-level agreements (SLAs) in an LSF cluster as

 

service classes, which define the properties of the SLA

lsb.users

Configures user groups, hierarchical fairshare for users and

 

user groups, and job slot limits for users and user groups

Cluster configuration parameters (lsf.conf)

Variable

Description

UNIX Default

LSF_BINDIR

Directory containing LSF user commands,

LSF_TOP/version/

 

shared by all hosts of the same type

platform/bin

 

 

 

LSF_CONFDIR

Directory for all LSF configuration files

LSF_TOP/conf

LSF_ENVDIR

Directory containing the lsf.conf file, must

/etc (if LSF_CONFDIR

 

be owned by root

is not defined)

 

 

LSF_INCLUDEDIR Directory containing LSF API header files

LSF_TOP/version/

 

lsf.h and lsbatch.h

include

LSF_LIBDIR

LSF libraries, shared by all hosts of the

LSF_TOP/version/

 

same type

platform/lib

 

 

 

LSF_LOGDIR

(Optional) Directory for LSF daemon logs,

/tmp

 

must be owned by root

 

LSF_LOG_MASK

Specifies the logging level of error

LOG_WARNING

 

messages from LSF commands

 

 

 

 

LSF_MANDIR

Directory containing LSF man pages

LSF_TOP/version/man

LSF_MISC

Help files for the LSF GUI tools, sample C

LSF_TOP/version/

 

programs and shell scripts, and a template

misc

 

for an external LIM (elim)

 

 

 

 

LSF_SERVERDIR

Directory for all server binaries and shell

LSF_TOP/version/

 

scripts, and external executables invoked

platform/etc

 

by LSF daemons, must be owned by root,

 

 

and shared by all hosts of the same type

 

LSF_TOP

Top-level installation directory. The path to

Not defined

 

LSF_TOP must be shared and accessible

Required for

 

to all hosts in the cluster. It cannot be the

installation

 

root directory (/).

 

 

 

 

LSB_CONFDIR

Directory for LSF Batch configuration

LSF_CONFDIR/

 

directories, containing user and host lists,

lsbatch

 

operation parameters, and batch queues

 

LSB_SHAREDIR

Directory for LSF Batch job history and

LSF_TOP/work

 

accounting log files for each cluster, must

 

 

be owned by primary LSF administrator

 

 

 

 

LSF_LIM_PORT

TCP service port used for communication

6879

 

with lim

 

LSF_RES_PORT

TCP service port used for communication

6878

 

with res

 

 

 

 

LSB_MBD_PORT

TCP service port used for communication

6881

 

with mbatchd

 

LSB_SBD_PORT

TCP service port used for communication

6882

 

with sbatchd

 

 

 

 

P l a t f o r m L S F ™

Q u i c k R e f e r e n c e

Version 7 Update 3

Administration and accounting commands

Only LSF administrators or root can use these commands.

Command

Description

lsacct

Displays accounting statistics on finished RES tasks in the LSF system

 

 

lsadmin

LSF administrative tool to control the operation of the LIM and RES

 

daemons in an LSF cluster. lsadmin help shows all subcommands.

lsfinstall

Install LSF using install.config input file

 

 

lsfrestart

Restart the LSF daemons on all hosts in the local cluster

lsfshutdown

Shut down the LSF daemons on all hosts in the local cluster

 

 

lsfstartup

Start the LSF daemons on all hosts in the local cluster

bacct

Reports accounting statistics on completed LSF jobs

 

 

badmin

LSF administrative tool to control the operation of the LSF Batch

 

system including sbatchd, mbatchd, hosts and queues. badmin help

 

shows all subcommands.

bladmin

reconfigures the Platform LSF License Scheduler daemon (bld)

Daemons

Executable Name

Description

lim

Load Information Manager (LIM)—collects load and resource

 

information about all server hosts in the cluster and provides host

 

selection services to applications through LSLIB. LIM maintains

 

information on static system resources and dynamic load indices.

 

 

mbatchd

Master Batch Daemon (MBD)—accepts and holds all batch jobs.

 

MBD periodically checks load indices on all server hosts by

 

contacting the Master LIM.

mbschd

Master Batch Scheduler Daemon— performs the scheduling

 

functions of LSF and sends job scheduling decisions to MBD for

 

dispatch. Runs on the LSF master server host.

 

 

sbatchd

Slave Batch Daemon (SBD)—accepts job execution requests

 

from MBD, and monitors the progress of jobs. Controls job

 

execution, enforces batch policies, reports job status to MBD, and

 

launches MBD.

pim

Process Information Manager (PIM)—monitors resources used

 

by submitted jobs while they are running. PIM is used to enforce

 

resource limits and load thresholds, and for fairshare scheduling.

 

 

res

Remote Execution Server (RES)—accepts remote execution

 

requests from all load sharing applications and handles I/O on the

 

remote host for load sharing processes.