bmig

bmig

migrates checkpointable or rerunnable jobs

Synopsis

bmig [-f] [job_ID "job_ID[index_list]"] ...

bmig [-f][-Jjob_name] [-m "host_name ..." -m "host_group ..."] [-uuser_name -uuser_group -u all] [0]

bmig [-h -V]

Description

Migrates one or more of your checkpointable or rerunnable jobs to a different host. You can migrate only running or suspended jobs; you cannot migrate pending jobs. Members of a chunk job in the WAIT state can be migrated; LSF removes waiting jobs from the job chunk and changes their original dispatch sequence.

By default, migrates the most recently submitted job, or the most recently submitted job that also satisfies other specified options (-uand -J). Specify 0 (zero) to migrate multiple jobs. Only LSF administrators and root can migrate jobs submitted by other users. Both the original and the new hosts must:

Be binary compatible

Run the same dot version of the operating system for predictable results

Have network connectivity and read/execute permissions to the checkpoint and restart executables (in LSF_SERVERDIR by default)

Have network connectivity and read/write permissions to the checkpoint directory and the checkpoint file

Have access to all files open during job execution so that LSF can locate them using an absolute path name

When you migrate a checkpointable job, LSF checkpoints and kills the job and then restarts the job on the next available host. If checkpoint fails, the job continues to run on the original host. If you issue the bmig command while a job is being checkpointed—for example, with periodic checkpointing enabled—LSF ignores the migration request.

When you migrate a rerunnable job, LSF kills the job and then restarts it from the beginning on the next available host. LSF sets the environment variable LSB_RESTART to Y when a migrating job restarts or reruns.

NOTE: The user does not receive notification when LSF kills a checkpointable or rerunnable job as part of job migration.

In a MultiCluster environment, you must use brun rather than bmig to move a job to another host.

When absolute job priority scheduling (APS) is configured in the queue, LSF always schedules migrated jobs before pending jobs. For migrated jobs, LSF keeps the existing job priority. If LSB_REQUEUE_TO_BOTTOM and LSB_MIG2PEND are

116Platform LSF Command Reference