The squeue command can report on jobs in the job queue according to their state; valid states are: pending, running, completing, completed, failed, timeout, and node_fail. Example
$ squeue |
|
|
|
|
| ||
JOBID | PARTITION | NAME | USER | ST | TIME | NODES | NODELIST |
59 | amt1 | hostname | root | F | 0:00 | 0 |
|
|
|
|
|
|
|
|
|
6.6 Killing Jobs with the scancel Command
The scancel command cancels a pending or running job or job step. It can also be used to send a specified signal to all processes on all nodes associated with a job. Only job owners or administrators can cancel jobs.
Example
$ scancel 415
Example
$ scancel
Example
$ scancel
6.7 Getting System Information with the sinfo Command
The sinfo command reports the state of partitions and nodes managed by SLURM. It has a wide variety of filtering, sorting, and formatting options. sinfo displays a summary of available partition and node (not job) information (such as partition names, nodes/partition, and CPUs/node).
Example$ sinfo |
|
|
|
|
|
PARTITION AVAIL TIMELIMIT NODES | STATE | NODELIST | |||
lsf | up | infinite | 1 | down* | n15 |
lsf | up | infinite | 2 | idle | n[14,16] |
|
|
|
|
|
|
Using SLURM