Running Preexecution Programs | 51 |
6 Debugging Applications |
|
Debugging Serial Applications | 53 |
Debugging Parallel Applications | 53 |
Debugging with TotalView | 53 |
SSH and TotalView | 54 |
Setting Up TotalView | 54 |
Using TotalView with SLURM | 54 |
Using TotalView with | 55 |
Setting TotalView Preferences | 55 |
Debugging an Application | 55 |
Debugging Running Applications | 56 |
Exiting TotalView | 57 |
7 Tuning Applications |
|
Using the Intel Trace Collector and Intel Trace Analyzer | 59 |
Building a Program — Intel Trace Collector and | 59 |
Running a Program – Intel Trace Collector and | 60 |
Visualizing Data – Intel Trace Analyzer and | 60 |
8 Using SLURM |
|
Introduction to SLURM | 63 |
SLURM Utilities | 63 |
Launching Jobs with the srun Command | 63 |
The srun Roles and Modes | 64 |
The srun Roles | 64 |
The srun Modes | 64 |
Using the srun Command with | 64 |
Using the srun Command with | 64 |
Monitoring Jobs with the squeue Command | 64 |
Terminating Jobs with the scancel Command | 65 |
Getting System Information with the sinfo Command | 65 |
Job Accounting | 65 |
Fault Tolerance | 66 |
Security | 66 |
9 Using LSF |
|
Using Standard LSF on an HP XC System | 67 |
Using | 67 |
Introduction to | 68 |
Overview of | 68 |
Differences Between | 69 |
Job Terminology | 70 |
HP XCCompute Node Resource Support | 71 |
Notes on | 72 |
How | 73 |
Notes About Using | 74 |
Job Startup and Job Control | 74 |
Preemption | 75 |
Determining the LSF Execution Host | 75 |
Determining Available | 75 |
Getting the Status of | 75 |
Getting Information About LSF Execution Host Node | 75 |
Getting Host Load Information | 76 |
Examining | 76 |
Table of Contents | 5 |