Section: 3.3 Running Scali MPI Connect programs
For each MPI process SMC will try to establish contact with each other MPI process, in the order listed. This enables mixed interconnect systems, and provides a means for working around failed hardware.
In a system interconnect where the primary interconnect is Myrinet, if one node has a faulty card, using the device list in the example, all communication to and from the faulty node will happen over TCP/IP while the remaining nodes will use Myrinet. This offers the unique ability to continue running applications over the full set of nodes even when there are interconnect faults.
3.3.3 mpirun - wrapper script
mpirun is a wrapper script for mpimon, providing legacy MPICH style startup for SMC applications. Instead of the mpimon syntax, where a list of pairs of node name and number of
Using scaconftool, mpirun attempts to generate a list of operational nodes. Note that only operational nodes are selected. If no operational node is available, an error message is printed and mpirun terminates. If scaconftool is not available, mpirun attempts to use the file /opt/ scali/etc/ScaConf.nodeidmap for selecting the list of operational notes. In the generated list of nodes, mpirun evenly divides the
3.3.3.1 mpirun usage
mpirun <mpirunoptions> <mpimonoptions> <userprogram> [<programoptions>]
where
<mpirunoptions> mpirun options <mpimonoptions> options passed on to mpimon <userprogram> name of application program to run.
and
<programoptions> program options passed on to the application program.
The following mpirunoptions exist:
Limit runtime to <time> minutes. | |
Total number of | |
Maximum number of | |
| nodes. |
| Submit job to PBS queue system |
Use mpich compatible pgfile for program, | |
| specification. pgfile entry: <nodename> <#procs> <progname> |
| The program name given at command line is additionally started |
| with one |
Verbose. | |
Debug all | |
| Limit runtime to <time> minutes. |
| Do not use scaconftool for generating nodelist. |
Ignore the /opt/scali/etc/ScaConf.nodearchmap file (which | |
| describes each node). |
Specify nodename of | |
| Distribute stdin to |
Scali MPI Connect Release 4.4 Users Guide | 27 |