Section:
B-1.2 Why can I not start mpid?
mpid opens a socket and assigns a predefined mpid port number (see /etc/services for more information), to the end point. If mpid is terminated abnormally, the mpid port number cannot be
Use netstat
A previous SMC run has not terminated properly.
Check for
Use /opt/scali/sbin/scidle
Use /opt/scali/bin/scash to check for leftover shared memory segments on all nodes (ipcs for Solaris and Linux).
Note: core dumping takes time.
The application has required too much SCI or shared memory resources.
The mpimon
B-1.3 Why does my program terminate abnormally?
The application core dumps.
Use a debugger to locate the point of violation. The application may need to be recompiled to include symbolic debug information
Define SCAMPI_INSTALL_SIGSEGV_HANDLER=1 and attach to the failing process with the debugger.
B-1.4 General problems
Are you reasonably certain that your algorithms are MPI safe?
Check if every send has a matching receive.
The program just hangs
If the application has a large degree of asynchronicity, try to increase the
The program terminates without an error message
Investigate the core file, or rerun the program in a debugger.
Scali MPI Connect Release 4.4 Users Guide | 55 |