Escali 4.4 manual Scali kernel drivers, Uninstalling SMC, Troubleshooting Network providers

-n <hostname> - Specify hostname of Scali license server

This option tells the software which host to contact to check out a license. The host can also be set manually by editing the scalm_net_server parameter in /opt/scali/etc/scalm.conf.
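To see which license server is currently configured, the parameter can be looked up in the configuration file. The following is a minimal sketch: the function name show_license_server is hypothetical, only the parameter name and the file path are taken from the text above, and nothing is assumed about the syntax of the file.

```shell
#!/bin/sh
# Sketch: print the line(s) of the license configuration file that mention
# scalm_net_server. The function name is illustrative; the default path and
# the parameter name come from the manual text.
show_license_server() {
    conf=${1:-/opt/scali/etc/scalm.conf}
    if [ ! -r "$conf" ]; then
        echo "cannot read $conf" >&2
        return 1
    fi
    # grep exits non-zero when the parameter is not mentioned at all.
    grep scalm_net_server "$conf"
}
```

Calling show_license_server with no argument inspects the default file; a different file can be given as the first argument.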

-l - Create a license request to be sent to license@scali.com

Host information from the license server must be included in the license request.

Scali MPI Connect is licensed software. You need a license from Scali to run an MPI application with the mpirun or mpimon program launcher. Scali usually provides a time-limited demo license for installation and system test. The user then sends a permanent license request to license@scali.com. Scali processes the request and replies with a permanent license file. This file must be installed as /opt/scali/etc/license.dat on the license server using the following command (as described above):

root# /opt/scali/sbin/smcinstall -u <licfile>
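After the command has run, it can be useful to confirm that the license file is in place. The sketch below only checks that the file exists, is non-empty and is readable; it does not validate the license itself. The function name check_license is hypothetical; the default path is the one given above.

```shell
#!/bin/sh
# Sketch: confirm that a license file is present after "smcinstall -u".
# This is a presence check only; it says nothing about license validity.
check_license() {
    lic=${1:-/opt/scali/etc/license.dat}
    if [ -s "$lic" ] && [ -r "$lic" ]; then
        echo "license file present: $lic"
    else
        echo "license file missing, empty or unreadable: $lic" >&2
        return 1
    fi
}
```

Run check_license on the license server, optionally with another path as the first argument.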

C-9 Scali kernel drivers

Scali MPI Connect contains proprietary kernel-mode drivers which are loaded into the kernel. The drivers (ScaKal, ScaDET and ScaSCI) are built automatically to fit the running kernel, provided that a fully configured kernel source tree is installed. This tree is provided by the kernel-source RPM on SUSE and RedHat distributions; however, SUSE might require some manual configuration.
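A quick check of the kernel source tree before installing can save a failed build. The sketch below assumes that a configured tree can be approximated by the presence of a top-level Makefile and a .config file; this is a general Linux convention, not a test documented by Scali, and the driver build may require more than this. The function name kernel_tree_configured is hypothetical.

```shell
#!/bin/sh
# Sketch: rough test for a configured kernel source tree.
# Assumption: a top-level Makefile plus a .config file indicate that the
# tree has been configured. The actual driver build may need more.
kernel_tree_configured() {
    ksrc=$1
    [ -f "$ksrc/Makefile" ] && [ -f "$ksrc/.config" ]
}
```

For example, kernel_tree_configured /usr/src/linux succeeds only if both files exist; /usr/src/linux is an illustrative path and the actual location depends on the distribution.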

If the automatic build process fails, the drivers must be built manually using the script /opt/scali/libexec/rebuild_module.sh in the following way:

root# /opt/scali/libexec/rebuild_module.sh scakal <path to your linux kernel source>

optionally:

root# /opt/scali/libexec/rebuild_module.sh scadet <path to your linux kernel source>

if Scali MPI Connect for Direct Ethernet is installed, and:

root# /opt/scali/libexec/rebuild_module.sh ssci <path to your linux kernel source>

if Scali MPI Connect for SCI is installed. To complete the process, re-run the smcinstall script with the same options as previously used.
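The manual rebuild steps above can be collected in a small wrapper. This is a sketch, not a script shipped with Scali MPI Connect: the names run, rebuild_drivers and DRY_RUN are hypothetical, and only the rebuild_module.sh path and the module arguments (scakal, scadet, ssci) are taken from the text. By default the commands are printed rather than executed.

```shell
#!/bin/sh
# Sketch: rebuild the Scali kernel drivers by hand, in the order given above.
# DRY_RUN defaults to 1, so commands are printed instead of executed.
REBUILD=/opt/scali/libexec/rebuild_module.sh

run() {
    if [ "${DRY_RUN:-1}" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# rebuild_drivers <kernel source path> [extra module ...]
# scakal is always rebuilt; pass scadet and/or ssci for the optional drivers.
rebuild_drivers() {
    ksrc=$1
    shift
    for mod in scakal "$@"; do
        # Stop at the first module that fails to build.
        run "$REBUILD" "$mod" "$ksrc" || return 1
    done
}
```

With Direct Ethernet installed, rebuild_drivers <path to your linux kernel source> scadet prints the two commands to run; setting DRY_RUN=0 as root executes them. The final step of re-running smcinstall is left out because it depends on the options used at installation.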

C-10 Uninstalling SMC

To remove Scali MPI Connect, use the script:

root# /opt/scali/sbin/smcuninstall

C-11 Troubleshooting Network providers

Scali MPI Connect now uses DAT (Direct Access Transport) as its API for connecting to the drivers for the different interconnects. In DAT terminology, these drivers are called provider libraries, or DAPLs.
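A first troubleshooting step is to see which providers are registered on a node. The sketch below assumes the conventional DAT static registry file /etc/dat.conf, in which each non-comment line starts with the interface name and carries the provider library path in the fifth field. The registry location used by a Scali installation may differ, and the function name list_dat_providers is hypothetical.

```shell
#!/bin/sh
# Sketch: list registered DAT providers as "<interface name> <library path>".
# Assumption: the registry follows the usual dat.conf layout, with the
# interface name in field 1 and the provider library path in field 5.
list_dat_providers() {
    conf=${1:-/etc/dat.conf}
    if [ ! -r "$conf" ]; then
        echo "cannot read $conf" >&2
        return 1
    fi
    # Skip blank lines and lines whose first field starts with '#'.
    awk 'NF && $1 !~ /^#/ { print $1, $5 }' "$conf"
}
```

If a provider is missing from the output, or its library path does not exist on the node, that provider is unlikely to be usable.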

Scali MPI Connect Release 4.4 Users Guide
