PAR Technologies V5 manual Appendix A. Quick Installation Guide

Page 87

Appendix A. Quick Installation Guide

This appendix gives a brief overview how to install ParaStation5 on a cluster. A detailed description can be found in Chapter 3, Installation and Chapter 4, Configuration.

1.Shutdown

If this is an update of ParaStation, first shut down the ParaStation system. In order to do this, startup psiadmin and issue a shutdown command.

#/opt/parastation/bin/psiadmin psiadmin> shutdown

This will terminate all currently running tasks controlled by ParaStation, including psiadmin.

2.Get the installation packages

Get the necessary installation packages from the download section of the ParaStation homepage www.parastation.com. Required packages are psmgmt, pscom and psmpi2. The documentation package psdoc is optional.

If you want to compile the packages yourself, download the source packages (*.src.rpm) and rebuild it, using the rpmbuild command, e.g.:

#rpmbuild --rebuild psmgmt.5.0.0-0.src.rpm

#rpm -U psmgmt.5.0.0-0.i586.rpm

#rpmbuild --rebuild pscom.5.0.0-0.src.rpm

#rpm -U pscom.5.0.0-0.i586.rpm

#rpm -U pscom-modules.5.0.0-0.i586.rpm

#rpmbuild --rebuild psmpi2.5.0.0-1.src.rpm

#rpm -U psmpi2.5.0.0-1.i586.rpm

The psmgmt package must be installed before the pscom package may be built, similar for pscom and psmpi2. If you only want to rebuild the kernel modules for the p4sock protocol, use

# rpmbuild --rebuild --with modules pscom.5.0.0-0.src.rpm

This will render a RPM package with the ParaStation kernel modules suitable for your setup.

3.Install software on the server

Install the ParaStation distribution files on the server machine, if not yet done:

#rpm -U psmgmt.5.0.0-0.i586.rpm pscom.5.0.0-0.i586.rpm \ pscom-modules.5.0.0-0.i586.rpm psmpi2.5.0.0-1.i586.rpm \ psdoc.5.0.0-0.noarch.rpm

4.Install software on the compute nodes

Repeat step 3 for each node. You may omit the documentation package.

5.Configuration

Next, the configuration file parastation.conf has to be adapted to the local settings. The template file /opt/parastation/config/parastation.conf.tmpl should be copied to /etc/ parastation.conf and adjusted to the local needs. The configuration could be verified using the command test_config(1) located in /opt/parastation/bin.

This configuration file must be copied to all other nodes.

6.Startup ParaStation

ParaStation5 Administrator's Guide

83

Image 87
Contents Administrators Guide Info@par-tec.com ParaStation5 Administrators GuideTable of Contents Problem different groups of nodes are seen as up or down History of ParaStation IntroductionAbout this document Kernel modules Technical overviewRuntime daemon LibrariesLicense Installation PrerequisitesHardware Directory structure SoftwareKernel version Mpi2, mpi2-intel, mpi2-pgi, mpi2-psc Installation via RPM packagesGetting the ParaStation5 RPM packages ManInstalling the RPMs Compiling the ParaStation5 packages from sourceFile Version Installing the documentation Etc/init.d/xinetd reloadParaStation entries # rpm -Uv psmpi2.5.0.0-1.i586.rpm Installing MPIFurther steps # rpm -Uv psdoc-5.0.0-1.noarch.rpmUninstalling ParaStation5 ParaStation5 Administrators Guide Define Number of nodes ConfigurationConfiguration of the ParaStation system Copy templateEnable optimized network drivers # /opt/parastation/bin/testconfigHostname id HWType runJob starter accounter Testing the installation # /opt/parastation/bin/testnodes -np nodes # /opt/parastation/bin/psiadmin -s -c listParaStation5 pscom communication library Insight ParaStation5# echo 10 /proc/sys/ps4/state/ResendTimeout Directory /proc/sys/ps4/state# cat /proc/sys/ps4/state/connections Directory /proc/sys/ps4/local Controlling process placementUsing the ParaStation5 queuing facility Using non-ParaStationapplicationsExporting environment variables for a task Export LDPRELOAD=/opt/parastation/lib64/libp4tcp.so Controlling ParaStation5 communication pathsExport PSPLIB=/opt/parastation/lib64/libpscomopenib.so Authentication within ParaStation5Pspshm or Pspsharedmem PSPP4S or PSPP4SOCKNodes and CPUs Homogeneous user ID spaceSingle system view Parallel shell toolPSIRARGPRE0=/some/path/env2tok Integration with AFSIntegrating external queuing systems Tok2envMulticasts Route add -net 224.0.0.0 netmask 240.0.0.0 dev ethX Using ParaStation accountingCopying files in parallel # UseMCastSpawning processes belonging to all groups Using ParaStation process pinningUsing memory binding Changing the default ports for psid8Port Troubleshooting Problem psiadmin returns errorProblem node shown as down Problem cannot start process on frontend Problem cannot start parallel taskProblem bad performance Problem different groups of nodes are seen as up or downProblem psid does not startup, reports port in use Problem pssh failsProblem processes cannot access files on remote nodes Reference Pages ParaStation5 Administrators Guide Parameters InstallDir inst-dir , InstallationDir inst-dirParastation.conf DescriptionStatusscript SetupscriptStartscript StopscriptElan P4sockOpenib MvapiNrOfNodes num Accounter$GENERATE 1-96 node$0,2 $0 Node node17 16 HWType ethernet p4sock starter yes runJobs noMCastGroup group-num SelectTime timeDeadInterval num LogLevel numMemLock size Core sizeCPUTime time DataSize sizeProc CPUmap map Processes maxprocsRdpResendTimeout ms RdpTimeout msStatusTimeout ms RdpClosedTimeout msSee also ErrorsParaStation5 Administrators Guide Psiadmin SynopsisOptions Extended description Standard ErrorStandard Input Standard OutputExit AllAllproc cnt count Load Count hw hwDown HardwareRdp Summary max maxQuit Maxproc nodes Accounters nodesUser nodes Group nodesNodesSort nodes Master nodesFreeOnSuspend nodes HandleOldBins nodesRlrss nodes Cpumap nodesRdpResendTimeout nodes RdpTimeout nodesStatusTimeout nodes RdpClosedTimeout nodesRestart nodes Resolve nodesPsiddebug mask nodes Selecttime time nodesPattern Name Description HandleOldBins 0 1 nodes Rdpmaxretrans val nodes RdpResendTimeout ms nodes RdpTimeout ms nodesStatusTimeout ms nodes RdpClosedTimeout ms nodesVerbose FilesQuiet NormalPsid Configfile=file Debug=levelLogfile=file ? , --usage Show a help message TestconfigFilename NumParaStation5 Administrators Guide Map TestnodesNp num Cnt numParaStation5 Administrators Guide Testpse -npnum TestpseParaStation5 Administrators Guide ?,--help P4statSock NetParaStation5 Administrators Guide P4tcp AddDelete ParaStation5 Administrators Guide Pattern Description PsaccounterVar/account/yyyymmdd Accounting files, one per day DumpcoreCoredir=dir ?, --helpPsaccview Ls,--ltotsum Lj,--ljobsLu,--ltotuser Lg,--ltotgroupEnd CpuweightAqtime CputimeInitialization file Mlisten ParaStation5 Administrators Guide Appendix A. Quick Installation Guide # /opt/parastation/bin/psiadmin psiadmin add # chkconfig -a /etc/init.d/parastationTesting Appendix B. ParaStation license Page Page Page Changes to the runtime environment Building and installing ParaStation5 packages# psiadmin -s Appendix C. Upgrading ParaStation4 to ParaStation5Page ARP GlossarySee ParaStation Logger To share a common address space within a node ParaStation5 Administrators Guide