PAR Technologies V5 manual Psid, Synopsis

Page 67

psid

psid the ParaStation daemon. The organizer of the ParaStation software architecture.

Synopsis

psid [-v?] [-d level] [-f configfile] [-l logfile] [--usage]

Description

The ParaStation daemon is implemented as a Unix daemon process. It supervises allocated resources, cleans up after application shutdowns, and controls access to common resources. Thus, it takes care of tasks which are usually managed by the operating system.

The local daemon is usually started by executing psiadmin(1). If it is not running at the time a ParaStation process is starting, the inetd(8) or xinetd(8) daemon is starting up psid automatically. The daemon can also be started using the command line. Parameters can be given at the command line or in the configuration file inetd.conf(5) or xinetd.conf(5). Most of the parameters can also be given in the ParaStation configuration file parastation.conf(5). As an alternative, the psid can be installed as a service using the start/stop script /etc/init.d/parastation.

Nodes without a running ParaStation daemon are not visible within the cluster. Be aware of the fact that psiadmin(1) usually only starts the local daemon. All other daemons managing the nodes configured to belong to the cluster may be started using the add directive from within the ParaStation administration tool psiadmin(1).

If psiadmin(1) is started with the -soption, all daemons within the cluster will be started automatically.

The ParaStation daemon must always run with root privileges.

Before a process can communicate with the ParaStation system, it has to register with the daemon. Access may be granted or denied. The daemon can deny the access due to several reasons:

the ParaStation system library of the process and the ParaStation daemon are incompatible.

the daemon is in a state where it does not accept new connections.

insufficient resources.

the user is temporally not allowed to access ParaStation (see psiadmin(1)).

the group is temporally not allowed to access ParaStation (see psiadmin(1)).

the number of processes exceed the maximum set by psiadmin(1).

The ParaStation daemon can restrict the access to the communication subsystem to a specific user or a maximum number of processes. This enables the cluster to run in an optimized way, since multiple processes slow down application execution due to scheduling overhead. (See psiadmin(1), set user, set group and set maxproc for this features.)

All ParaStation daemons are connected to each other. They exchange local information and transmit demands of local processes to the psid of the destination node. With this cooperation, the ParaStation system offers a distributed resource management.

The ParaStation daemon spawns and kills client processes on demand of a parent process. The ParaStation system transfers remote spawning or killing requests to the daemon of the destination node. Then operating system functionality is used to spawn and kill the processes on the local node. The spawned process runs with same user and group id as the spawning process. The ParaStation system redirects the output of spawned process to the terminal of the parent process.

ParaStation5 Administrator's Guide

63

Image 67
Contents Administrators Guide Info@par-tec.com ParaStation5 Administrators GuideTable of Contents Problem different groups of nodes are seen as up or down History of ParaStation IntroductionAbout this document Kernel modules Technical overviewRuntime daemon LibrariesLicense Prerequisites InstallationHardware Software Directory structureKernel version Mpi2, mpi2-intel, mpi2-pgi, mpi2-psc Installation via RPM packagesGetting the ParaStation5 RPM packages ManCompiling the ParaStation5 packages from source Installing the RPMsFile Version Etc/init.d/xinetd reload Installing the documentationParaStation entries # rpm -Uv psmpi2.5.0.0-1.i586.rpm Installing MPIFurther steps # rpm -Uv psdoc-5.0.0-1.noarch.rpmUninstalling ParaStation5 ParaStation5 Administrators Guide Define Number of nodes ConfigurationConfiguration of the ParaStation system Copy template# /opt/parastation/bin/testconfig Enable optimized network driversHostname id HWType runJob starter accounter Testing the installation # /opt/parastation/bin/testnodes -np nodes # /opt/parastation/bin/psiadmin -s -c listParaStation5 pscom communication library Insight ParaStation5Directory /proc/sys/ps4/state # echo 10 /proc/sys/ps4/state/ResendTimeout# cat /proc/sys/ps4/state/connections Directory /proc/sys/ps4/local Controlling process placementUsing non-ParaStationapplications Using the ParaStation5 queuing facilityExporting environment variables for a task Export LDPRELOAD=/opt/parastation/lib64/libp4tcp.so Controlling ParaStation5 communication pathsExport PSPLIB=/opt/parastation/lib64/libpscomopenib.so Authentication within ParaStation5Pspshm or Pspsharedmem PSPP4S or PSPP4SOCKNodes and CPUs Homogeneous user ID spaceSingle system view Parallel shell toolPSIRARGPRE0=/some/path/env2tok Integration with AFSIntegrating external queuing systems Tok2envMulticasts Route add -net 224.0.0.0 netmask 240.0.0.0 dev ethX Using ParaStation accountingCopying files in parallel # UseMCastSpawning processes belonging to all groups Using ParaStation process pinningUsing memory binding Changing the default ports for psid8Port Problem psiadmin returns error TroubleshootingProblem node shown as down Problem cannot start process on frontend Problem cannot start parallel taskProblem bad performance Problem different groups of nodes are seen as up or downProblem psid does not startup, reports port in use Problem pssh failsProblem processes cannot access files on remote nodes Reference Pages ParaStation5 Administrators Guide Parameters InstallDir inst-dir , InstallationDir inst-dirParastation.conf DescriptionStatusscript SetupscriptStartscript StopscriptElan P4sockOpenib MvapiNrOfNodes num Accounter$GENERATE 1-96 node$0,2 $0 Node node17 16 HWType ethernet p4sock starter yes runJobs noMCastGroup group-num SelectTime timeDeadInterval num LogLevel numMemLock size Core sizeCPUTime time DataSize sizeProc CPUmap map Processes maxprocsRdpResendTimeout ms RdpTimeout msStatusTimeout ms RdpClosedTimeout msSee also ErrorsParaStation5 Administrators Guide Synopsis PsiadminOptions Extended description Standard ErrorStandard Input Standard OutputAll ExitAllproc cnt count Load Count hw hwDown HardwareSummary max max RdpQuit Maxproc nodes Accounters nodesUser nodes Group nodesNodesSort nodes Master nodesFreeOnSuspend nodes HandleOldBins nodesRlrss nodes Cpumap nodesRdpResendTimeout nodes RdpTimeout nodesStatusTimeout nodes RdpClosedTimeout nodesRestart nodes Resolve nodesPsiddebug mask nodes Selecttime time nodesPattern Name Description HandleOldBins 0 1 nodes Rdpmaxretrans val nodes RdpResendTimeout ms nodes RdpTimeout ms nodesStatusTimeout ms nodes RdpClosedTimeout ms nodesVerbose FilesQuiet NormalPsid Debug=level Configfile=fileLogfile=file ? , --usage Show a help message TestconfigFilename NumParaStation5 Administrators Guide Map TestnodesNp num Cnt numParaStation5 Administrators Guide Testpse -npnum TestpseParaStation5 Administrators Guide ?,--help P4statSock NetParaStation5 Administrators Guide Add P4tcpDelete ParaStation5 Administrators Guide Pattern Description PsaccounterVar/account/yyyymmdd Accounting files, one per day DumpcoreCoredir=dir ?, --helpPsaccview Ls,--ltotsum Lj,--ljobsLu,--ltotuser Lg,--ltotgroupEnd CpuweightAqtime CputimeInitialization file Mlisten ParaStation5 Administrators Guide Appendix A. Quick Installation Guide # chkconfig -a /etc/init.d/parastation # /opt/parastation/bin/psiadmin psiadmin addTesting Appendix B. ParaStation license Page Page Page Changes to the runtime environment Building and installing ParaStation5 packages# psiadmin -s Appendix C. Upgrading ParaStation4 to ParaStation5Page ARP GlossarySee ParaStation Logger To share a common address space within a node ParaStation5 Administrators Guide