PAR Technologies V5 manual Controlling process placement, Directory /proc/sys/ps4/local



MaxAcksPending: maximum number of pending ACK messages before an "urgent" ACK message is sent.

MaxDevSendQSize: maximum number of entries of the (protocol internal) send queue to the network device.

MaxMTU: maximum packet size used for network packets. For sending packets, the minimum of MaxMTU and the service-specific MTU is used.

MaxRecvQSize: size of the protocol internal receive queue.

MaxResend: number of retries before a connection is declared dead.

MaxSendQSize: size of the protocol internal send queue.

ResendTimeout: delay in "jiffies" before packets that have not yet been acknowledged are resent. Must be greater than AckDelay.
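The parameters above can be inspected with a few lines of shell. The following is a minimal sketch, not part of ParaStation: the function names show_p4sock_params and resend_timeout_ok are hypothetical, the directory is passed in as an argument rather than hard-coded, and the sketch assumes that AckDelay is readable from the same directory as ResendTimeout.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with ParaStation).
# $1 is the /proc directory that holds the p4sock tuning entries.

# Print every parameter from the list above, one NAME=VALUE pair per line.
show_p4sock_params() {
    dir="$1"
    for name in MaxAcksPending MaxDevSendQSize MaxMTU MaxRecvQSize \
                MaxResend MaxSendQSize ResendTimeout; do
        if [ -r "$dir/$name" ]; then
            printf '%s=%s\n' "$name" "$(cat "$dir/$name")"
        else
            printf '%s=unavailable\n' "$name"
        fi
    done
}

# Check the documented constraint: ResendTimeout must be greater than AckDelay.
# Returns 0 if it holds, 1 if it is violated, 2 if an entry cannot be read.
# Assumption: AckDelay lives in the same directory as ResendTimeout.
resend_timeout_ok() {
    dir="$1"
    [ -r "$dir/ResendTimeout" ] && [ -r "$dir/AckDelay" ] || return 2
    resend=$(cat "$dir/ResendTimeout")
    ack=$(cat "$dir/AckDelay")
    [ "$resend" -gt "$ack" ]
}
```

Running resend_timeout_ok before and after changing ResendTimeout is a cheap way to catch a value that would violate the constraint stated above.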

5.2.3. Directory /proc/sys/ps4/local

Currently, there are no entries defined for this directory.

5.2.4. p4stat

The command p4stat can be used to list open sockets and network connections of the p4sock protocol.

$ /opt/parastation/bin/p4stat -s
Socket #0 : Addr: <00><00><00><00><00><'........' last_idx 0 refs 2
Socket #1 : Addr: <70><6f><72><74><33><'port384.' last_idx 0 refs 10
Socket #2 : Addr: <70><6f><72><74><31><'port144.' last_idx 0 refs 10

$ /opt/parastation/bin/p4stat -n
net_idx SSeqNo SWindow RSeqNo RWindow lusridx lnetidx rnetidx snq rnq refs
     84  30107   30467  30109   30468      84      84     230   0   0    2
     85  30106   30466  30106   30465      85      85     231   0   0    2
     86  30107   30467  30109   30468      86      86      84   0   0    2
     87  30106   30466  30106   30465      87      87      85   0   0    2
     88  30107   30467  30109   30468      88      88     217   0   0    2
     89  30106   30466  30106   30465      89      89     218   0   0    2
     90  30106   30466  30106   30465      90      90     220   0   0    2
     91  30106   30466  30106   30465      91      91     221   0   0    2
     92  30001   30361  30003   30362      92      92     232   0   0    2
     93  30001   30361  30003   30362      93      93     219   0   0    2
     94  30000   30000  30001   30360      94      94     233   0   0    2
     95  30000   30000  30001   30360      95      95     222   0   0    2
     96  30000   30000  30001   30360      96      96     222   0   0    2

This command shows some protocol internal parameters, like open connections, sequence numbers, reference counters, etc. For more information, see p4stat(8).
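Output of p4stat -n lends itself to simple filtering. The sketch below is an illustration, not part of ParaStation: the function name busy_connections is hypothetical, it assumes that p4stat prints one connection per line with the eleven columns named in the header, and it assumes that snq and rnq are send and receive queue lengths, which this section does not state.

```shell
#!/bin/sh
# Hypothetical filter (not shipped with ParaStation).
# Reads `p4stat -n` style output on stdin and prints the net_idx of every
# connection whose snq (column 9) or rnq (column 10) is non-zero.
# Assumption: snq/rnq are queue lengths; column positions follow the header
#   net_idx SSeqNo SWindow RSeqNo RWindow lusridx lnetidx rnetidx snq rnq refs
busy_connections() {
    awk '
        $1 == "net_idx" { next }                 # skip the header line
        NF == 11 && $1 ~ /^[0-9]+$/ && ($9 > 0 || $10 > 0) { print $1 }
    '
}
```

A possible invocation is /opt/parastation/bin/p4stat -n | busy_connections, which prints nothing when all queues are empty.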

5.3. Controlling process placement

ParaStation includes sophisticated functions to control the process placement for newly created parallel and serial tasks. These processes typically require a dedicated CPU (core). Upon task startup, the environment variables PSI_NODES, PSI_HOSTS and PSI_HOSTFILE are looked up (in this order) to get
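The lookup order named above can be sketched in shell. This is only an illustration: the function name first_placement_var is hypothetical, and the sketch assumes that the first variable found set and non-empty is the one that takes effect. The format of the values is not described here, so none is implied.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with ParaStation).
# Prints the name of the first placement variable that is set and non-empty,
# checking PSI_NODES, PSI_HOSTS and PSI_HOSTFILE in the documented order.
# Assumption: the first variable found wins.
first_placement_var() {
    if [ -n "${PSI_NODES:-}" ]; then
        echo PSI_NODES
    elif [ -n "${PSI_HOSTS:-}" ]; then
        echo PSI_HOSTS
    elif [ -n "${PSI_HOSTFILE:-}" ]; then
        echo PSI_HOSTFILE
    else
        echo none
    fi
}
```

Calling first_placement_var in the shell that will start the task shows which of the three variables would be consulted first under that assumption.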

ParaStation5 Administrator's Guide
