PAR Technologies V5 manual Directory /proc/sys/ps4/state, # cat /proc/sys/ps4/state/connections



The p4sock.ko module inserts a number of entries within the /proc filesystem. All ParaStation5 entries are located within the subdirectory /proc/sys/ps4. Three different subdirectories, listed below, are available.

To read a value, just type, e.g.

# cat /proc/sys/ps4/state/connections

to get the number of currently open connections. To modify a value, type, e.g.

# echo 10 > /proc/sys/ps4/ether/ResendTimeout

to set the new value for ResendTimeout.
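The read and write steps above can be wrapped in two small helper functions. This is only a sketch: the names ps4_get and ps4_set and the PS4_PROC_DIR variable are hypothetical and not part of ParaStation5. The write helper checks that the entry is writable and reads the value back afterwards.

```shell
# Base directory of the ParaStation5 entries. The variable name is an
# assumption made for this sketch; it is overridable for testing.
PS4_PROC_DIR="${PS4_PROC_DIR:-/proc/sys/ps4}"

# ps4_get SUBDIR ENTRY: print the current value of an entry.
ps4_get() {
    cat "$PS4_PROC_DIR/$1/$2"
}

# ps4_set SUBDIR ENTRY VALUE: write a value, then read it back to confirm.
ps4_set() {
    entry="$PS4_PROC_DIR/$1/$2"
    if [ ! -w "$entry" ]; then
        echo "ps4_set: $entry is not writable (read-only entry, or not root?)" >&2
        return 1
    fi
    echo "$3" > "$entry" && cat "$entry"
}
```

For example, "ps4_get state connections" prints the number of open connections, and "ps4_set state polling 2" switches the polling flag and prints the value now in effect.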

5.2.1. Directory /proc/sys/ps4/state

This state directory contains various entries showing protocol counters. All these entries, except polling, are read-only.

HZ: reads the number of timer interrupts per second for this kernel ("jiffies").

A jiffy is the base unit for system timers used by the Linux kernel, so all timeouts within the kernel are based on this timer resolution. On 2.4 kernels, this is typically 100 Hz (one jiffy = 10 ms). However, some kernel versions, e.g. those shipped with newer SuSE Linux versions, include patches that raise this to a much higher value.
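Because timeouts are expressed in jiffies, the same number means a different wall-clock time on kernels with different HZ values. The following sketch converts a jiffy count to milliseconds using integer arithmetic; the function name jiffies_to_ms is hypothetical and not part of ParaStation5.

```shell
# jiffies_to_ms JIFFIES HZ: convert a jiffy count to milliseconds.
# Uses integer arithmetic, so fractions of a millisecond are truncated.
jiffies_to_ms() {
    echo $(( $1 * 1000 / $2 ))
}
```

On a node with the module loaded, the HZ value can be taken from the entry described above, e.g. jiffies_to_ms 10 "$(cat /proc/sys/ps4/state/HZ)". With HZ = 100 this yields 100 (ms), with HZ = 1000 it yields 10 (ms).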

connections: reads the current number of open connections.

polling: returns the current value of the polling flag: 0 = never poll, 1 = poll if otherwise idle (number of runnable processes < number of CPUs), 2 = always poll. Writing this value will immediately change the polling strategy.
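The three polling values can be mapped to their meaning with a small helper, which is handy in status scripts. This is a sketch only; the function name polling_name is hypothetical, and the descriptions are taken from the list above.

```shell
# polling_name VALUE: print the meaning of a polling flag value.
# Returns non-zero for values other than 0, 1 or 2.
polling_name() {
    case "$1" in
        0) echo "never poll" ;;
        1) echo "poll if otherwise idle" ;;
        2) echo "always poll" ;;
        *) echo "unknown polling value: $1" >&2; return 1 ;;
    esac
}
```

A typical call would be polling_name "$(cat /proc/sys/ps4/state/polling)".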

recv_net_ack: number of received ACKs.

recv_net_ctrl: number of received control packets (ACK, NACK, SYN, SYNACK, ...).

recv_net_data: number of received data packets.

recv_net_nack: number of received NACKs.

recv_user: number of packets delivered to application buffers.

send_net_ack: number of sent ACKs.

send_net_ctrl: number of sent control packets.

send_net_data: number of sent data packets.

send_net_nack: number of sent NACKs.

send_user: number of packets sent by the application.

sockets: number of open sockets connecting to the ParaStation5 protocol module.

timer_ack: number of expired delayed ACK timers.

timer_resend: number of expired resend timers.
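Since each counter is a separate file, a snapshot of all of them can be taken with a short loop. The sketch below prints one "name value" line per entry; the function name ps4_dump_state is hypothetical, and the optional directory argument exists only so the function can be tried against a copy of the directory.

```shell
# ps4_dump_state [DIR]: print "name value" for every entry in the state
# directory. DIR defaults to /proc/sys/ps4/state.
ps4_dump_state() {
    dir="${1:-/proc/sys/ps4/state}"
    for f in "$dir"/*; do
        [ -f "$f" ] || continue
        printf '%s %s\n' "$(basename "$f")" "$(cat "$f")"
    done
}
```

Two snapshots taken before and after a test run, e.g. redirected to files and compared with diff, show which counters changed during the run.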

5.2.2. Directory /proc/sys/ps4/ether

This directory groups all Ethernet-related parameters for the ParaStation5 p4sock protocol. All these entries can be read and written; newly written values take effect immediately.

AckDelay: maximum delay in "jiffies" for ACK messages. If no message onto which an ACK for already received packets can be piggybacked is sent within this time frame, a single ACK message will be generated. Must be less than ResendTimeout.
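The constraint on AckDelay can be checked before a new value is written. The sketch below assumes that ResendTimeout is an entry in the same ether directory and is also given in jiffies; the function name set_ack_delay and its optional directory argument are hypothetical.

```shell
# set_ack_delay NEW_DELAY [DIR]: write AckDelay only if it is less than
# the current ResendTimeout. DIR defaults to /proc/sys/ps4/ether.
# Assumption: ResendTimeout lives in the same directory as AckDelay.
set_ack_delay() {
    new="$1"
    dir="${2:-/proc/sys/ps4/ether}"
    resend="$(cat "$dir/ResendTimeout")" || return 1
    if [ "$new" -ge "$resend" ]; then
        echo "AckDelay ($new) must be less than ResendTimeout ($resend)" >&2
        return 1
    fi
    echo "$new" > "$dir/AckDelay"
}
```

With a ResendTimeout of 10, set_ack_delay 5 writes the new value, while set_ack_delay 10 is rejected and leaves AckDelay unchanged.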


ParaStation5 Administrator's Guide
