HP Cluster Test Software manual Server health check

Server health check

The server health check tool reports the overall health status of the nodes. It generates Temperature, Fan, and Power reports from values retrieved from each server's management interface (LO100i or iLO2), and it covers every active node in the cluster. When the Enable Health Check option on the Cluster Test interface is selected, the health data is polled on the head node every five minutes.
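The five-minute polling cycle can be pictured with a minimal Python sketch. This is not Cluster Test's own code: the function `poll_health`, the `read_health` callback, and the shape of the returned data are all hypothetical names chosen for illustration. Only the five-minute interval and the three report types come from the text above.

```python
import time
from typing import Callable, Dict, Iterable, List

# Five minutes, the polling interval stated in the text.
POLL_INTERVAL_SECONDS = 5 * 60


def poll_health(
    nodes: Iterable[str],
    read_health: Callable[[str], Dict[str, str]],
    rounds: int,
    sleep: Callable[[float], None] = time.sleep,
) -> List[Dict[str, Dict[str, str]]]:
    """Poll every node `rounds` times, waiting one interval between rounds.

    `read_health` is a hypothetical callback that returns the Temperature,
    Fan, and Power values for one node. How the real tool queries the
    management interface is not described in this section.
    """
    node_list = list(nodes)
    history: List[Dict[str, Dict[str, str]]] = []
    for round_index in range(rounds):
        # One snapshot holds the readings of all nodes for this round.
        snapshot = {node: read_health(node) for node in node_list}
        history.append(snapshot)
        if round_index < rounds - 1:
            sleep(POLL_INTERVAL_SECONDS)
    return history
```

Passing `sleep` as a parameter lets the loop be exercised without actually waiting five minutes between rounds.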

Once you select Enable Health Check, Cluster Test starts polling the health data for all servers, including the head node. Results are written to the following location: /opt/clustertest/logs/server-health/<node-name>.
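To check from the command line which nodes already have health results, a short Python sketch such as the following could be used. It assumes that the log location contains one entry per node, named after the node. Whether that entry is a file or a directory is not stated here, so the sketch only lists names. The function name `nodes_with_health_logs` is hypothetical.

```python
from pathlib import Path
from typing import List

# Log location given in the text; each entry is assumed to be named after a node.
HEALTH_LOG_ROOT = Path("/opt/clustertest/logs/server-health")


def nodes_with_health_logs(root: Path = HEALTH_LOG_ROOT) -> List[str]:
    """Return the sorted names of the entries found under the log location."""
    if not root.is_dir():
        # Health check was never enabled, or Cluster Test is not installed here.
        return []
    return sorted(entry.name for entry in root.iterdir())


if __name__ == "__main__":
    for node_name in nodes_with_health_logs():
        print(node_name)
```

An empty result suggests that Enable Health Check has not yet been selected, or that the first polling cycle has not completed.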

You can also view the health check results from the Cluster Test toolbar by selecting Tools→Server Health.

From the Server Health Status window, hold the left mouse button down over a node item to display a menu with the items Temperature, Fan, and Power. From this menu, select the report you'd like to view for that node.

Below is an example Temperature report. The data in the report is historical, beginning from the time Enable Health Check was selected on the Cluster Test interface.

[Figure: example Temperature report]