HP Workgroup System AWSXCIG-1A manual Running the OVP to Verify Software and Hardware Components

Page 36

#lsid

Platform LSF HPC 6.2 for SLURM, LSF_build_date

Copyright 1992-2005 Platform Computing Corporation

My cluster name is hptclsf

My master name is lsfhost.localdomain

2.Verify that the lsf partition exists and all nodes are in the idle state:

#sinfo

PARTITION AVAIL TIMELIMIT NODES STATE NODELIST

lsf

up infinite

8 idle n[1-8]

3.Confirm that the ncpus value matches the expected total number of available processors:

# lshosts

 

 

 

 

 

 

 

HOST_NAME

type

model

cpuf ncpus maxmem maxswp server

RESOURCES

lsfhost.loc SLINUX6

Opteron8

16.0

60 3649M

-

Yes

(slurm)

4.Verify the dynamic resource information:

# bhosts

 

 

 

 

 

 

 

 

HOST_NAME

STATUS

JL/U

MAX

NJOBS

RUN

SSUSP

USUSP

RSV

lsfhost.localdomai

ok

-

16

0

0

0

0

0

See the troubleshooting information in the HP XC System Software Administration Guide if you do not receive a status of ok from the bhosts command.

5.11 Running the OVP to Verify Software and Hardware Components

The Operation Verification Program (OVP) verifies the major HP XC software and hardware components to provide a level of confidence that the system has been installed and configured correctly.

The OVP performs tests to verify the following:

The interconnect is functional.

Network connectivity has been established.

The administration network is operational.

A valid license key file is installed and the license manager servers are up.

All compute nodes are responding and are available to run applications.

SLURM control daemons are responding and partitioning is valid if LSF-HPC with SLURM is configured.

CPU usage on all nodes except the head node (by default).

Memory usage on all compute nodes except the head node (by default).

Start the Operation Verification Program

To start the OVP, follow these steps:

1.Login as the root user on the head node.

2.Start the OVP with no component-specific options to test the entire system:

# ovp [--verbose [--verbose]] [--timeout=0]

3.Follow along with the OVP command output.

4.Examine the test results to ensure that all tests passed. Test results are stored in a date-stamped log file located in the /hptc_cluster/adm/logs/ovp directory.

Test failures and warnings are clearly reported in the log file, and it contains some troubleshooting information. In some cases, the errors might be obvious, and the test output is terse.

The format of the OVP log file name includes the following:

The internal name of the head node.

The OVP run date in MMDDYYformat.

36 XC Software Installation

Image 36
Contents HP Workgroup System and XC Software Installation Guide Copyright 2008 Hewlett-Packard Development Company, L.P Table of Contents HP Workgroup System Specifications Thermal Stabilization Cabling IP AddressesList of Figures Enclosure Bay Numbering Example Rear ViewRemoving the Box OA IP AddressList of Tables List of Examples Modify DatabasePage About This Document Intended AudienceDocument Organization Typographic ConventionsDocumentation Updates and Release Notes HP Encourages Your CommentsUser input HP Workgroup System Overview HP Workgroup System Views3shows an example rear view of the HP Workgroup System Hardware Preinstallation Checklist Hardware PrerequisitesFirmware Requirements Can cause problems HP c-Class BladeSystem Enclosure Components and SwitchesHardware Setup Unpack the EnclosureRemoving the Ramp and Front Cushion Gently roll the unit down the ramp. Callout 1, FigureInstalling and Starting Up the Hardware Plug the unit into a power sourceInterconnect Switch Setting Boot Order Save user informationEnabling Telnet Access Installing and Starting Up the Hardware Page Software Preinstallation Checklist Software PrerequisitesDownloading XC Software Patches Copying the XC.lic File to Your LaptopAssociating the Enclosure DVD to the Head Node Check Choose DVD Connect to the enclosure DVDPage XC Software Installation Booting the DVDBoot linux ks=hdscd0/ks.cfg Boot linux ks=hdscd0/ks.cfg pci=nommconfRunning the clusterprep Command # cd /opt/hptc/config/sbin# ./clusterprep --enclosurebased Installing Patches from Your Laptop # mkdir /home/patches# cp /media/iLO2FOLDER/* /home/patches # cd /home/patchesRunning the discover Command Putting the License Key File in the Correct Location# cp /media/iLO2-FOLDER/* /opt/hptc/etc/license/XC.lic Running the clusterconfig Command # ./discover --enclosurebased --single --ic=AdminNet# ./clusterconfig Example 5-2 clusterconfig Command Output Ok, Respecify Interfaces O Running the startsys Command Configuring the Snmp TrapLSF Post-Configuration Tasks Verifying LSF-HPC with SlurmFollow along with the OVP command output Running the OVP to Verify Software and Hardware ComponentsVerify the dynamic resource information Nrg Command Nagios Web InterfaceCreating a Baseline Report of the System Configuration Creating a Baseline Copy of the DatabaseSetting Up Vlan Stop bits Flow controlConfirm saving to Flash y/n y Troubleshooting Unable to Manually Set IP Addresses for the iLOsChanging External IP Addresses For example Copy the file# sysstart imageandreboot Lost Connection to the iLORemoving a Bad Golden Image # siimage baseimageLost Terminal Window When in the IRC Page Additional Software Setup Information # scontrol reconfig# spconfig Page Additional Hardware Setup Information HP Workgroup System SpecificationsTable B-2 Thermal Stabilization Specification IP Addresses on a Corporate Network CablingIP Addresses Extend the Insight Display PanelFigure C-3 OA IP Address Configure sendmail Locate the section of the file that is similar to thisConfigure sendmail Glossary Network Base imageItrc NAT SMP Index LSFXC installation, 27 XC license, 24 XC patches