HP Insight Cluster Management Utility
Copyright 2013 Hewlett-Packard Development Company, L.P.
 Contents
 Defining a cluster with HP Insight CMU
Provisioning a cluster with HP Insight CMU
 Monitoring a cluster with HP Insight CMU
 Managing a cluster with HP Insight CMU
Advanced topics
Actions
Alerts
Alert reactions
Support and other resources
Troubleshooting
Detailed installation instructions
HP Insight CMU manpages
Glossary
Index
 Figures
Date command
Dmidecode command
Tables
Examples
 Features
HP Insight CMU configuration
Overview
Compute node monitoring
 Compute node administration
System disk replication
 Installing and upgrading HP Insight CMU
Installing HP Insight CMU
Management node hardware requirements
 Planning for compute node installation
Firmware upgrade requirements
Configuring the local smart array card
Configuring the management cards
Configuring the BIOS
NIC 1 PXE boot or PXE enabled
NIC 2 Disabled
7.1 DL3xx, DL5xx, DL7xx, Blades
Speed 9600 Bd
 7.2 DL160 G5, DL165c G5, DL165c G6, and DL180 G5 Servers
IDE
 Share NIC mode Disabled
DHCP Disabled
NIC1 control Enabled
NIC1 PXE Enabled
7.4 SL2x170z G6 and DL170h G6 Servers BIOS setting
NIC2 on the SL2x170z G6 Server
 Preparing for installation
Preinstallation limitations
HP Insight CMU kit delivery
 Operating system support
HP Insight CMU CD-ROM directory structure
RHEL 6 support
 HP Insight CMU installation checklist
Login privileges
SELinux and HP Insight CMU
 Installation procedures
Run /opt/cmu/bin/cmumgtconfig -c
# chkconfig --add cmu
 Installing HP Insight CMU with high availability
 Installing HP Insight CMU under HA
HA hardware requirements
Software prerequisites
Overview
 Configuring HA control of HP Insight CMU
HP Insight CMU HA service requirements
Installing and testing
cmuadmin1# /etc/init.d/cmu start
Start cmuserver
# /etc/init.d/cmuserver start
 Cmu hacmu service needs restart
/var/log/cmuservicehostname.log file for errors
cmuadmin2# /opt/cmu/tools/cmuhapostinstall
# /etc/init.d/cmu setaudit
# /etc/init.d/cmu stop
 HP Insight CMU configuration considerations
Upgrading HP Insight CMU HA service
Cmuha nothing to backup from the cmu HA share
Run cmuhapostinstall on server
 Stopping the HP Insight CMU service
Upgrading HP Insight CMU
Dependencies
Upgrading Java Runtime Environment
 Saving the HP Insight CMU database
Restoring the HP Insight CMU database
 HP Insight CMU service status
Defining a cluster with HP Insight CMU
Launching the HP Insight CMU GUI
HP Insight CMU main window
 Administrator mode
Quitting administrator mode
Click Options→Unprivileged Mode
Server requirements
 High-level checklist for building an HP Insight CMU cluster
Cluster administration
 Node management
Node management window
 Scanning nodes
Scan node dialog
 Management card password window
Adding nodes
 Cluster Administration→Node Management
Modifying nodes
 Network entity management
Importing nodes
Deleting nodes
Exporting nodes
 Adding network entities
Deleting network entities
 Provisioning a cluster with HP Insight CMU
Logical group management
 Modifying logical groups
Deleting logical groups
Renaming logical groups
 Autoinstall
Autoinstall requirements
Autoinstall templates
Autoinstall calling methods
 Using autoinstall from GUI
Enabling autoinstall
Creating an autoinstall logical group
Restart cmuserver
 Autoinstall compute nodes
Registering compute nodes
 Using autoinstall from CLI
Registering an autoinstall logical group
Adding nodes to autoinstall logical group
Cmu addtologicalgroup node to logicalgroupname
 Where nodes.txt is the list of nodes to autoinstall
Customization
 Backing up
Restrictions
Backing up a disk from a compute node in a logical group
 # /opt/cmu/image/logicalgroupname
 Cloning procedure
Cloning
 Preconfiguration
Cloning status
 Reconfiguration
Default content of prereconf.sh is
 Node static info
Rescan MAC
 HP Insight CMU image editor
Expanding an image
After editing the image, commit changes
# cmuimagecommit -i rh5u4x8664
 HP Insight CMU diskless environments
Modifying an image
Saving a modified cloning image
# /opt/cmu/bin/cmuimagecommit -i rh5u4x8664
Modifying the TFTP server configuration
Operating systems supported
On the management node
On the golden node
 Activating the diskless feature
Populating the HP Insight CMU database
Creating a diskless image
Creating a diskless logical group
 Adding a new logical group
 Adding nodes into the logical group
From the CLI
# /opt/cmu/cmucli
Cmu probekernel
 Booting the compute nodes
Cmu boot net myTestImage node1 noden
Understanding the structure of a diskless image
Customizing your diskless image
 Using reconf-diskless-image.sh
 Do not update the /opt/cmu/image/imageName/root directory
Best practices for diskless clusters
Templates and image file
Node. /opt/cmu/image/imageName/snapshot/nodeName
 Configuring additional NFS servers
# chkconfig nfs on
On Red Hat
 When a diskless logical group is created
# chkconfig nfsserver on
On SLES
Sample file
 When a node is added to the diskless logical group
Comments on High Availability (HA)
 Installing the HP Insight CMU monitoring client
Monitoring a cluster with HP Insight CMU
Deploying the monitoring client
 Monitoring the cluster
Launch the HP Insight CMU GUI
 Node and group status
Selecting the central frame display
 Global cluster view in the central frame
Monitoring window
 Resource view in the central frame
Resource view overview
 Detail mode in resource view
Gauge widget
Node view in the central frame
 Using time view
Node details
 Getting started
Tagging nodes
Adaptive stacking
 Bindings and options
Mouse control
 Troubleshooting
Technical dependencies
 Archiving user groups
Visualizing history data
 Tuning HP Insight CMU monitoring
Stopping HP Insight CMU monitoring
Action and alert files
Limitations
 Actions
Alerts
Alert reactions
 Name of the alert that caused the reaction
Level of the alert
Text of the Description for this reaction
 Using collectl for gathering monitoring data
Installing and starting collectl on compute nodes
# chkconfig --add collectl
# /etc/init.d/collectl start
 # collectl -c 1 -s+C --export lexpr
ActionAndAlertFile.txt file
# mkdir /var/log/collectl
# vi /etc/exports
/var/log/collectl *(rw,sync,no_all_squash,no_root_squash)
# exportfs -r
# cp -a /var/www/html/colplot /opt/cmu/www/colplot
# mkdir /var/log/collectl
# vi /etc/fstab
# vi /etc/collectl.conf
Restart collectl
# /etc/init.d/collectl restart
 Monitoring GPUs and coprocessors
Monitoring NVIDIA GPUs
Select plotting options, then click Generate Plot
 # cmuconfigamd
Monitoring AMD GPUs
 Monitoring Intel coprocessors
 Review the results and verify no errors are reported
# /opt/cmu/bin/cmuconfigintel
 Extended metric support
HP Insight CMU alert converted to SIM event
/opt/cmu/bin/cmusubmitextendedmetrics -f datafile
 Administrator menu
Managing a cluster with HP Insight CMU
Unprivileged user menu
SSH connection
 Power off
Management card connection
Virtual serial port connection
Shutdown
 Boot
Reboot
Change UID LED status
 Multiple windows broadcast
Single window pdsh
dshbak
cmudiff examples
cmupdsh dmidecode
cmupdsh cmudiff -h
cmupdsh cmudiff -d
 # vi /root/.ssh/config
Parallel distributed copy (pdcp)
 User group management
Adding user groups
Viewing and analyzing BIOS settings
HP Insight firmware management
Deleting user groups
Renaming user groups
Checking BIOS versions
Installing and upgrading firmware
Customizing the GUI menu
 Saving user settings
Basic commands
HP Insight CMU CLI
Starting a CLI interactive session
 Help commands
Getting help for a command
For example, to get more information about the halt command
Cmu help
 Executing a command on one node
Specifying nodes
Displaying logical groups of a cluster
Displaying nodes of a logical group
 Executing a command on a list of nodes
Executing a command on a range of nodes
Using wildcards
Executing a command on all nodes
 Administration and cloning commands
Executing a command on specific nodes of a logical group
Booting a set of nodes
Broadcasting commands to a set of nodes
 Rebooting a set of nodes
Powering off a set of nodes
Halting a set of nodes
 Setting the locator LED on or off
Cloning a set of nodes
Cmu locate on o185i192
Cmu locate off o185i192
 Adding a new logical group
Adding nodes to a logical group
Backing up a node
 Cmu backup
 Modifying a management card password
Cmu modifypassword ILOILOCMlo100i
Cmu modifypassword lo100i
Discovering MAC address for new nodes
 Administration utilities pdcp and pdsh
HP Insight CMU Linux shell commands
# /opt/cmu/bin/pdcp -w cn0001,cn0002 source /tmp/dest
# /opt/cmu/bin/pdsh -w cn0001,cn0002 ls
 Accessing the GUI for non-root users
Advanced topics
 Configuring sudo support
Custom menu options for non-root users
cjones ALL = Cmupower
bstevens ALL = NOPASSWD: Cmupower, Cmuimage
sbarney ALL = NOPASSWD: Cmupower, Cmuimage, Cmuetc
HP Insight CMU diskless API
 Build diskless image
Delete diskless image
Name of the new logical group
 Configure diskless node
Unconfigure diskless node
Boot diskless node
 HP Insight CMU remote hardware control API
Diskless check
lo100i
None
 Off
Osoff
Uidoff
Uidon
 CMUVALIDHARDWARETYPES=ILOlo100iILOCM
CMUVALIDHARDWARETYPES=ILOlo100iILOCMIPMI
 Support for ScaleMP
Cloning mechanisms
CMUvSMPPREFIX=vSMP
 Support and other resources
Contacting HP
Related information
 Command
Typographic conventions
Computer output
User input
 Troubleshooting
Network boot issues
HP Insight CMU logs
 Backup issues
Troubleshooting switch issues
Troubleshooting network boot
 Cloning issues
Administration command problems
GUI problems
If not, restart the HP Insight CMU service
 Certificate error
Detailed Java exception is
 Detailed installation instructions
Install required RPMs
Activating xinetd services
# chkconfig nfs on
# /etc/init.d/nfs start
 Firewall configuration
Java installation
Verifying the dhcpd listen interface
On the HP Insight CMU management node
 Installing HP Insight CMU licensing
Setting the Java Path
Configuring the HP Insight CMU management server hostname
Method
 Cmu service needs restart
Starting HP Insight CMU
Edit the /opt/cmu/etc/cmuserver.conf file
# vi /opt/cmu/etc/cmuserver.conf
 Configuring HP Insight CMU to start automatically
Installing HP Insight CMU on the GUI client workstation
Verifying the HP Insight CMU state
Cmuserver utility reports the state of the daemons
 Configuring the GUI client on Linux workstations
Using an ssh tunnel
Using an X Window server
 Launching the HP Insight CMU GUI using a web browser
Activating the HP Insight CMU GUI
Launching the HP Insight CMU directly from the Java file
 HP Insight CMU GUI
 HP Insight CMU manpages
cmushownodes(8)
 # /opt/cmu/bin/cmushownodes -a -o %n %i %k %m default %b %t
# /opt/cmu/bin/cmushownodes
cmushowlogicalgroups(8)
# /opt/cmu/bin/cmushowlogicalgroups -h logicalgroupname
Help
# /opt/cmu/bin/cmushowlogicalgroups
cmushownetworkentities(8)
# /opt/cmu/bin/cmushownetworkentities -h networkentity
# /opt/cmu/bin/cmushownetworkentities
# /opt/cmu/bin/cmushownetworkentities rack1
cmushowusergroups(8)
# /opt/cmu/bin/cmushowusergroups -h usergroup
# /opt/cmu/bin/cmushowusergroups
# /opt/cmu/bin/cmushowusergroups user1
cmushowarchivedusergroups(8)
# /opt/cmu/bin/cmushowarchivedusergroups
# /opt/cmu/bin/cmushowarchivedusergroups -p -f
# /opt/cmu/bin/cmushowarchivedusergroups -f -s
cmuaddnode(8)
mgtcardip -T|--mgt-card ILO|lo100i|ILOCM -R|--arch architecture
Node-number num
 Command-line mode
Processing 1 node
# cat nodes.txt
# /opt/cmu/bin/cmuaddnode -f nodes.txt
cmuaddnetworkentity(8)
Filename inputfile
# /opt/cmu/bin/cmuaddnetworkentity rack1 rack2
# /opt/cmu/bin/cmuaddnetworkentity -f networkentitylist
cmuaddlogicalgroup(8)
Filename inputfile input logical groups from inputfile Name
# /opt/cmu/bin/cmuaddlogicalgroup -n test -d cciss/c0d0
# /opt/cmu/bin/cmuaddlogicalgroup -f logicalgroupfile
cmuaddtologicalgroupcandidates(8)
Logicalgroup
Nodenamefile
cmuaddusergroup(8)
# /opt/cmu/bin/cmuaddusergroup user1 user2
# /opt/cmu/bin/cmuaddusergroup -f usergrouplist
cmuaddtousergroup(8)
Usergroup
# /opt/cmu/bin/cmuaddtousergroup -t group1 cn0003
# /opt/cmu/bin/cmuaddtousergroup -t group1 -f nodenamefile
cmuchangeactivelogicalgroup(8)
To flag cn0001 active in logical group rh6u0x8664
cmuchangenetworkentity(8)
Networkentity
# /opt/cmu/bin/cmuchangenetworkentity -t rack1 cn0001
cmudelfromlogicalgroupcandidates(8)
Delete one or more nodes from a logical group
Delete nodes from this logical group
To delete one node from a logical group rh6u0x8664
cmudelfromnetworkentity(8)
# /opt/cmu/bin/cmudelfromnetworkentity -t rack1 node1
 # /opt/cmu/bin/cmudelarchivedusergroup -h -v-t timeout -d
cmudelarchivedusergroup(8)
cmudelarchivedusergroup -- Delete an archived user group
Delete an archived user group
cmudelfromusergroup(8)
# /opt/cmu/bin/cmudelfromusergroup -t user1 node1
cmudellogicalgroup(8)
# /opt/cmu/bin/cmudellogicalgroup rh5u5x8664 sles11sp1x8664
# /opt/cmu/bin/cmudellogicalgroup -f logicalgrouplist
cmudelnetworkentity(8)
# /opt/cmu/bin/cmudelnetworkentity rack1 rack2
# /opt/cmu/bin/cmudelnetworkentity -f networkentitylist
cmudelnode(8)
# /opt/cmu/bin/cmudelnode node1 node2
# /opt/cmu/bin/cmudelnode -f nodelist
cmudelsnapshots(8)
Delete monitoring snapshots from the history database
cmudelusergroup(8)
# /opt/cmu/bin/cmudelusergroup user1 user2
# /opt/cmu/bin/cmudelusergroup -f usergrouplist
# /opt/cmu/bin/cmudelusergroup -a -m 60 -f usergrouplist
cmuconsole(8)
cmuconsole -- Connect to compute node management ports
# /opt/cmu/bin/cmuconsole computenodehostname
cmupower(8)
cmupower -p OFF -n cn0001
cmupower -p OFF -u user1
cmupower -p Boot -l rh6u0x8664
cmupower -p Uidon -u user2
cmucustomrun(8)
Title
# /opt/cmu/bin/cmucustomrun -l
# /opt/cmu/bin/cmucustomrun -t auditlspci
cmuclone(8)
=node
# /opt/cmu/bin/cmuclone -f /tmp/nodelist -i sles11sp1x8664
# /opt/cmu/bin/cmuclone -n node1 -i rh6u2x8664
cmubackup(8)
cmubackup -l myimage -n node20 -p cciss/c0d0p3,cciss/c0d0p1
cmubackup -l myimage -n node20 -r 3 -e /tmp/err.log
cmuscanmacs(8)
Options naming
# /opt/cmu/bin/cmuscanmacs -h hostname -p hostnameprefix
 Options general
 Example
Example node definitions
cmurescanmac(8)
cmurescanmac -- Rescan the MAC address of a node
# /opt/cmu/tools/cmurescanmac -n nodename N NICnum -h
Node name in the HP Insight CMU database
cmumodnode(8)
Filename
Lg logicalgroup
 # /opt/cmu/bin/cmumodnode -f nodes.txt
cmumonstat(8)
 # /opt/cmu/bin/cmumonstat --nodes=n0001,n0002,n0001-11
# /opt/cmu/bin/cmumonstat --all-sensors --all-nodes
# /opt/cmu/bin/cmumonstat --all-nodes --all-sensors --stats
# /opt/cmu/bin/cmumonstat --all-nodes --color
cmuimageopen(8)
# /opt/cmu/bin/cmuimageopen -h -i imagename
To open the HP Insight CMU image rh5u5x8664
# /opt/cmu/bin/cmuimageopen -i rh5u5x8664
cmuimagecommit(8)
# /opt/cmu/bin/cmuimagecommit -i rh5u5x8664
cmuconfignvidia(8)
cmuconfignvidia -- Configure NVIDIA GPU monitoring
cmuconfigamd(8)
cmuconfigamd -- Configure AMD GPU monitoring
cmuconfigintel(8)
cmuconfigintel -- Configure Intel coprocessor monitoring
# /opt/cmu/bin/cmuconfigintel -h -r -n
cmumgtconfig(8)
Automatically defaults when reasonable
Eth num1num2bond num1
Num
 # /opt/cmu/bin/cmumgtconfig -c
# /opt/cmu/bin/cmumgtconfig -c -s dhcp
# /opt/cmu/bin/cmumgtconfig -t
cmufirmwaremgmt(8)
cmufirmwaremgmt -- Verify and execute firmware
Ocmudiffparameters
Specifies a text file listing compute nodes
 Glossary
EBIPA
 Package management
 Index