HP UX 11i Workload Management (gWLM/WLM) Software manual Manually clearing an SRD

Models: UX 11i Workload Management (gWLM/WLM) Software

1 72
Download 72 pages 18.09 Kb
Page 43
Image 43

For information on enabling and viewing these events, refer to OptimizeGlobal Workload ManagerEvents.

You can then view these events using the Event Lists item in the left pane of System Insight Manager.

The following sections explain how to handle some of the events.

“Node Failed to Rejoin SRD on Start-up” event

If you see the event “Node Failed to Rejoin SRD on Start-up”:

1.Restart the gwlmagent on each managed node in the affected SRD:

# /opt/gwlm/bin/gwlmagent --restart

2.Verify the agent rejoined the SRD by monitoring the Shared Resource Domain View in System Insight Manager or by using the gwlm monitor command.

3.If the problem persists, check the files /var/opt/gwlm/gwlmagent.log.0 and /var/ opt/gwlm/gwlmcmsd.log.0 for additional diagnostic messages.

“SRD Communication Issue” and “SRD Reformed with Partial Set of Nodes” events

NOTE: Reforming with a partial set of nodes requires a minimum of three managed nodes in the SRD.

NOTE: “SRD Communication Issue” events are not enabled by default. To see these events, configure your events in System Insight Manager through the HP Matrix OE visualization menu bar using ToolsGlobal Workload ManagerEvents.

If you have an SRD containing n nodes and you get n - 1 of the “SRD Communication Issue” events but no “SRD Reformed with Partial Set of Nodes” events within 5 minutes (assuming an allocation interval of 15 seconds) of the first “SRD Communication Issue” event, you might need to restart the gwlmagent on each managed node in the affected SRD:

#/opt/gwlm/bin/gwlmagent --restart

Manually clearing an SRD

If gWLM is unable to reform an SRD, you can manually clear the SRD, as described in the following section.

Clearing an SRD of A.02.50.00.04 (or later) agents

The following command is an advanced command for clearing an SRD. The recommended method for typically removing a host from management is by using the gwlm undeploy command.

Starting with A.02.50.00.04 agents, you can manually clear an SRD with the following command:

#gwlm reset --host=host

where host specifies the host with the SRD to be cleared.

If this command does not work, use the procedure given in the following section.

Clearing an SRD of agents of any version

The procedure in this section clears an SRD regardless of the version of the agents in the SRD.

The gwlm command is added to the path during installation. On HP-UX systems, the command is in /opt/gwlm/bin/. On Microsoft Windows systems, the command is in C:\Program Files\ HP\Virtual Server Environment\bin\gwlm\ by default. However, a different path might have been selected at installation.

NOTE: You must be logged in as root on HP-UX or into an account that is a member of the Administrators group on Windows to run the commands below.

Automatic restart of gWLM’s managed nodes in SRDs (high availability) 43

Page 43
Image 43
HP UX 11i Workload Management (gWLM/WLM) Software Manually clearing an SRD, Node Failed to Rejoin SRD on Start-up event