beautypg.com

Node failed to rejoin srd on start-up” event, Manually clearing an srd, Clearing an srd of a.02.50.00.04 (or later) agents – HP Matrix Operating Environment Software User Manual

Page 43: Clearing an srd of agents of any version

background image

For information on enabling and viewing these events, refer to Optimize

→Global Workload

Manager

→Events.

You can then view these events using the Event Lists item in the left pane of System Insight Manager.

The following sections explain how to handle some of the events.

“Node Failed to Rejoin SRD on Start-up” event

If you see the event “Node Failed to Rejoin SRD on Start-up”:
1.

Restart the gwlmagent on each managed node in the affected SRD:

# /opt/gwlm/bin/gwlmagent --restart

2.

Verify the agent rejoined the SRD by monitoring the Shared Resource Domain View in System
Insight Manager or by using the gwlm monitor command.

3.

If the problem persists, check the files /var/opt/gwlm/gwlmagent.log.0 and /var/
opt/gwlm/gwlmcmsd.log.0

for additional diagnostic messages.

“SRD Communication Issue” and “SRD Reformed with Partial Set of Nodes” events

NOTE:

Reforming with a partial set of nodes requires a minimum of three managed nodes in the

SRD.

NOTE:

“SRD Communication Issue” events are not enabled by default. To see these events,

configure your events in System Insight Manager through the HP Matrix OE visualization menu
bar using Tools

→Global Workload Manager→Events.

If you have an SRD containing n nodes and you get n - 1 of the “SRD Communication Issue” events
but no “SRD Reformed with Partial Set of Nodes” events within 5 minutes (assuming an allocation
interval of 15 seconds) of the first “SRD Communication Issue” event, you might need to restart the
gwlmagent

on each managed node in the affected SRD:

# /opt/gwlm/bin/gwlmagent --restart

Manually clearing an SRD

If gWLM is unable to reform an SRD, you can manually clear the SRD, as described in the following
section.

Clearing an SRD of A.02.50.00.04 (or later) agents

The following command is an advanced command for clearing an SRD. The recommended method
for typically removing a host from management is by using the gwlm undeploy command.

Starting with A.02.50.00.04 agents, you can manually clear an SRD with the following command:

# gwlm reset --host=host

where host specifies the host with the SRD to be cleared.

If this command does not work, use the procedure given in the following section.

Clearing an SRD of agents of any version

The procedure in this section clears an SRD regardless of the version of the agents in the SRD.

The gwlm command is added to the path during installation. On HP-UX systems, the command is
in /opt/gwlm/bin/. On Microsoft Windows systems, the command is in C:\Program Files\
HP\Virtual Server Environment\bin\gwlm\

by default. However, a different path might

have been selected at installation.

NOTE:

You must be logged in as root on HP-UX or into an account that is a member of the

Administrators group on Windows to run the commands below.

Automatic restart of gWLM’s managed nodes in SRDs (high availability)

43