“Node Failed to Rejoin SRD on Start-up” event, Manually clearing an SRD, Clearing an SRD of A.02.50.00.04 (or later) agents – HP Matrix Operating Environment Software User Manual
For information on enabling and viewing these events, refer to Optimize→Global Workload Manager→Events.
You can then view these events using the Event Lists item in the left pane of System Insight Manager.
The following sections explain how to handle some of the events.
“Node Failed to Rejoin SRD on Start-up” event
If you see the event “Node Failed to Rejoin SRD on Start-up”:
1. Restart the gwlmagent on each managed node in the affected SRD:
# /opt/gwlm/bin/gwlmagent --restart
2. Verify that the agent rejoined the SRD by monitoring the Shared Resource Domain View in System Insight Manager or by using the gwlm monitor command.
3. If the problem persists, check the files /var/opt/gwlm/gwlmagent.log.0 and /var/opt/gwlm/gwlmcmsd.log.0 for additional diagnostic messages.
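Step 3 can be made a little quicker with a small helper that prints the tail of each log file. The following is only a sketch: the log paths are the ones named above, while the function name show_gwlm_logs and the line count are illustrative and not part of gWLM.

```shell
#!/bin/sh
# Sketch: print the last N lines of each log file given, if it is readable.
# show_gwlm_logs is a hypothetical helper name, not a gWLM command.

show_gwlm_logs() {
    lines="$1"   # number of trailing lines to show per file
    shift
    for log in "$@"; do
        if [ -r "$log" ]; then
            echo "== $log (last $lines lines) =="
            tail -n "$lines" "$log"
        else
            echo "== $log not found or not readable =="
        fi
    done
}

# Example call, using the log paths from step 3:
# show_gwlm_logs 20 /var/opt/gwlm/gwlmagent.log.0 /var/opt/gwlm/gwlmcmsd.log.0
```

A file that is missing on the current system is reported rather than treated as an error, so the same call can be used on any system involved.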
“SRD Communication Issue” and “SRD Reformed with Partial Set of Nodes” events
NOTE: Reforming with a partial set of nodes requires a minimum of three managed nodes in the SRD.
NOTE: “SRD Communication Issue” events are not enabled by default. To see these events, configure your events in System Insight Manager through the HP Matrix OE visualization menu bar using Tools→Global Workload Manager→Events.
If you have an SRD containing n nodes and you get n - 1 of the “SRD Communication Issue” events
but no “SRD Reformed with Partial Set of Nodes” events within 5 minutes (assuming an allocation
interval of 15 seconds) of the first “SRD Communication Issue” event, you might need to restart the
gwlmagent on each managed node in the affected SRD:
# /opt/gwlm/bin/gwlmagent --restart
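The rule above can be written as a small check. This is a sketch under stated assumptions: the event counts are ones you read yourself from the Event Lists item for the 5-minute window, and the function name should_restart_agents is hypothetical, not a gWLM command.

```shell
#!/bin/sh
# Sketch of the decision rule: n - 1 "SRD Communication Issue" events and
# no "SRD Reformed with Partial Set of Nodes" event within the window.
# Arguments (all counted by hand from the event list):
#   $1 = number of nodes in the SRD (n)
#   $2 = "SRD Communication Issue" events seen in the window
#   $3 = "SRD Reformed with Partial Set of Nodes" events seen in the window
should_restart_agents() {
    nodes="$1"
    comm_events="$2"
    reformed_events="$3"
    [ "$comm_events" -eq $((nodes - 1)) ] && [ "$reformed_events" -eq 0 ]
}

# Example: a 4-node SRD with 3 communication events and no reform event.
if should_restart_agents 4 3 0; then
    echo "restart gwlmagent on each managed node in the affected SRD"
fi
```

The function only encodes the condition stated in the text. It does not query System Insight Manager or gWLM for events.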
Manually clearing an SRD
If gWLM is unable to reform an SRD, you can manually clear the SRD, as described in the following
section.
Clearing an SRD of A.02.50.00.04 (or later) agents
The following command is an advanced command for clearing an SRD. The recommended method
for removing a host from management is typically the gwlm undeploy command.
Starting with A.02.50.00.04 agents, you can manually clear an SRD with the following command:
# gwlm reset --host=host
where host specifies the host with the SRD to be cleared.
If this command does not work, use the procedure given in the following section.
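A wrapper around the reset command might look like the following sketch. It assumes that gwlm reset exits with a nonzero status when it fails, which this manual does not state. The GWLM_CMD variable and the clear_srd_on_host function are hypothetical names introduced here so that the command can be previewed without being run.

```shell
#!/bin/sh
# Sketch: run "gwlm reset --host=HOST" and point to the fallback on failure.
# ASSUMPTION: gwlm reset returns a nonzero exit status when it does not work.
# GWLM_CMD is an illustrative override; set it to "echo gwlm" for a dry run.
GWLM_CMD="${GWLM_CMD:-gwlm}"

clear_srd_on_host() {
    host="$1"
    if [ -z "$host" ]; then
        echo "usage: clear_srd_on_host HOST" >&2
        return 2
    fi
    if $GWLM_CMD reset --host="$host"; then
        echo "gwlm reset completed for $host"
    else
        echo "gwlm reset failed for $host; use the procedure for agents of any version" >&2
        return 1
    fi
}

# Dry run (prints the command instead of executing it):
# GWLM_CMD="echo gwlm"; clear_srd_on_host myhost
```

Because gwlm reset is described as an advanced command, previewing it with the dry-run setting before running it for real is a cautious default.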
Clearing an SRD of agents of any version
The procedure in this section clears an SRD regardless of the version of the agents in the SRD.
The gwlm command is added to the path during installation. On HP-UX systems, the command is
in /opt/gwlm/bin/. On Microsoft Windows systems, the command is in
C:\Program Files\HP\Virtual Server Environment\bin\gwlm\ by default. However, a different path
might have been selected at installation.
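On HP-UX, a quick way to confirm which gwlm will run is to check the path first and fall back to the default location. The following is a minimal sketch: find_gwlm and GWLM_DEFAULT are hypothetical names used for illustration, and the default path is the HP-UX location given above.

```shell
#!/bin/sh
# Sketch: print the location of the gwlm command, preferring the PATH and
# falling back to the default HP-UX install directory named in the text.
# GWLM_DEFAULT is an illustrative override for non-default installations.
find_gwlm() {
    default="${GWLM_DEFAULT:-/opt/gwlm/bin/gwlm}"
    if command -v gwlm >/dev/null 2>&1; then
        command -v gwlm
    elif [ -x "$default" ]; then
        echo "$default"
    else
        echo "gwlm not found on PATH or at $default" >&2
        return 1
    fi
}

# Example:
# GWLM=$(find_gwlm) && echo "using $GWLM"
```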
NOTE: You must be logged in as root on HP-UX or into an account that is a member of the
Administrators group on Windows to run the commands below.