
Bug 325501

Summary: conga doesn't handle the cluster restart operation properly
Product: [Retired] Red Hat Cluster Suite
Component: conga
Version: 4
Status: CLOSED ERRATA
Severity: high
Priority: high
Hardware: All
OS: Linux
Reporter: Ryan McCabe <rmccabe>
Assignee: Ryan McCabe <rmccabe>
QA Contact: Brian Brock <bbrock>
CC: cluster-maint, jparsons, kanderso, pkennedy, rkenna, rmccabe
Fixed In Version: RHSA-2007-0983
Doc Type: Bug Fix
Last Closed: 2007-11-21 21:52:01 UTC

Description Ryan McCabe 2007-10-09 20:06:07 UTC
+++ This bug was initially created as a clone of Bug #315631 +++

Description of problem:
Problems occur when attempting the 'restart cluster' operation. From what I've
seen, different things will happen depending on what state the cluster is in
before the restart is executed.

Scenario 1: Start with the cluster stopped
Then, after attempting the 'restart' with conga, the cluster will usually start
properly on all nodes, but sometimes it will fail to start the service on one of
the nodes in the cluster.

Scenario 2: Start with the cluster started but no clvmd or rgmanager
Then, after attempting the 'restart' with conga, the cluster will most likely
end up with all nodes in the stopped state, apparently without even having
attempted the start operation, but sometimes it will start on a subset of the
cluster.

Scenario 3: Start with the cluster started and all services running
Then, after attempting the 'restart' with conga, the cluster will end up
completely stopped without a start having been attempted, hang in a loop due to
timing issues, or appear to restart on just a subset of the nodes.


I've run these commands manually countless times and never seen issues; that is:
for i in rgmanager clvmd cman
do
    service "$i" stop
done

for i in cman clvmd rgmanager
do
    service "$i" start
done



Version-Release number of selected component (if applicable):
2.6.18-48.el5
cman-2.0.73-1.el5
luci-0.10.0-6.el5

-- Additional comment from rmccabe on 2007-10-02 16:38 EST --
Problems occur because some nodes may be starting while others are still in the
process of stopping. According to sdake, this ought to work (at present it does
not work reliably), but we can easily work around it in conga.
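
One way to picture such a workaround is a two-phase restart with a barrier
between the phases: no node begins starting until every node has finished
stopping. The sketch below is an illustration only and is not the change that
went into conga. The function names (run_on_node, stop_node, start_node,
restart_cluster) are hypothetical, and it assumes passwordless root ssh to each
node, whereas conga actually drives nodes through its own agent.

```shell
#!/bin/sh
# Hypothetical helper: run a command on one node.
# Assumption: passwordless root ssh; conga does not work this way.
run_on_node() {
    node=$1
    shift
    ssh "root@$node" "$@"
}

# Stop the cluster services on one node, in the order used in the report.
stop_node() {
    for svc in rgmanager clvmd cman; do
        run_on_node "$1" service "$svc" stop
    done
}

# Start the cluster services on one node, in the reverse order.
start_node() {
    for svc in cman clvmd rgmanager; do
        run_on_node "$1" service "$svc" start
    done
}

# Restart the whole cluster in two phases. Each phase runs on all nodes in
# parallel; "wait" acts as the barrier, so no start is issued until every
# stop has returned.
restart_cluster() {
    for node in "$@"; do
        stop_node "$node" &
    done
    wait
    for node in "$@"; do
        start_node "$node" &
    done
    wait
}

# Usage (node names are placeholders):
#   restart_cluster node1 node2 node3
```

Starting the nodes in parallel rather than one at a time is a deliberate choice
in this sketch, so that no node is left waiting alone for the others to join.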

Comment 3 errata-xmlrpc 2007-11-21 21:52:01 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2007-0983.html