325501 – conga doesn't handle the cluster restart operation properly

Bug 325501 - conga doesn't handle the cluster restart operation properly

Summary: conga doesn't handle the cluster restart operation properly

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Cluster Suite
Classification:	Retired
Component:	conga
Sub Component:
Version:	4
Hardware:	All
OS:	Linux
Priority:	high
Severity:	high
Target Milestone:	---
Assignee:	Ryan McCabe
QA Contact:	Brian Brock
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2007-10-09 20:06 UTC by Ryan McCabe
Modified:	2009-04-16 19:53 UTC (History)
CC List:	6 users (show)
Fixed In Version:	RHSA-2007-0983
Clone Of:
Environment:
Last Closed:	2007-11-21 21:52:01 UTC
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHSA-2007:0983	0	normal	SHIPPED_LIVE	Moderate: conga security, bug fix, and enhancement update	2007-11-29 17:36:39 UTC

Description Ryan McCabe 2007-10-09 20:06:07 UTC

+++ This bug was initially created as a clone of Bug #315631 +++

Description of problem:
Problems occur when attemping the 'restart cluster' operation. From what I've
seen, different things will happen depending on what state the cluster is in
before the restart is executed.

Scenario 1: Start with the cluster stopped
Then, after attempting the 'restart' with conga, the cluster will usually start
properly on all nodes, sometimes it will fail to start the service on one of the
nodes in the cluster.

Scenario 2: Start with the cluster started but no clvmd or rgmanager
Then, after attempting the 'restart' with conga, the cluster will most likey end
up with all nodes in the stopped state and appears to not even have tried the
start operation, but sometimes it will start on a subset of the cluster.

Scenario 3: Start with the cluster started and all services running
Then, after attempting the 'restart' with conga, the cluster will either end up
completely stopped with out a start attempted, end up in some hung loop due to
timing issues, or the restart will appear to work on just a subset of the nodes.


I've tried these cmds manually countless times and never seen issues, that is:
for i in rgmanager clvmd cman
do 
service $i stop
done

for i in cman clvmd rgmaneger
do 
service $i start
done



Version-Release number of selected component (if applicable):
2.6.18-48.el5
cman-2.0.73-1.el5
luci-0.10.0-6.el5

-- Additional comment from rmccabe on 2007-10-02 16:38 EST --
Problems occur because some nodes may be starting while others are still in the
process of stopping. According to sdake, this ought to work (it definitely
doesn't consistently work presently), but we can easily work around it in conga.

Comment 3 errata-xmlrpc 2007-11-21 21:52:01 UTC

An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2007-0983.html

Note You need to log in before you can comment on or make changes to this bug.