Bug 226700 - cman cluster needs restart when going from >=3 to 2 nodes and 2 to >= 3 nodes
Summary: cman cluster needs restart when going from >=3 to 2 nodes and 2 to >= 3 nodes
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: conga   
(Show other bugs)
Version: 5.0
Hardware: All
OS: Linux
medium
medium
Target Milestone: ---
: ---
Assignee: Ryan McCabe
QA Contact: Corey Marthaler
URL:
Whiteboard:
Keywords:
Depends On: 240508
Blocks:
TreeView+ depends on / blocked
 
Reported: 2007-02-01 00:06 UTC by Paul Kennedy
Modified: 2015-04-20 00:47 UTC (History)
7 users (show)

Fixed In Version: RHSA-2007-0640
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2007-11-07 15:37:00 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)


External Trackers
Tracker ID Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2007:0640 normal SHIPPED_LIVE Moderate: conga security, bug fix, and enhancement update 2007-11-08 14:19:24 UTC

Description Paul Kennedy 2007-02-01 00:06:59 UTC
Description of problem:
Tried to delete a node and add it back into the cluster. Received this error
message when trying to add the node back in:

The following errors occurred:
    * A Ricci error occurred on tng3-4.lab.msp.redhat.com: ccs_tool failed
    * Unable to update the cluster node list for my_rh_cluster

Further investigation revealed this:
[root@tng3-3 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   X     12                        tng3-5.lab.msp.redhat.com
   2   M     12   2007-01-31 17:26:00  tng3-4.lab.msp.redhat.com
   3   M      4   2007-01-31 17:26:00  tng3-3.lab.msp.redhat.com

[root@tng3-4 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   X      8                        tng3-5.lab.msp.redhat.com
   2   M      4   2007-01-31 12:25:03  tng3-4.lab.msp.redhat.com
   3   M     12   2007-01-31 12:25:09  tng3-3.lab.msp.redhat.com


Version-Release number of selected component (if applicable):


How reproducible:
Create three-node cluster. Remove one node and add it back.

Steps to Reproduce:
1. 
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Paul Kennedy 2007-02-01 00:13:28 UTC
Tried adding different node. Same results.

Comment 2 Ryan McCabe 2007-02-01 15:03:36 UTC
This looks like a bug somewhere deeper in the cluster stack. Could you try to
manually propagate the conf file with ccs_tool and see what the error message is?

Comment 4 Ryan McCabe 2007-02-01 16:14:10 UTC
After I remove the node, then try to add it back manually, here's what I see:

[root@huey cluster]# ccs_tool update .cluster.conf 
Unable to open connection to dewey.lab.boston.redhat.com: Bad file descriptor

Failed to update config file.
[root@huey cluster]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M     12   2007-02-01 11:06:44  louey.lab.boston.redhat.com
   2   M      4   2007-02-01 11:06:44  huey.lab.boston.redhat.com
   3   X     12                        dewey.lab.boston.redhat.com

[root@dewey cluster]# pwd;ls -la
/etc/cluster
total 32
drwxr-xr-x  2 root root  4096 Feb  1 11:10 .
drwxr-xr-x 91 root root 12288 Feb  1 11:07 ..
-rw-r-----  1 root root   461 Feb  1 11:07 .cluster.conf

[root@dewey cluster]# ps auxww|egrep "[a]isexec|[c]cs|[c]man|[c]lurg";service
cman status
ccsd is stopped

Anyone have any insight into what's going wrong here?

Comment 5 Ryan McCabe 2007-02-01 16:22:16 UTC
Sorry. About 5 minutes after I posted here, i realized that the cluster needs a
restart when going from 3->2 and 2->3 nodes. That's what's causing the problem.

Comment 6 Kiersten (Kerri) Anderson 2007-04-23 17:06:27 UTC
Fixing Product Name.  Cluster Suite was merged into Enterprise Linux for version
5.0.

Comment 7 Ryan McCabe 2007-05-18 16:44:31 UTC
Marking this modified and depending on
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=240508

That fix will remove the need for any special handling on the part of conga for
clusters going from > 2 nodes to 2 and from 2 to > 2.

Comment 10 errata-xmlrpc 2007-11-07 15:37:00 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2007-0640.html



Note You need to log in before you can comment on or make changes to this bug.