Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
For bugs related to Red Hat Enterprise Linux 5 product line. The current stable release is 5.10. For Red Hat Enterprise Linux 6 and above, please visit Red Hat JIRA https://issues.redhat.com/secure/CreateIssue!default.jspa?pid=12332745 to report new issues.

Bug 226700

Summary: cman cluster needs restart when going from >=3 to 2 nodes and 2 to >= 3 nodes
Product: Red Hat Enterprise Linux 5 Reporter: Paul Kennedy <pkennedy>
Component: congaAssignee: Ryan McCabe <rmccabe>
Status: CLOSED ERRATA QA Contact: Corey Marthaler <cmarthal>
Severity: medium Docs Contact:
Priority: medium    
Version: 5.0CC: adstrong, cluster-maint, jbrassow, kanderso, kupcevic, ldimaggi, rmccabe
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: RHSA-2007-0640 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-11-07 15:37:00 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 240508    
Bug Blocks:    

Description Paul Kennedy 2007-02-01 00:06:59 UTC
Description of problem:
Tried to delete a node and add it back into the cluster. Received this error
message when trying to add the node back in:

The following errors occurred:
    * A Ricci error occurred on tng3-4.lab.msp.redhat.com: ccs_tool failed
    * Unable to update the cluster node list for my_rh_cluster

Further investigation revealed this:
[root@tng3-3 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   X     12                        tng3-5.lab.msp.redhat.com
   2   M     12   2007-01-31 17:26:00  tng3-4.lab.msp.redhat.com
   3   M      4   2007-01-31 17:26:00  tng3-3.lab.msp.redhat.com

[root@tng3-4 ~]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   X      8                        tng3-5.lab.msp.redhat.com
   2   M      4   2007-01-31 12:25:03  tng3-4.lab.msp.redhat.com
   3   M     12   2007-01-31 12:25:09  tng3-3.lab.msp.redhat.com


Version-Release number of selected component (if applicable):


How reproducible:
Create three-node cluster. Remove one node and add it back.

Steps to Reproduce:
1. 
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Paul Kennedy 2007-02-01 00:13:28 UTC
Tried adding different node. Same results.

Comment 2 Ryan McCabe 2007-02-01 15:03:36 UTC
This looks like a bug somewhere deeper in the cluster stack. Could you try to
manually propagate the conf file with ccs_tool and see what the error message is?

Comment 4 Ryan McCabe 2007-02-01 16:14:10 UTC
After I remove the node, then try to add it back manually, here's what I see:

[root@huey cluster]# ccs_tool update .cluster.conf 
Unable to open connection to dewey.lab.boston.redhat.com: Bad file descriptor

Failed to update config file.
[root@huey cluster]# cman_tool nodes
Node  Sts   Inc   Joined               Name
   1   M     12   2007-02-01 11:06:44  louey.lab.boston.redhat.com
   2   M      4   2007-02-01 11:06:44  huey.lab.boston.redhat.com
   3   X     12                        dewey.lab.boston.redhat.com

[root@dewey cluster]# pwd;ls -la
/etc/cluster
total 32
drwxr-xr-x  2 root root  4096 Feb  1 11:10 .
drwxr-xr-x 91 root root 12288 Feb  1 11:07 ..
-rw-r-----  1 root root   461 Feb  1 11:07 .cluster.conf

[root@dewey cluster]# ps auxww|egrep "[a]isexec|[c]cs|[c]man|[c]lurg";service
cman status
ccsd is stopped

Anyone have any insight into what's going wrong here?

Comment 5 Ryan McCabe 2007-02-01 16:22:16 UTC
Sorry. About 5 minutes after I posted here, i realized that the cluster needs a
restart when going from 3->2 and 2->3 nodes. That's what's causing the problem.

Comment 6 Kiersten (Kerri) Anderson 2007-04-23 17:06:27 UTC
Fixing Product Name.  Cluster Suite was merged into Enterprise Linux for version
5.0.

Comment 7 Ryan McCabe 2007-05-18 16:44:31 UTC
Marking this modified and depending on
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=240508

That fix will remove the need for any special handling on the part of conga for
clusters going from > 2 nodes to 2 and from 2 to > 2.

Comment 10 errata-xmlrpc 2007-11-07 15:37:00 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2007-0640.html