Bug 510510 - cluster view inconsistent after "service cman stop; service cman start"
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: cman
Version: 5.3
Hardware: All
OS: Linux
Priority: urgent
Severity: medium
Target Milestone: rc
Target Release: ---
Assignee: Christine Caulfield
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On: 472786
Blocks:
Reported: 2009-07-09 15:06 UTC by Benjamin Kahn
Modified: 2016-04-26 13:55 UTC (History)

Fixed In Version: cman-2.0.98-1.el5_3.7
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2009-08-03 07:40:41 UTC
Target Upstream Version:
Embargoed:


Links
System                  ID              Private  Priority  Status        Summary              Last Updated
Red Hat Product Errata  RHBA-2009:1192  0        normal    SHIPPED_LIVE  cman bug fix update  2009-08-03 07:40:36 UTC

Description Benjamin Kahn 2009-07-09 15:06:55 UTC
This bug has been copied from bug #472786 and has been proposed
to be backported to 5.3 z-stream (EUS).

Comment 3 Christine Caulfield 2009-07-13 12:04:35 UTC
cherry-picked into the RHEL53 branch

commit 2bac6a6e1a24cd56f08bb89299296bf2a9a6ea58
Author: Christine Caulfield <ccaulfie>
Date:   Wed Dec 3 10:46:20 2008 +0000

    cman: Fix inconsistent state if a node leaves/joins quickly

Comment 5 Nate Straz 2009-07-24 15:20:17 UTC
I am still able to get the cluster in an inconsistent state after running "service cman restart" on one node.

[nstraz@z ~]$ for i in z{1,2,3,4}; do echo "$i:"; qarsh -l root $i cman_tool nodes; done
z1:
Node  Sts   Inc   Joined               Name
   1   M   1228   2009-07-24 08:58:41  z1
   2   M   1240   2009-07-24 08:59:11  z2
   3   M   1232   2009-07-24 08:58:41  z3
   4   M   1232   2009-07-24 08:58:41  z4
z2:
Node  Sts   Inc   Joined               Name
   1   M   1240   2009-07-24 08:59:12  z1
   2   M   1236   2009-07-24 08:59:12  z2
   3   M   1240   2009-07-24 08:59:12  z3
   4   M   1240   2009-07-24 08:59:12  z4
z3:
Node  Sts   Inc   Joined               Name
   1   M   1232   2009-07-24 08:58:41  z1
   2   M   1240   2009-07-24 08:59:11  z2
   3   M   1216   2009-07-24 08:32:16  z3
   4   M   1224   2009-07-24 08:32:17  z4
z4:
Node  Sts   Inc   Joined               Name
   1   X   1224                        z1
   2   X   1224                        z2
   3   M   1224   2009-07-24 08:32:18  z3
   4   M   1220   2009-07-24 08:32:17  z4
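
A disagreement like the one above can be spotted mechanically. The following is a minimal sketch, assuming the `cman_tool nodes` output from each node has already been captured as text. The helper names `parse_nodes` and `find_disagreements` are hypothetical, and only the Sts column is compared, on the assumption that a node's membership status is the field every view should agree on. The sample data is the z3 and z4 output copied from this comment.

```python
# Sketch only: parse_nodes and find_disagreements are hypothetical helpers,
# not part of cman. Only the Sts (status) column is compared between views.

Z3_OUTPUT = """\
Node  Sts   Inc   Joined               Name
   1   M   1232   2009-07-24 08:58:41  z1
   2   M   1240   2009-07-24 08:59:11  z2
   3   M   1216   2009-07-24 08:32:16  z3
   4   M   1224   2009-07-24 08:32:17  z4
"""

Z4_OUTPUT = """\
Node  Sts   Inc   Joined               Name
   1   X   1224                        z1
   2   X   1224                        z2
   3   M   1224   2009-07-24 08:32:18  z3
   4   M   1220   2009-07-24 08:32:17  z4
"""


def parse_nodes(output):
    """Parse `cman_tool nodes` text into {node_name: status}."""
    view = {}
    for line in output.splitlines():
        fields = line.split()
        # Skip blank lines and the "Node Sts Inc Joined Name" header.
        if len(fields) < 4 or not fields[0].isdigit():
            continue
        # Status is the second column; the name is always the last one,
        # even when the Joined column is empty (as it is for status X).
        view[fields[-1]] = fields[1]
    return view


def find_disagreements(views):
    """Return {node_name: {observer: status}} where observers disagree."""
    names = set()
    for view in views.values():
        names.update(view)
    disagreements = {}
    for name in sorted(names):
        seen = {observer: view.get(name) for observer, view in views.items()}
        if len(set(seen.values())) > 1:
            disagreements[name] = seen
    return disagreements


if __name__ == "__main__":
    views = {"z3": parse_nodes(Z3_OUTPUT), "z4": parse_nodes(Z4_OUTPUT)}
    for name, seen in find_disagreements(views).items():
        print(name, seen)
```

With the sample data, z1 and z2 are flagged because z3 reports them as M while z4 reports them as X.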

Comment 6 Christine Caulfield 2009-07-28 09:47:48 UTC
Given that bz#472786 has been verified as fixed, and this is the same bug, are we sure that the Z-stream package includes that patch?

Comment 7 Chris Feist 2009-07-29 17:33:03 UTC
Yes, I talked with Nate, and he is using cman-2.0.98-1.el5_3.7, which I verified contains the patch:

-       if (node->state == NODESTATE_DEAD) {
+       if (node->state == NODESTATE_DEAD || node->state == NODESTATE_LEAVING) {

Comment 8 Nate Straz 2009-07-29 21:20:58 UTC
I reinstalled the systems and tried the test again, and it passed this time around.  Last time I was working with 3-, 4-, and 5-node clusters; perhaps I didn't have all nodes updated correctly.  Calling this verified now.
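
One way to rule out a partially updated cluster before retesting is to compare the installed package string on every node against the fixed version. This is a minimal sketch, assuming the package string for each node has already been collected (for example from `rpm -q cman`). The helper name `outdated_nodes` and the sample package strings in the usage section are hypothetical and are not taken from the test systems in this report.

```python
# Sketch only: outdated_nodes is a hypothetical helper. The expected
# version is the "Fixed In Version" value from this bug report.

EXPECTED = "cman-2.0.98-1.el5_3.7"


def outdated_nodes(installed, expected=EXPECTED):
    """Return sorted names of nodes whose package string differs from expected."""
    return sorted(
        node for node, package in installed.items()
        if package.strip() != expected
    )


if __name__ == "__main__":
    # Hypothetical inventory: z3 is shown as not yet updated.
    installed = {
        "z1": "cman-2.0.98-1.el5_3.7",
        "z2": "cman-2.0.98-1.el5_3.7",
        "z3": "cman-2.0.98-1.el5",
        "z4": "cman-2.0.98-1.el5_3.7",
    }
    print(outdated_nodes(installed))
```

An empty result means every node reports the fixed package; any listed node should be updated before the restart test is repeated.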

Comment 10 errata-xmlrpc 2009-08-03 07:40:41 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2009-1192.html

