Bug 1162727 - member weirdness when adding/removing nodes
Summary: member weirdness when adding/removing nodes
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: pacemaker
Version: 7.1
Hardware: Unspecified
OS: Unspecified
Priority: urgent
Severity: urgent
Target Milestone: rc
Target Release: ---
Assignee: Andrew Beekhof
QA Contact: cluster-qe@redhat.com
URL:
Whiteboard:
Depends On:
Blocks: 1193499 1244101
 
Reported: 2014-11-11 15:02 UTC by Radek Steiger
Modified: 2015-11-19 12:12 UTC
CC List: 9 users

Fixed In Version: pacemaker-1.1.13-3.el7
Doc Type: Bug Fix
Doc Text:
Previously, removing and adding cluster nodes in some cases caused conflicting node IDs, which in turn led to certain Pacemaker components terminating unexpectedly. This update overhauls peer cache management to ensure that recycling node IDs is handled correctly, and the described problem thus no longer occurs.
Clone Of:
Clones: 1193499 1244101
Environment:
Last Closed: 2015-11-19 12:12:02 UTC
Target Upstream Version:
Embargoed:


Attachments: (none)


Links
System:       Red Hat Product Errata
ID:           RHSA-2015:2383
Private:      0
Priority:     normal
Status:       SHIPPED_LIVE
Summary:      Moderate: pacemaker security, bug fix, and enhancement update
Last Updated: 2015-11-19 10:49:49 UTC

Description Radek Steiger 2014-11-11 15:02:03 UTC
> Description of problem:


I've recently been running into a recurring problem when repeatedly adding and removing cluster nodes.

The issue as seen from pacemaker.log looks like this:

attrd:  warning: crm_find_peer:     Node 'virt-127' and 'virt-131' share the same cluster nodeid: 3 virt-131
attrd:    error: crm_abort:     crm_find_peer: Forked child 20878 to record non-fatal assert at membership.c:394 : member weirdness


I was able to reproduce this multiple times with various sequences of steps, but here is how I got to the latest occurrence:

- I'm running a cluster of 7 nodes
- stopped virt-130
- removed node virt-126
- removed node virt-131
- removed node virt-127
- added back node virt-131

The initial node ID distribution:

1    virt-122
2    virt-126
3    virt-127
4    virt-128
5    virt-129
6    virt-130
7    virt-131

The final ID distribution after all additions/removals:

1    virt-122
3    virt-131
4    virt-128
5    virt-129
6    virt-130

As you can see, virt-131 has been given an ID previously used by node virt-127. In theory this shouldn't be an issue, as that node is long gone, but the logs disagree with me. Here is more context for the log lines above, captured AFTER the final ID distribution:

crmd:     info: pcmk_quorum_notification:      Membership 92: quorum retained (4)
pacemakerd:     info: pcmk_quorum_notification:      Membership 92: quorum retained (4)
crmd:     info: crm_get_peer:  Created entry ad118388-45cd-41f0-9e30-e468433632ba/0x16f6e30 for node virt-131/3 (5 total)
crmd:     info: crm_get_peer:  Node 3 is now known as virt-131
crmd:     info: peer_update_callback:  virt-131 is now (null)
crmd:     info: crm_get_peer:  Node 3 has uuid 3
crmd:   notice: crm_update_peer_state:         pcmk_quorum_notification: Node virt-131[3] - state is now member (was (null))
crmd:     info: peer_update_callback:  virt-131 is now member (was (null))
pacemakerd:     info: crm_get_peer:  Created entry 95269e93-a643-4b65-b0e7-f1d6561e5990/0x1bbfcd0 for node virt-131/3 (5 total)
pacemakerd:     info: crm_get_peer:  Node 3 is now known as virt-131
pacemakerd:     info: crm_get_peer:  Node 3 has uuid 3
pacemakerd:   notice: crm_update_peer_state:         pcmk_quorum_notification: Node virt-131[3] - state is now member (was (null))
cib:     info: cib_process_ping:      Reporting our current digest to virt-129: a376e2e985dcd9996add4737f8416e41 for 0.15.22 (0x825930 0)
cib:     info: pcmk_cpg_membership:   Joined[14.0] cib.3
cib:     info: pcmk_cpg_membership:   Member[14.0] cib.1
cib:     info: pcmk_cpg_membership:   Member[14.1] cib.3
cib:     info: crm_update_peer_proc:  pcmk_cpg_membership: Node virt-127[3] - corosync-cpg is now online
cib:     info: pcmk_cpg_membership:   Member[14.2] cib.4
cib:     info: pcmk_cpg_membership:   Member[14.3] cib.5
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 4/virt-128 = 0x2506230 - 7455b5f9-5dbf-4699-b29c-fc95c789dbbb
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 6/virt-130 = 0x2506370 - 59912113-f30a-4b72-b615-809c7c7f97d0
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 5/virt-129 = 0x25062d0 - d777a0c1-aac6-4998-bbc3-662541f2a22c
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 7/virt-131 = 0x2506440 - d1aa5661-f9d0-4bdc-93d5-3062e0b44659
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 1/virt-122 = 0x24830f0 - ad564e5c-7241-4f3f-8d2d-c3ac28fdcfcb
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 2/virt-126 = 0x2506610 - dff4adfe-62f0-4e4d-a3c0-5e07b012b6b7
attrd:     info: crm_dump_peer_hash:     crm_find_peer: Node 3/virt-127 = 0x2508f40 - 45e96061-3b19-4a19-9837-5586e0ef1752
attrd:  warning: crm_find_peer:     Node 'virt-127' and 'virt-131' share the same cluster nodeid: 3 virt-131
attrd:    error: crm_abort:     crm_find_peer: Forked child 20878 to record non-fatal assert at membership.c:394 : member weirdness

I do not know how pacemaker's internal peer cache works, but this log snippet hints that it somehow remains aware of nodes after they have been removed.

When checking the runtime CIB, the CIB stored in /var/lib/pacemaker, corosync quorum status, and the corosync.conf file, the virt-127 node was not mentioned in any of them (apart from a related fencing resource).
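
To illustrate what I suspect is going on, here is a hypothetical, standalone sketch - NOT pacemaker source code (the real cache lives in the membership.c referenced by the assert above) - of how an ID-keyed peer cache that is never purged when a node is removed would trip over a recycled nodeid:

/*
 * Hypothetical illustration only, not pacemaker code.
 * Build with: gcc sketch.c $(pkg-config --cflags --libs glib-2.0)
 */
#include <glib.h>
#include <stdio.h>
#include <string.h>

typedef struct {
    guint32 id;      /* corosync nodeid */
    char   *uname;   /* node name, e.g. "virt-127" */
} peer_t;

static void peer_free(gpointer data) {
    peer_t *p = data;
    g_free(p->uname);
    g_free(p);
}

static void add_peer(GHashTable *cache, guint32 id, const char *uname) {
    peer_t *p = g_new0(peer_t, 1);
    p->id = id;
    p->uname = g_strdup(uname);
    g_hash_table_replace(cache, GUINT_TO_POINTER(id), p);
}

/* Look up a peer by nodeid and warn when the cached name does not match,
 * i.e. when a stale entry for a removed node still owns a recycled id. */
static peer_t *find_peer(GHashTable *cache, guint32 id, const char *uname) {
    peer_t *p = g_hash_table_lookup(cache, GUINT_TO_POINTER(id));

    if (p != NULL && uname != NULL && strcmp(p->uname, uname) != 0) {
        printf("warning: Node '%s' and '%s' share the same cluster nodeid: %u\n",
               p->uname, uname, (unsigned) id);
    }
    return p;
}

int main(void) {
    GHashTable *cache = g_hash_table_new_full(g_direct_hash, g_direct_equal,
                                              NULL, peer_free);

    add_peer(cache, 3, "virt-127");   /* nodeid 3 originally owned by virt-127 */
    /* virt-127 is removed from the cluster but stays in the cache ... */
    find_peer(cache, 3, "virt-131");  /* ... and virt-131 re-joins with nodeid 3 */

    g_hash_table_destroy(cache);
    return 0;
}

If the entry for virt-127 had been dropped when the node was removed, the lookup for nodeid 3 would simply create a fresh entry for virt-131 instead of warning about a shared nodeid.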



> Version-Release number of selected component (if applicable):

pacemaker-1.1.12-9.el7.x86_64
corosync-2.3.4-3.el7.x86_64
pcs-0.9.135-1.el7.x86_64 


> How reproducible:

Relatively Easy


> Steps to Reproduce:

1) Set up a cluster of, e.g., 7 nodes, labeled n1 to n7
2) Stop node n6 with pcs
3) Remove node n2 with pcs
4) Remove node n7 with pcs
5) Remove node n3 with pcs
6) Add node n7 with pcs
7) Compare node ID assignments before and after these steps - if n7 was assigned a previously used ID when added back to the cluster, it will most likely trigger these errors.


> Actual results:

When the node is added back with a reused ID, membership errors appear in the logs and some pacemaker components start crashing:

attrd:  warning: crm_find_peer:     Node 'virt-127' and 'virt-131' share the same cluster nodeid: 3 virt-131
attrd:    error: crm_abort:     crm_find_peer: Forked child 20878 to record non-fatal assert at membership.c:394 : member weirdness 


> Expected results:

Reusing a node ID is not fatal and does not cause any membership errors.


> Additional info:

Feel free to reassign this bug to a different component if pacemaker turns out not to be at fault here.

Comment 5 Andrew Beekhof 2015-01-23 00:58:18 UTC
Patch for this is:
   https://github.com/beekhof/pacemaker/commit/ddccf97

Forgot to update bugzilla with build information

Comment 9 Andrew Beekhof 2015-03-27 05:24:58 UTC
Guilty.

The patch https://github.com/ClusterLabs/pacemaker/commit/bf15d36 assumed that 'node->state' was hooked up for the cib.  This seems not to be the case, so I'm testing a patch that would address that part so that the original patch will work.

Apologies.

Comment 10 Andrew Beekhof 2015-03-27 05:29:43 UTC
This looks better:

Mar 27 16:26:10 [27091] pcmk-1        cib:     info: crm_update_peer_proc: 	pcmk_cpg_membership: Node pcmk-3[103] - corosync-cpg is now offline
Mar 27 16:26:10 [27091] pcmk-1        cib:     info: cib_peer_update_callback: 	2 103 member
Mar 27 16:26:10 [27091] pcmk-1        cib:   notice: crm_update_peer_state: 	cib_peer_update_callback: Node pcmk-3[103] - state is now lost (was member)
Mar 27 16:26:10 [27091] pcmk-1        cib:     info: cib_peer_update_callback: 	1 103 lost
Mar 27 16:26:10 [27091] pcmk-1        cib:   notice: crm_reap_dead_member: 	Removing pcmk-3/103 from the membership list
Mar 27 16:26:10 [27091] pcmk-1        cib:   notice: reap_crm_member: 	Purged 1 peers with id=103 and/or uname=(null) from the membership cache

A node dropping out of CPG now automatically removes it from the cache, so there is nothing for the new node to conflict with.
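
Roughly speaking, the purge on CPG departure behaves along these lines (a standalone sketch with hypothetical names, not the actual patch; only the idea of purging matching cache entries as soon as membership reports the node lost is shown):

/*
 * Hypothetical sketch only, not the actual patch.
 * Build with: gcc purge.c $(pkg-config --cflags --libs glib-2.0)
 */
#include <glib.h>
#include <stdio.h>
#include <string.h>

typedef struct {
    guint32 id;      /* corosync nodeid */
    char   *uname;   /* node name */
} peer_t;

static void peer_free(gpointer data) {
    peer_t *p = data;
    g_free(p->uname);
    g_free(p);
}

/* Drop every cached peer matching the given id and/or uname. */
static guint purge_peer(GHashTable *cache, guint32 id, const char *uname) {
    GHashTableIter iter;
    gpointer value;
    guint removed = 0;

    g_hash_table_iter_init(&iter, cache);
    while (g_hash_table_iter_next(&iter, NULL, &value)) {
        peer_t *p = value;
        if ((id != 0 && p->id == id)
            || (uname != NULL && p->uname != NULL
                && strcmp(p->uname, uname) == 0)) {
            g_hash_table_iter_remove(&iter);   /* value destroy fn frees it */
            removed++;
        }
    }
    printf("Purged %u peers with id=%u and/or uname=%s from the cache\n",
           removed, (unsigned) id, uname ? uname : "(null)");
    return removed;
}

int main(void) {
    GHashTable *cache = g_hash_table_new_full(g_direct_hash, g_direct_equal,
                                              NULL, peer_free);
    peer_t *p = g_new0(peer_t, 1);

    p->id = 103;
    p->uname = g_strdup("pcmk-3");
    g_hash_table_replace(cache, GUINT_TO_POINTER(p->id), p);

    /* CPG reports pcmk-3 (nodeid 103) as lost: purge it right away, so a
     * later join that recycles nodeid 103 finds no stale entry. */
    purge_peer(cache, 103, NULL);

    g_hash_table_destroy(cache);
    return 0;
}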

Comment 11 Andrew Beekhof 2015-03-29 23:23:50 UTC
Also needed:

+ 0b98ef1: Fix: stonith-ng: Correctly track node state  (HEAD, 1.1)
+ 72b3a9a: Fix: stonith-ng: No reply is needed for CRM_OP_RM_NODE_CACHE

Comment 16 errata-xmlrpc 2015-11-19 12:12:02 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2015-2383.html

