Bug 1067028 - CPG membership may be inconsistent after node pause
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: corosync
Version: 7.0
Hardware: Unspecified
OS: Unspecified
Priority: urgent
Severity: urgent
Target Milestone: rc
Target Release: ---
Assignee: Jan Friesse
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2014-02-19 14:25 UTC by Jan Friesse
Modified: 2014-06-18 00:31 UTC

Fixed In Version: corosync-2.3.3-2.el7
Doc Type: Bug Fix
Doc Text:
Cause: Corosync on one node is paused. On the other nodes, cpg clients are killed.
Consequence: After resume, corosync on the paused node never finds out that cpg clients on the other nodes were killed and still believes they are alive. The cpg membership differs between nodes.
Fix: Make sure that corosync properly updates its internal information about other nodes' cpg clients.
Result: Killed cpg clients are properly removed from internal structures, so cpg membership is identical on all nodes.
Clone Of:
: 1067043
Environment:
Last Closed: 2014-06-13 12:27:43 UTC
Target Upstream Version:
Embargoed:


Attachments
Proposed patch - part 1 - cpg: Refactor mh_req_exec_cpg_procleave (3.00 KB, patch)
2014-02-19 14:27 UTC, Jan Friesse
Proposed patch - part 2 - cpg: Make sure nodid is always logged as hex num (2.08 KB, patch)
2014-02-19 14:28 UTC, Jan Friesse
Proposed patch - part3 - cpg: Make sure left nodes are really removed (2.44 KB, patch)
2014-02-19 14:29 UTC, Jan Friesse

Description Jan Friesse 2014-02-19 14:25:23 UTC
Description of problem:
When a node is paused and a cpg process on another node exits in the meantime,
the paused node does not update its membership correctly after resume, so the
exited cpg process is still visible on the previously paused node.

Version-Release number of selected component (if applicable):
Any

How reproducible:
100%

Steps to Reproduce:
1. Start 3 nodes and run a cpg client (for example testcpg) on every node
2. Pause one of the nodes
3. On another node, stop the cpg client and start it again
4. Unpause the paused node
5. On the previously paused node, membership will be different than on the other nodes (check with corosync-cpgtool). The previously paused node will have 4 members (2 of them from the node where the cpg client was stopped and started again), while the other nodes will have 3 members (exactly one client per node)
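
The steps above can be modelled with a small toy simulation. This is only a sketch of the observed symptom, not corosync's actual code: the names Member, NodeView, resync and run_scenario are hypothetical, and the assumption that a resumed node merges the other nodes' join lists into its old view is a simplification made for illustration. The drop_stale switch stands in for the idea behind the proposed fix (really removing left entries).

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Member:
    """One cpg client, identified by the node it runs on and its pid."""
    nodeid: int
    pid: int


@dataclass
class NodeView:
    """Hypothetical per-node view of cpg membership (not a corosync structure)."""
    nodeid: int
    paused: bool = False
    members: set = field(default_factory=set)

    def on_join(self, member):
        # A paused node does not process any messages.
        if not self.paused:
            self.members.add(member)

    def on_leave(self, member):
        if not self.paused:
            self.members.discard(member)

    def resync(self, joinlists, drop_stale):
        """Merge join lists received after resume.

        joinlists maps nodeid -> set of clients currently alive on that node.
        With drop_stale=False, old entries for that node are kept (the bug).
        With drop_stale=True, they are removed first (the intended behavior).
        """
        for nodeid, live in joinlists.items():
            if drop_stale:
                self.members = {m for m in self.members if m.nodeid != nodeid}
            self.members |= live


def run_scenario(drop_stale):
    views = {n: NodeView(n) for n in (1, 2, 3)}

    def broadcast(event, member):
        for view in views.values():
            getattr(view, event)(member)

    # Step 1: one cpg client per node.
    for n in (1, 2, 3):
        broadcast("on_join", Member(n, 100 + n))
    # Step 2: pause node 3.
    views[3].paused = True
    # Step 3: the client on node 1 is stopped and started again (new pid).
    broadcast("on_leave", Member(1, 101))
    broadcast("on_join", Member(1, 201))
    # Step 4: unpause node 3 and let it merge the join lists of nodes 1 and 2.
    views[3].paused = False
    joinlists = {
        n: {m for m in views[1].members if m.nodeid == n} for n in (1, 2)
    }
    views[3].resync(joinlists, drop_stale)
    # Step 5: number of members each node believes exist.
    return {n: len(view.members) for n, view in views.items()}


if __name__ == "__main__":
    print("stale entries kept:   ", run_scenario(drop_stale=False))
    print("stale entries removed:", run_scenario(drop_stale=True))
```

In this model the previously paused node ends up with 4 members while the others have 3 when stale entries are kept, which matches step 5; removing them first brings all three nodes back to 3.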

Actual results:
Membership contains non-existing process

Expected results:
Membership contains only existing processes
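
The expected result can be checked mechanically once each node's membership has been collected (for example by hand from corosync-cpgtool output). The sketch below assumes the membership has already been turned into sets of (nodeid, pid) pairs; it does not parse any tool output, and the function name membership_diff and the sample values are hypothetical.

```python
def membership_diff(views):
    """Return, per node, the entries that not every node agrees on.

    views maps a node name to the set of (nodeid, pid) pairs that node
    reports. An empty result means all nodes report the same membership.
    """
    if not views:
        return {}
    agreed = set.intersection(*(set(v) for v in views.values()))
    return {
        node: set(seen) - agreed
        for node, seen in views.items()
        if set(seen) - agreed
    }


if __name__ == "__main__":
    # Illustrative values only: node3 still lists the exited client (1, 101).
    views = {
        "node1": {(1, 201), (2, 102), (3, 103)},
        "node2": {(1, 201), (2, 102), (3, 103)},
        "node3": {(1, 101), (1, 201), (2, 102), (3, 103)},
    }
    print(membership_diff(views))
```

An entry reported here is either a stale process on that node or one that another node is missing, so a non-empty result only says the views disagree, not which side is wrong.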

Additional info:

Comment 1 Jan Friesse 2014-02-19 14:27:10 UTC
Created attachment 865089
Proposed patch - part 1 - cpg: Refactor mh_req_exec_cpg_procleave

Comment 2 Jan Friesse 2014-02-19 14:28:45 UTC
Created attachment 865090
Proposed patch - part 2 - cpg: Make sure nodid is always logged as hex num

Comment 3 Jan Friesse 2014-02-19 14:29:15 UTC
Created attachment 865091
Proposed patch - part3 - cpg: Make sure left nodes are really removed

Comment 9 Ludek Smid 2014-06-13 12:27:43 UTC
This request was resolved in Red Hat Enterprise Linux 7.0.

Contact your manager or support representative in case you have further questions about the request.

