Bug 2310114 - [Stretch Mode] Cluster unresponsive and commands are stuck during Netsplit scenario b/w the two data sites
Summary: [Stretch Mode] Cluster unresponsive and commands are stuck during Netsplit sc...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RADOS
Version: 7.0
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: 7.1z2
Assignee: Kamoltat (Junior) Sirivadhna
QA Contact: Pawan
URL:
Whiteboard:
: 2310115 (view as bug list)
Depends On: 2249962
Blocks: 2267614 2298578 2298579
TreeView+ depends on / blocked
 
Reported: 2024-09-05 02:24 UTC by Kamoltat (Junior) Sirivadhna
Modified: 2025-03-08 04:25 UTC (History)
14 users (show)

Fixed In Version: ceph-18.2.1-237.el9cp
Doc Type: Bug Fix
Doc Text:
.Monitors no longer get stuck in elections during crash/shutdown tests Previously, the `disallowed_leaders` attribute of the MonitorMap was conditionally filled only when entering `stretch_mode`. However, there were instances wherein Monitors that just got revived would not enter `stretch_mode` right away because they would be in a `probing` state. This led to a mismatch in the `disallowed_leaders` set between the monitors across the cluster. Due to this, Monitors would fail to elect a leader, and the election would be stuck, resulting in Ceph being unresponsive. With this fix, Monitors do not have to be in `stretch_mode` to fill the `disallowed_leaders` attribute. Monitors no longer get stuck in elections during crash/shutdown tests.
Clone Of: 2249962
Environment:
Last Closed: 2024-11-07 14:39:19 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 67050 0 None None None 2024-09-05 02:24:06 UTC
Github ceph ceph pull 58687 0 None open reef: mon/ElectionLogic: tie-breaker mon ignore proposal from marked down mon 2024-09-05 02:24:06 UTC
Red Hat Issue Tracker RHCEPH-9701 0 None None None 2024-09-05 02:26:30 UTC
Red Hat Product Errata RHBA-2024:9010 0 None None None 2024-11-07 14:39:31 UTC

Description Kamoltat (Junior) Sirivadhna 2024-09-05 02:24:07 UTC
+++ This bug was initially created as a clone of Bug #2249962 +++

Comment 17 errata-xmlrpc 2024-11-07 14:39:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.1 security, bug fix, and enhancement updates), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:9010

Comment 18 Red Hat Bugzilla 2025-03-08 04:25:14 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days


Note You need to log in before you can comment on or make changes to this bug.