Bug 2142141

Summary: Monitor crash - ceph_assert(m < ranks.size()) - observed when number of monitors were reduced from 5 to 3 using ceph orchestrator
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Kamoltat (Junior) Sirivadhna <ksirivad>
Component: RADOSAssignee: Kamoltat (Junior) Sirivadhna <ksirivad>
Status: CLOSED ERRATA QA Contact: Pawan <pdhiran>
Severity: high Docs Contact: Akash Raj <akraj>
Priority: unspecified    
Version: 5.0CC: akraj, akupczyk, amathuri, anrao, bhubbard, bkunal, ceph-eng-bugs, cephqe-warriors, choffman, ckulal, ekristov, gfarnum, hyelloji, ksirivad, lflores, mgowri, mmurthy, msaini, muagarwa, nojha, pasik, pdhange, pdhiran, pnataraj, rfriedma, rmandyam, rzarzyns, sostapov, sseshasa, sunnagar, vashastr, vereddy, vumrao
Target Milestone: ---   
Target Release: 5.3   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-16.2.10-88.el8cp Doc Type: Bug Fix
Doc Text:
.Ceph Monitors no longer crash when using `ceph orch apply mon <num>` command Previously, when the command `ceph orch apply mon <num>` was used to decrease monitors in a cluster, the monitors were removed before shutting down in `ceph-adm` causing the monitors to crash. With this fix, a sanity check is added to all code paths that check whether the peer rank is more than or equal to the size of the ranks from the monitor map. If the condition is satisfied, then skip certain operations that lead to the monitor crashing. The peer rank eventually resolves itself in the next version of the monitor map. The monitors no longer crash when removed from the monitor map before shutting down.
Story Points: ---
Clone Of: 1945266 Environment:
Last Closed: 2023-01-11 17:42:24 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1945266    
Bug Blocks: 1959686, 2121452, 2126049, 2142674, 2142983, 2150223    

Comment 5 Vikhyat Umrao 2022-11-29 16:27:02 UTC
*** Bug 2149286 has been marked as a duplicate of this bug. ***

Comment 40 Kamoltat (Junior) Sirivadhna 2023-01-11 09:14:46 UTC
Hi Akash,

here are the doc type and text.

Thanks

Comment 41 errata-xmlrpc 2023-01-11 17:42:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 5.3 security update and Bug Fix), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:0076