Bug 2142141 - Monitor crash - ceph_assert(m < ranks.size()) - observed when number of monitors were reduced from 5 to 3 using ceph orchestrator
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RADOS
Version: 5.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 5.3
Assignee: Kamoltat (Junior) Sirivadhna
QA Contact: Pawan
Docs Contact: Akash Raj
URL:
Whiteboard:
Duplicates: 2149286
Depends On: 1945266
Blocks: 1959686 2121452 2126049 2142674 2142983 2150223
 
Reported: 2022-11-11 19:17 UTC by Kamoltat (Junior) Sirivadhna
Modified: 2023-01-17 07:52 UTC
CC List: 33 users

Fixed In Version: ceph-16.2.10-88.el8cp
Doc Type: Bug Fix
Doc Text:
.Ceph Monitors no longer crash when using the `ceph orch apply mon <num>` command
Previously, when the `ceph orch apply mon <num>` command was used to reduce the number of monitors in a cluster, the monitors were removed from the monitor map before they were shut down in `cephadm`, which caused the monitors to crash. With this fix, a sanity check added to all code paths checks whether the peer rank is greater than or equal to the size of the ranks from the monitor map. If it is, the operations that lead to the monitor crash are skipped. The peer rank eventually resolves itself in the next version of the monitor map. Monitors no longer crash when they are removed from the monitor map before shutting down.
Clone Of: 1945266
Environment:
Last Closed: 2023-01-11 17:42:24 UTC
Embargoed:
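
The sanity check described in the Doc Text can be illustrated with a minimal C++ sketch. This is not the code from the linked pull requests: `MonMapSketch`, `name_of`, and this `send_peer_ping` signature are hypothetical names chosen for illustration, and ranks are modelled as `std::size_t` for simplicity. It only shows the general shape of the guard, assuming the crash comes from looking up a peer rank that no longer exists in a smaller monitor map.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the monitor map: ranks[i] names the monitor at rank i.
struct MonMapSketch {
  std::vector<std::string> ranks;

  // Lookup that aborts on an out-of-range rank, the same shape as the
  // assertion quoted in the bug title.
  const std::string& name_of(std::size_t m) const {
    assert(m < ranks.size());
    return ranks[m];
  }
};

// Hypothetical ping helper. Returns false and does nothing when `peer` is not
// a valid rank in the current map, instead of reaching the assertion above.
bool send_peer_ping(const MonMapSketch& map, std::size_t peer,
                    std::vector<std::string>& pinged) {
  if (peer >= map.ranks.size()) {
    // Stale rank left over from an older, larger map: skip the ping and let
    // the next monitor map version settle the peer's rank.
    return false;
  }
  pinged.push_back(map.name_of(peer));
  return true;
}
```

With a three-monitor map, a ping to rank 2 goes through, while a ping to rank 4 (a rank that only existed in the earlier five-monitor map) is skipped rather than aborting the process.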


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | ceph ceph pull 48320 | 0 | None | Merged | pacific: mon/Elector: Added sanity check when pinging a peer monitor | 2022-11-11 19:20:38 UTC
Github | ceph ceph pull 49444 | 0 | None | open | pacific: mon/Elector.cc: Compress peer >= rank_size sanity check into send_pee… | 2022-12-16 17:51:03 UTC
Red Hat Issue Tracker | RHCEPH-5599 | 0 | None | None | None | 2022-11-11 19:38:56 UTC
Red Hat Product Errata | RHSA-2023:0076 | 0 | None | None | None | 2023-01-11 17:43:38 UTC

Comment 5 Vikhyat Umrao 2022-11-29 16:27:02 UTC
*** Bug 2149286 has been marked as a duplicate of this bug. ***

Comment 40 Kamoltat (Junior) Sirivadhna 2023-01-11 09:14:46 UTC
Hi Akash,

here are the doc type and text.

Thanks

Comment 41 errata-xmlrpc 2023-01-11 17:42:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 5.3 security update and Bug Fix), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:0076

