Bug 2176079 - [MDR][Stretch Cluster] Monitor crash observed during upgrade from 5.3 to 5.3z1 GA Versions
Keywords:
Status: NEW
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RADOS
Version: 5.3
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 6.1z2
Assignee: Neha Ojha
QA Contact: Pawan
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2023-03-07 10:26 UTC by Pawan
Modified: 2023-07-12 12:36 UTC (History)
CC List: 5 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHCEPH-6236 0 None None None 2023-03-07 10:27:01 UTC

Comment 2 Brad Hubbard 2023-03-10 04:54:58 UTC
https://github.com/ceph/ceph/blob/19428c64d26b4faec85822b08771a37159f8442f/src/mon/OSDMonitor.cc#L14668-L14679

So it appears we are trying to manipulate a bucket that no longer exists in the
current crushmap and that is considered a fatal error.

From that code, we could get a much better idea of what's happening by
gathering the monitor log with debug_mon=20 and debug_paxos=20 set when this
occurs. Are you able to attempt to reproduce this with those log settings?
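
For anyone attempting the reproduction, here is a rough sketch of one way to apply the log settings requested above. It assumes the levels are set cluster-wide for all monitors through the centralized config (`ceph config set mon ...`) and removed afterwards with `ceph config rm`. The helper names are hypothetical and the script only prints the commands; it does not run them.

```python
import shlex

# Debug levels requested in the comment above.
DEBUG_SETTINGS = {"debug_mon": 20, "debug_paxos": 20}


def build_debug_commands(settings=DEBUG_SETTINGS, who="mon"):
    """Return one `ceph config set` argv list per debug option.

    `who="mon"` targets every monitor; this is an assumption, a single
    daemon such as "mon.a" could be targeted instead.
    """
    return [
        ["ceph", "config", "set", who, option, str(level)]
        for option, level in settings.items()
    ]


def build_reset_commands(settings=DEBUG_SETTINGS, who="mon"):
    """Return the `ceph config rm` argv lists that drop the overrides."""
    return [["ceph", "config", "rm", who, option] for option in settings]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before being run by hand.
    for argv in build_debug_commands() + build_reset_commands():
        print(shlex.join(argv))
```

The overrides should be removed once the crash has been captured, since level 20 logging makes the monitor logs grow quickly.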

Comment 4 Scott Ostapovicz 2023-07-12 12:36:43 UTC
Missed the 6.1 z1 window.  Retargeting to 6.1 z2.

