Bug 2305677 - [7.1 backport] [CEE]Ceph mgr crashed after a mgr failover with the message mgr operator() Failed to run module in active mode ('cephadm') [NEEDINFO]
Summary: [7.1 backport] [CEE]Ceph mgr crashed after a mgr failover with the message mg...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Cephadm
Version: 7.1
Hardware: x86_64
OS: Linux
urgent
urgent
Target Milestone: ---
: 7.1z2
Assignee: Adam King
QA Contact: skanta
URL:
Whiteboard:
: 2317045 (view as bug list)
Depends On: 2305678
Blocks:
TreeView+ depends on / blocked
 
Reported: 2024-08-19 09:54 UTC by Varuni Sawant
Modified: 2025-03-05 06:49 UTC (History)
15 users (show)

Fixed In Version: ceph-18.2.1-235.el9cp
Doc Type: Bug Fix
Doc Text:
Previously, cephadm osd removal queue did not have a parameter for original_weight. As a result, the cephadm module would crash during OSD removal. With this fix, the original_weight field is added as an attribute for the osd removal queue and the cephadm no longer crashes during OSD removal.
Clone Of:
Environment:
Last Closed: 2024-11-07 14:39:28 UTC
Embargoed:
gabrioux: needinfo? (allee)
adking: needinfo? (vasawant)
adking: needinfo? (allee)


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 67329 0 None None None 2024-08-20 18:35:16 UTC
Github ceph ceph pull 59318 0 None open mgr/cephadm: add "original_weight" parameter to OSD class 2024-08-20 18:35:16 UTC
Red Hat Issue Tracker RHCEPH-9453 0 None None None 2024-08-21 05:35:39 UTC
Red Hat Product Errata RHBA-2024:9010 0 None None None 2024-11-07 14:39:31 UTC

Description Varuni Sawant 2024-08-19 09:54:42 UTC
Description of problem:

After host removal, stray host and stray daemons warning reported for the removed host. At the same time OSDs from another host were being drained. To mitigate the stray host warning a mgr failover was performed, however the mgr crashed with the error message:

mgr operator() Failed to run module in active mode ('cephadm')

Version-Release number of selected component (if applicable):
Red Hat Ceph Storage 7.1 - 7.1 (18.2.1-194.el9cp)

Comment 49 errata-xmlrpc 2024-11-07 14:39:28 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.1 security, bug fix, and enhancement updates), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:9010


Note You need to log in before you can comment on or make changes to this bug.