Bug 2130867 - [ceph-mgr] Cluster in health_err state with error : Module 'devicehealth' has failed: unknown operation
Summary: [ceph-mgr] Cluster in health_err state with error : Module 'devicehealth' has...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Ceph-Mgr Plugins
Version: 6.0
Hardware: All
OS: All
Priority: medium
Severity: urgent
Target Milestone: ---
Target Release: 7.0z1
Assignee: Patrick Donnelly
QA Contact: Harsh Kumar
Docs Contact: Rivka Pollack
URL:
Whiteboard:
Depends On:
Blocks: 2248719 2260311
Reported: 2022-09-29 08:40 UTC by Pawan
Modified: 2024-03-07 11:39 UTC (History)

Fixed In Version: ceph-18.2.0-139.el9cp
Doc Type: Bug Fix
Doc Text:
Previously, `libcephsqlite` would blocklist active connections when RADOS access was lost. As a result, some `ceph-mgr` modules, such as `devicehealth`, became unavailable. With this fix, database connections are reopened when a blocklist occurs, and the database blocklist no longer causes a terminal failure.
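
The reopen-on-blocklist behavior described in the Doc Text can be illustrated with a minimal sketch. This is not the code from the upstream pull request; it only shows the general pattern of discarding a dead database handle and retrying once on a fresh one. The names `BlocklistedError` and `ReopeningDB` are hypothetical and exist only for this illustration, and the standard-library `sqlite3` module stands in for the `libcephsqlite`-backed database.

```python
import sqlite3


class BlocklistedError(Exception):
    """Hypothetical stand-in for the error seen when a handle has been blocklisted."""


class ReopeningDB:
    """Hypothetical wrapper: reopens the database handle once if it is blocklisted."""

    def __init__(self, connect):
        # `connect` is a zero-argument callable that returns a new connection.
        self._connect = connect
        self._db = connect()
        self.reopens = 0  # how many times the handle has been replaced

    def execute(self, sql, params=()):
        try:
            return self._db.execute(sql, params).fetchall()
        except BlocklistedError:
            # The old handle is unusable: close it, open a new one, retry once.
            # A second failure propagates to the caller instead of looping.
            self._db.close()
            self._db = self._connect()
            self.reopens += 1
            return self._db.execute(sql, params).fetchall()
```

A caller would construct it with something like `ReopeningDB(lambda: sqlite3.connect(path))` and route all queries through `execute`, so that a blocklisted handle results in one reconnect rather than a permanently failed module.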
Clone Of:
Environment:
Last Closed: 2024-03-07 11:39:26 UTC
Embargoed:
rpollack: needinfo-


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Ceph Project Bug Tracker | 62022 | 0 | None | None | None | 2023-11-08 14:22:13 UTC
Github | ceph ceph pull 50291 | 0 | None | open | pybind/mgr: reopen database handle on blocklist | 2023-03-27 18:21:22 UTC
Red Hat Issue Tracker | RHCEPH-5383 | 0 | None | None | None | 2022-09-29 08:44:35 UTC
Red Hat Product Errata | RHBA-2024:1214 | 0 | None | None | None | 2024-03-07 11:39:33 UTC

Comment 10 Patrick Donnelly 2023-05-15 17:26:48 UTC
Moving to 6.1z1 for bug fix release.

Comment 12 Patrick Donnelly 2023-07-12 13:30:39 UTC
*** Bug 2222254 has been marked as a duplicate of this bug. ***

Comment 13 Scott Ostapovicz 2023-09-20 13:36:56 UTC
Time is up for z2 as well.  Retargeting to z3.

Comment 14 krishnaram Karthick 2023-09-21 04:28:10 UTC
(In reply to Scott Ostapovicz from comment #13)
> Time is up for z2 as well.  Retargeting to z3.

This bug fix was important for RDR and we were tracking the fix to be made available in 6.1z2. 
Any chance this could be retargeted at 6.1z2? (I believe the fix is ready too).

Comment 15 Scott Ostapovicz 2023-09-21 15:44:37 UTC
It looks like there is a potential fix for this upstream, but it is not yet tested and is unlikely to be tested by the week's end.  Unless this status changes, it is equally unlikely that this will be included in the z2 build.

Comment 29 errata-xmlrpc 2024-03-07 11:39:26 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.0 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:1214

