Bug 2124417 - mgr/stats: be resilient to offline MDS rank-0
Summary: mgr/stats: be resilient to offline MDS rank-0
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 5.2
Hardware: All
OS: Linux
medium
medium
Target Milestone: ---
: 5.3z1
Assignee: Jos Collin
QA Contact: julpark
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-09-06 05:33 UTC by Jos Collin
Modified: 2023-02-28 10:06 UTC (History)
7 users (show)

Fixed In Version: ceph-16.2.10-102.el8cp
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-02-28 10:05:18 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 54479 0 None None None 2022-09-06 05:33:40 UTC
Red Hat Issue Tracker RHCEPH-5202 0 None None None 2022-09-06 06:13:26 UTC
Red Hat Product Errata RHSA-2023:0980 0 None None None 2023-02-28 10:06:07 UTC

Description Jos Collin 2022-09-06 05:33:41 UTC
Description of problem:
mgr/stats can repeatedly report stale perf stats when MDS rank-0 becomes offline. Even after a standby daemon transitions to active rank-0, the metrics reported are still stale.
To fix this, reregister user queries when a new MDS rank-0 is seen.

Version-Release number of selected component (if applicable):
5.3

How reproducible:
Always

Steps to Reproduce:
1. Create a filesystem, mount it and run `watch ceph fs perf stats`.
2. Fail the rank0 mds.
3. `ceph fs perf stats` output shows stale metrics or no metrics.

Actual results:
`ceph fs perf stats` output shows stale metrics or no metrics.

Expected results:
`ceph fs perf stats` should display valid metrics.

Additional info:

Comment 14 errata-xmlrpc 2023-02-28 10:05:18 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Ceph Storage 5.3 Bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:0980


Note You need to log in before you can comment on or make changes to this bug.