.The MDS daemon no longer crashes when receiving unsupported metrics
Previously, the MDS daemon could not handle the new metrics from the kernel client causing the MDS daemons to crash on receiving any unsupported metrics.
With this release, the MDS discards any unsupported metrics and works as expected.
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory (Red Hat Ceph Storage 5.0 Bug Fix update), and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2022:0466
Description of problem: Ceph MDS repeatedly crashing with: Dec 08 14:11:45 cephmds1 conmon[3431]: debug -1> 2021-12-08T13:11:45.526+0000 7f08eb0aa700 -1 /builddir/build/BUILD/ceph-16.2.0/src/include/cephfs/metrics/Types.h: In function 'std::ostream& operator<<(std::ostream&, const ClientMetricType&)' thread 7f08eb0aa700 time 2021-12-08T13:11:45.526200+0000 Dec 08 14:11:45 cephmds1 conmon[3431]: /builddir/build/BUILD/ceph-16.2.0/src/include/cephfs/metrics/Types.h: 56: ceph_abort_msg("abort() called") Dec 08 14:11:45 cephmds1 conmon[3431]: Dec 08 14:11:45 cephmds1 conmon[3431]: ceph version 16.2.0-117.el8cp (0e34bb74700060ebfaa22d99b7d2cdc037b28a57) pacific (stable) Dec 08 14:11:45 cephmds1 conmon[3431]: 1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xe5) [0x7f08f3d15cd4] Dec 08 14:11:45 cephmds1 conmon[3431]: 2: (operator<<(std::ostream&, ClientMetricType const&)+0x10e) [0x7f08f3f9a2ce] Dec 08 14:11:45 cephmds1 conmon[3431]: 3: (MClientMetrics::print(std::ostream&) const+0x1a1) [0x7f08f3f9a4a1] Dec 08 14:11:45 cephmds1 conmon[3431]: 4: (DispatchQueue::entry()+0x1be2) [0x7f08f3f50312] Dec 08 14:11:45 cephmds1 conmon[3431]: 5: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f08f3fff9b1] Dec 08 14:11:45 cephmds1 conmon[3431]: 6: /lib64/libpthread.so.0(+0x814a) [0x7f08f2ab714a] Dec 08 14:11:45 cephmds1 conmon[3431]: 7: clone() Version-Release number of selected component (if applicable): RHCS 5.0 How reproducible: 100% for this env Actual results: Ceph MDS crashes repeatedly Expected results: Ceph MDS not to crash Additional info: Appears to be fixed via upstream: # Tracker: https://tracker.ceph.com/issues/50822 # PR: https://github.com/ceph/ceph/pull/41596