Bug 2266537

Summary: [7.0z backport] ceph-client.admin crashed in ceph-exporter thread with "throw_invalid_argument(char const*, boost::source_location const&)+0x37) [0x557c40cab267]"
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Bipin Kunal <bkunal>
Component: Ceph-MetricsAssignee: Juan Miguel Olmo <jolmomar>
Status: CLOSED ERRATA QA Contact: Sayalee <saraut>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 6.1CC: amagrawa, athakkar, brgardne, ceph-eng-bugs, cephqe-warriors, dkamboj, jolmomar, muagarwa, nagreddy, nthomas, odf-bz-bot, prsurve, rpollack, sagrawal, saraut, sostapov, srai, tdesala, tnielsen, tserlin, vdas
Target Milestone: ---Keywords: Automation, Reopened
Target Release: 7.1z3   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-18.2.1-174 Doc Type: Bug Fix
Doc Text:
.‘ceph-exporter’ is now able to handle exceptions during metric collection and maintains stable operation Previously, the ‘ceph-exporter’ thread in ‘ceph-client.admin’ was not properly handling exceptions, particularly when processing JSON data related to cluster metrics. As a result, the ‘ceph-exporter’ crashed with an ‘invalid_argument’ error while collecting metrics as the cluster approached its near-full ratio. The error caused the Ceph health status to enter a warning state. With this fix, exception handling is being handled properly in the ‘ceph-exporter’ thread. In addition, error logging was added to capture details of any exceptions. The ‘ceph-exporter’ now gracefully handles exceptions during metric collection, preventing crashes and maintaining stable operation even when the cluster approaches capacity limits.
Story Points: ---
Clone Of: 2266035
: 2266538 (view as bug list) Environment:
Last Closed: 2025-02-24 15:42:40 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2266035    
Bug Blocks: 2266538, 2266539    

Comment 1 Scott Ostapovicz 2024-04-10 13:47:37 UTC
This backport request needs to wait until the core fix has been validated.  At this date, it is still in POST and still says testing in progress.  

Retargeting to 7.0 z3.

Comment 8 errata-xmlrpc 2025-02-24 15:42:40 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.1 security, bug fix, enhancement, and known issue updates), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2025:1770