Bug 2228357 - mds: MDLog::_recovery_thread: handle the errors gracefully
Summary: mds: MDLog::_recovery_thread: handle the errors gracefully
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 6.1
Hardware: All
OS: All
medium
medium
Target Milestone: ---
: 6.1z2
Assignee: Jos Collin
QA Contact: Hemanth Kumar
Akash Raj
URL:
Whiteboard:
Depends On:
Blocks: 2235257
TreeView+ depends on / blocked
 
Reported: 2023-08-02 07:35 UTC by Jos Collin
Modified: 2023-11-03 04:01 UTC (History)
7 users (show)

Fixed In Version: ceph-17.2.6-125.el9cp
Doc Type: Bug Fix
Doc Text:
.Errors are handled gracefully in `MDLog::_recovery_thread` Previously, a write would fail if the MDS was already blocklisted due to the `fs fail` issued by the QA tests. For instance, the QA test `test_rebuild_moved_file` (tasks/data-scan) would fail due to this reason. With this fix, the write failures are gracefully handled in `MDLog::_recovery_thread`.
Clone Of:
Environment:
Last Closed: 2023-10-12 16:34:36 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 61201 0 None None None 2023-08-02 07:35:54 UTC
Red Hat Issue Tracker RHCEPH-7316 0 None None None 2023-09-05 09:51:23 UTC
Red Hat Product Errata RHSA-2023:5693 0 None None None 2023-10-12 16:35:38 UTC

Description Jos Collin 2023-08-02 07:35:55 UTC
Description of problem:
A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the qa tests.
Handle those write failures gracefully, even when the MDS is stopping.

Version-Release number of selected component (if applicable):
6.2

How reproducible:
test_rebuild_moved_file (tasks/data-scan) fails because mds crashes:
https://tracker.ceph.com/issues/61201

Steps to Reproduce:
https://tracker.ceph.com/issues/61201

Actual results:
asserts when the write fails in MDLog::_recovery_thread.

Expected results:
Handle those write failures gracefully.

Comment 8 errata-xmlrpc 2023-10-12 16:34:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security, enhancement, and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:5693


Note You need to log in before you can comment on or make changes to this bug.