Bug 2228357

Summary: mds: MDLog::_recovery_thread: handle the errors gracefully
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Jos Collin <jcollin>
Component: CephFS Assignee: Jos Collin <jcollin>
Status: CLOSED ERRATA QA Contact: Hemanth Kumar <hyelloji>
Severity: medium Docs Contact: Akash Raj <akraj>
Priority: medium    
Version: 6.1 CC: akraj, ceph-eng-bugs, cephqe-warriors, hyelloji, tserlin, vereddy, vshankar
Target Milestone: ---   
Target Release: 6.1z2   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: ceph-17.2.6-125.el9cp Doc Type: Bug Fix
Doc Text:
.Errors are handled gracefully in `MDLog::_recovery_thread`
Previously, a write would fail if the MDS was already blocklisted because of the `fs fail` command issued by the QA tests. For instance, the QA test `test_rebuild_moved_file` (tasks/data-scan) would fail for this reason. With this fix, the write failures are handled gracefully in `MDLog::_recovery_thread`.
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-10-12 16:34:36 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2235257    

Description Jos Collin 2023-08-02 07:35:55 UTC
Description of problem:
A write fails if the MDS is already blocklisted because of the 'fs fail' command issued by the QA tests.
These write failures should be handled gracefully, even when the MDS is stopping.

Version-Release number of selected component (if applicable):
6.2

How reproducible:
test_rebuild_moved_file (tasks/data-scan) fails because the MDS crashes:
https://tracker.ceph.com/issues/61201

Steps to Reproduce:
https://tracker.ceph.com/issues/61201

Actual results:
The MDS asserts when the write fails in MDLog::_recovery_thread.

Expected results:
Handle those write failures gracefully.
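
For illustration only, the difference between asserting on a failed write and handling it gracefully can be sketched as follows. This is not the actual Ceph patch: `RecoveryResult`, `WriteFn`, and `recover_step` are hypothetical names, and `-EIO` in the usage note stands in for whatever error a blocklisted MDS would really receive.

```cpp
#include <cerrno>
#include <functional>
#include <iostream>

// Hypothetical outcome of one recovery step; not a Ceph type.
enum class RecoveryResult { Ok, AbortedWhileStopping, Failed };

// Hypothetical stand-in for a journal write issued during recovery.
// Returns 0 on success or a negative errno value on failure.
using WriteFn = std::function<int()>;

// Sketch: report a failed write to the caller instead of asserting on it.
RecoveryResult recover_step(const WriteFn& write_journal, bool daemon_stopping) {
  const int r = write_journal();
  if (r == 0) {
    return RecoveryResult::Ok;
  }

  // An assert-style check (assert(r == 0)) would crash the daemon here.
  // The graceful variant logs the error and lets the caller decide.
  std::cerr << "recovery write failed, r=" << r << std::endl;

  if (daemon_stopping) {
    // The daemon is already shutting down, so simply stop recovery.
    return RecoveryResult::AbortedWhileStopping;
  }
  return RecoveryResult::Failed;
}
```

With this sketch, `recover_step([] { return -EIO; }, true)` returns `RecoveryResult::AbortedWhileStopping` rather than terminating the process.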

Comment 8 errata-xmlrpc 2023-10-12 16:34:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security, enhancement, and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:5693