Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 2228339

Summary: mds: MDLog::_recovery_thread: handle the errors gracefully
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Jos Collin <jcollin>
Component: CephFSAssignee: Jos Collin <jcollin>
Status: CLOSED ERRATA QA Contact: Hemanth Kumar <hyelloji>
Severity: medium Docs Contact: Akash Raj <akraj>
Priority: unspecified    
Version: 5.3CC: akraj, ceph-eng-bugs, cephqe-warriors, rmandyam, tserlin, vshankar
Target Milestone: ---   
Target Release: 5.3z5   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: ceph-16.2.10-203.el8cp Doc Type: Bug Fix
Doc Text:
.handle the errors gracefully in MDLog::_recovery_thread. A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the qa tests. For instance, thes test test_rebuild_moved_file (tasks/data-scan) fails due to this reason. This fix handles those write failures gracefully in MDLog::_recovery_thread, even when the MDS is stopping.
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-08-28 09:40:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jos Collin 2023-08-02 06:28:55 UTC
Description of problem:
A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the qa tests.
Handle those write failures gracefully, even when the MDS is stopping.

Version-Release number of selected component (if applicable):
5.3

How reproducible:
test_rebuild_moved_file (tasks/data-scan) fails because mds crashes:
https://tracker.ceph.com/issues/61201

Steps to Reproduce:
https://tracker.ceph.com/issues/61201

Actual results:
asserts when the write fails in MDLog::_recovery_thread.

Expected results:
Handle those write failures gracefully.

Comment 9 Venky Shankar 2023-08-10 06:17:16 UTC
Jos - please rebase https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

Comment 10 Jos Collin 2023-08-12 00:21:51 UTC
(In reply to Venky Shankar from comment #9)
> Jos - please rebase
> https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

rebased.

Comment 17 errata-xmlrpc 2023-08-28 09:40:56 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 5.3 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:4760