Bug 2228339 - mds: MDLog::_recovery_thread: handle the errors gracefully
Summary: mds: MDLog::_recovery_thread: handle the errors gracefully
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 5.3
Hardware: All
OS: All
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 5.3z5
Assignee: Jos Collin
QA Contact: Hemanth Kumar
Docs Contact: Akash Raj
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2023-08-02 06:28 UTC by Jos Collin
Modified: 2023-10-05 11:10 UTC (History)

Fixed In Version: ceph-16.2.10-203.el8cp
Doc Type: Bug Fix
Doc Text:
.Handle the errors gracefully in MDLog::_recovery_thread. A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the QA tests. For instance, the test test_rebuild_moved_file (tasks/data-scan) fails for this reason. This fix handles those write failures gracefully in MDLog::_recovery_thread, even when the MDS is stopping.
Clone Of:
Environment:
Last Closed: 2023-08-28 09:40:56 UTC
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 61201 0 None None None 2023-08-02 06:28:54 UTC
Red Hat Issue Tracker RHCEPH-7158 0 None None None 2023-08-03 18:18:09 UTC
Red Hat Product Errata RHBA-2023:4760 0 None None None 2023-08-28 09:41:40 UTC

Description Jos Collin 2023-08-02 06:28:55 UTC
Description of problem:
A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the QA tests.
Those write failures should be handled gracefully, even when the MDS is stopping.

Version-Release number of selected component (if applicable):
5.3

How reproducible:
test_rebuild_moved_file (tasks/data-scan) fails because the MDS crashes:
https://tracker.ceph.com/issues/61201

Steps to Reproduce:
https://tracker.ceph.com/issues/61201

Actual results:
The MDS asserts when the write fails in MDLog::_recovery_thread.

Expected results:
Handle those write failures gracefully.
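
To illustrate the expected behaviour, here is a minimal C++ sketch of the general pattern: check the result of the write and bail out of recovery cleanly instead of asserting. This is not the actual Ceph patch. The names recover_journal, RecoveryResult and kBlocklistedErr are hypothetical, the error value is an arbitrary placeholder, and the write is modelled as a plain callback rather than the real journal code.

```cpp
#include <cassert>
#include <functional>
#include <iostream>

// Hypothetical placeholder for the error returned by a write once the MDS
// has been blocklisted. The value is arbitrary and not taken from Ceph.
constexpr int kBlocklistedErr = -108;

// Hypothetical outcome of a recovery attempt.
enum class RecoveryResult { Ok, AbortedBlocklisted, AbortedStopping, Failed };

// Hypothetical recovery step. `write_header` stands in for the write that
// fails in the scenario from this report; `mds_stopping` models an MDS that
// is already shutting down.
RecoveryResult recover_journal(const std::function<int()>& write_header,
                               bool mds_stopping) {
  assert(write_header && "a write callback must be provided");

  const int r = write_header();
  if (r == 0) {
    return RecoveryResult::Ok;
  }

  // Per the report, a failed write used to trip an assertion here and
  // crash the MDS. The sketch reports the failure and returns instead.
  if (r == kBlocklistedErr) {
    std::cerr << "recovery: write failed, MDS is blocklisted; aborting\n";
    return RecoveryResult::AbortedBlocklisted;
  }
  if (mds_stopping) {
    std::cerr << "recovery: write failed while MDS is stopping; aborting\n";
    return RecoveryResult::AbortedStopping;
  }
  std::cerr << "recovery: write failed with error " << r << "\n";
  return RecoveryResult::Failed;
}
```

A caller would pass the real write as the callback, for example recover_journal([] { return kBlocklistedErr; }, false), and decide what to do based on the returned RecoveryResult rather than relying on an assertion.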

Comment 9 Venky Shankar 2023-08-10 06:17:16 UTC
Jos - please rebase https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

Comment 10 Jos Collin 2023-08-12 00:21:51 UTC
(In reply to Venky Shankar from comment #9)
> Jos - please rebase
> https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

rebased.

Comment 17 errata-xmlrpc 2023-08-28 09:40:56 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 5.3 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:4760

