Bug 2228339 - mds: MDLog::_recovery_thread: handle the errors gracefully
Summary: mds: MDLog::_recovery_thread: handle the errors gracefully
Keywords:
Status: VERIFIED
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 5.3
Hardware: All
OS: All
unspecified
medium
Target Milestone: ---
: 5.3z5
Assignee: Jos Collin
QA Contact: Hemanth Kumar
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-08-02 06:28 UTC by Jos Collin
Modified: 2023-08-16 19:43 UTC (History)
4 users (show)

Fixed In Version: ceph-16.2.10-203.el8cp
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 61201 0 None None None 2023-08-02 06:28:54 UTC
Red Hat Issue Tracker RHCEPH-7158 0 None None None 2023-08-03 18:18:09 UTC

Description Jos Collin 2023-08-02 06:28:55 UTC
Description of problem:
A write fails if the MDS is already blocklisted due to the 'fs fail' issued by the qa tests.
Handle those write failures gracefully, even when the MDS is stopping.

Version-Release number of selected component (if applicable):
5.3

How reproducible:
test_rebuild_moved_file (tasks/data-scan) fails because mds crashes:
https://tracker.ceph.com/issues/61201

Steps to Reproduce:
https://tracker.ceph.com/issues/61201

Actual results:
asserts when the write fails in MDLog::_recovery_thread.

Expected results:
Handle those write failures gracefully.

Comment 9 Venky Shankar 2023-08-10 06:17:16 UTC
Jos - please rebase https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

Comment 10 Jos Collin 2023-08-12 00:21:51 UTC
(In reply to Venky Shankar from comment #9)
> Jos - please rebase
> https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/319

rebased.


Note You need to log in before you can comment on or make changes to this bug.