Bug 2181949

Summary: [ODF Tracker] [RFE] Catch MDS damage to the dentry's first snapid
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Mudit Agarwal <muagarwa>
Component: ceph Assignee: Venky Shankar <vshankar>
ceph sub component: CephFS QA Contact: Elad <ebenahar>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: unspecified CC: ableisch, bkunal, bniver, ceph-eng-bugs, cephqe-warriors, etamir, flucifre, gfarnum, hyelloji, kramdoss, muagarwa, nberry, ocs-bugs, odf-bz-bot, pdonnell, sheggodu, sostapov, vshankar
Version: 4.10 Keywords: FutureFeature
Target Milestone: ---   
Target Release: ODF 4.13.0   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: 4.13.0-184 Doc Type: Enhancement
Doc Text:
Story Points: ---
Clone Of: 2175307 Environment:
Last Closed: 2023-06-21 15:25:01 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2175307    
Bug Blocks:    

Description Mudit Agarwal 2023-03-27 04:44:07 UTC
+++ This bug was initially created as a clone of Bug #2175307 +++

Description of problem:

This RFE adds functionality to the MDS to detect a specific kind of damage to metadata dentries: damage to the dentry's first snapid. The damage is associated with a long-standing bug (#38452).

This change will catch the damage before it is persisted. If the MDS detects **new** damage that is about to be written to persistent storage (i.e. RADOS), it will abort rather than persist the damage. This will hopefully also provide logs from the same time period in which the damage was created, which can then be used for analysis.
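
The abort-before-persist behaviour described above can be sketched roughly as follows. This is an illustrative Python model only, not the MDS code (which is C++). The names `Dentry`, `is_damaged`, `commit`, and `NewDamageDetected` are hypothetical, the dict stands in for RADOS, and the `first > last` test is an assumed stand-in for whatever invariant the MDS actually enforces on the first snapid.

```python
from dataclasses import dataclass


class NewDamageDetected(Exception):
    """Stands in for the MDS abort: raised instead of persisting bad metadata."""


@dataclass
class Dentry:
    name: str
    first: int  # first snapshot id in this dentry's validity range
    last: int   # last snapshot id in this dentry's validity range


def is_damaged(dn: Dentry) -> bool:
    # Assumption for illustration: a valid dentry never has first > last.
    return dn.first > dn.last


def commit(dentries, store):
    """Write dentries to `store` (a dict standing in for RADOS)."""
    # Check every dentry before writing anything, so that a damaged
    # entry stops the whole commit and nothing bad is persisted.
    for dn in dentries:
        if is_damaged(dn):
            raise NewDamageDetected(
                f"{dn.name}: first={dn.first} > last={dn.last}"
            )
    for dn in dentries:
        store[dn.name] = (dn.first, dn.last)
```

In this model a commit containing a damaged dentry raises before any write happens, so the store never holds the damaged value and the failure occurs at the moment the damage is first seen.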

https://tracker.ceph.com/issues/38452
https://tracker.ceph.com/issues/58482

Documentation for support to use when customers encounter the abort is forthcoming and will be available before 6.1 is released.

Comment 4 krishnaram Karthick 2023-04-07 06:03:44 UTC
This is a Ceph RFE, and the change will be in 4.13 as part of RHCS 6.1.
From the ODF side, I don't see any feature that depends on this Ceph RFE, so we will verify the bug based on regression testing.

Comment 11 Elad 2023-06-06 08:22:47 UTC
Moving to VERIFIED based on regression testing using ODF 4.13.0 builds, starting with 4.13.0-184.

Comment 14 errata-xmlrpc 2023-06-21 15:25:01 UTC
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Red Hat OpenShift Data Foundation 4.13.0 enhancement and bug fix update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:3742