Bug 2228066

Summary: mds: do not evict clients if OSDs are laggy
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Dhairya Parmar <dparmar>
Component: CephFSAssignee: Dhairya Parmar <dparmar>
Status: CLOSED ERRATA QA Contact: Hemanth Kumar <hyelloji>
Severity: medium Docs Contact: Rivka Pollack <rpollack>
Priority: unspecified    
Version: 5.2CC: akraj, amk, ceph-eng-bugs, cephqe-warriors, gfarnum, hyelloji, ngangadh, rpollack, sumr, tserlin, vshankar
Target Milestone: ---   
Target Release: 7.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-18.2.0-30.el9cp Doc Type: Enhancement
Doc Text:
.Laggy clients are now evicted only if there are no laggy OSDs Previously, monitoring performance dumps from the MDS would sometimes show that the OSDs were laggy, `objecter.op_laggy` and `objecter.osd_laggy`, causing laggy clients (cannot flush dirty data for cap revokes). With this enhancement, if `defer_client_eviction_on_laggy_osds` is set to true and a client gets laggy because of a laggy OSD then client eviction will not take place until OSDs are no longer laggy.
Story Points: ---
Clone Of: 2228039 Environment:
Last Closed: 2023-12-13 15:21:19 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2228039, 2260003    
Bug Blocks: 2228065, 2237662, 2247187    

Description Dhairya Parmar 2023-08-01 09:37:22 UTC
+++ This bug was initially created as a clone of Bug #2228039 +++

Description of problem: If OSD(s) is/are laggy (due to certain conditions like network cut-off, etc) then it might make clients laggy(session might get idle or cannot flush dirty data for cap revokes). Therefore have a config option that can be used to defer evicting/removing "laggy" client sessions until OSDs are no longer laggy. Log a warning to the cluster indicating the situation.


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

--- Additional comment from RHEL Program Management on 2023-08-01 08:00:02 UTC ---

Please specify the severity of this bug. Severity is defined here:
https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.

Comment 23 errata-xmlrpc 2023-12-13 15:21:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.0 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:7780