Bug 2228039 - mds: do not evict clients if OSDs are laggy
Summary: mds: do not evict clients if OSDs are laggy
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 5.2
Hardware: Unspecified
OS: Unspecified
unspecified
medium
Target Milestone: ---
: 5.3z6
Assignee: Dhairya Parmar
QA Contact: Hemanth Kumar
Ranjini M N
URL:
Whiteboard:
Depends On:
Blocks: 2260003 2228065 2228066 2258797
TreeView+ depends on / blocked
 
Reported: 2023-08-01 07:59 UTC by Dhairya Parmar
Modified: 2024-02-08 16:50 UTC (History)
7 users (show)

Fixed In Version: ceph-16.2.10-223.el8cp
Doc Type: Enhancement
Doc Text:
.Laggy clients are now evicted only if there are no laggy OSDs Previously, monitoring performance dumps from the MDS would sometimes show that the OSDs were laggy, `objecter.op_laggy` and `objecter.osd_laggy`, causing laggy clients (dirty data could not be flushed for cap revokes). With this enhancement, if the `defer_client_eviction_on_laggy_osds` option is set to true and a client gets laggy because of a laggy OSD then client eviction will not take place until OSDs are no longer laggy.
Clone Of:
: 2228065 2228066 2260003 (view as bug list)
Environment:
Last Closed: 2024-02-08 16:50:15 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 58023 0 None None None 2023-08-01 09:33:43 UTC
Red Hat Issue Tracker RHCEPH-7130 0 None None None 2023-08-01 08:13:58 UTC
Red Hat Product Errata RHSA-2024:0745 0 None None None 2024-02-08 16:50:32 UTC

Description Dhairya Parmar 2023-08-01 07:59:52 UTC
Description of problem: If OSD(s) is/are laggy (due to certain conditions like network cut-off, etc) then it might make clients laggy(session might get idle or cannot flush dirty data for cap revokes). Therefore have a config option that can be used to defer evicting/removing "laggy" client sessions until OSDs are no longer laggy. Log a warning to the cluster indicating the situation.


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 1 RHEL Program Management 2023-08-01 08:00:02 UTC
Please specify the severity of this bug. Severity is defined here:
https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.

Comment 5 Scott Ostapovicz 2023-08-17 14:22:45 UTC
This is being retargeted to 5.3 z6 since it was not completed in time for the 5.3 z5 release.

Comment 14 errata-xmlrpc 2024-02-08 16:50:15 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 5.3 Security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:0745


Note You need to log in before you can comment on or make changes to this bug.