Bug 2227309 - [RFE] mds: add balance_automate fs setting
Summary: [RFE] mds: add balance_automate fs setting
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 5.3
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 7.1
Assignee: Patrick Donnelly
QA Contact: Hemanth Kumar
Akash Raj
URL:
Whiteboard:
Depends On:
Blocks: 2255435 2255436 2267614 2298578 2298579
TreeView+ depends on / blocked
 
Reported: 2023-07-28 16:42 UTC by Manny
Modified: 2024-08-28 10:45 UTC (History)
10 users (show)

Fixed In Version: ceph-18.2.1-26.el9cp
Doc Type: Enhancement
Doc Text:
.MDS dynamic metadata balancer is off by default. Previously, poor balancer behavior would fragment trees in undesirable ways by increasing the `max_mds` file system setting. With this enhancement, MDS dynamic metadata balancer is off, by default. Operators must turn on the balancer explicitly to use it.
Clone Of:
Environment:
Last Closed: 2024-06-13 14:20:43 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github ceph ceph pull 52196 0 None open mds: add balance_automate fs setting 2023-07-28 20:42:39 UTC
Red Hat Bugzilla 2203258 0 unspecified CLOSED MDS Behind on trimming (145961/128) max_segments: 128, num_segments: 145961 2023-07-28 17:03:08 UTC
Red Hat Issue Tracker RHCEPH-7104 0 None None None 2023-07-28 16:43:40 UTC
Red Hat Knowledge Base (Solution) 7026124 0 None None None 2023-07-28 17:02:34 UTC
Red Hat Product Errata RHSA-2024:3925 0 None None None 2024-06-13 14:21:02 UTC

Description Manny 2023-07-28 16:42:52 UTC
Description of problem:  RFE: change default value of "mds_bal_interval" to "0", aka false

From case 03492882 and BZ (https://bugzilla.redhat.com/show_bug.cgi?id=2203258) we see that having "mds_bal_interval" enabled results in performance issue. Beyond the log evidence in BZ 2203258, the customer reports that Ceph FS latency has dropped by 75 percent and I/O in the cluster has doubled since "mds_bal_interval" was set to false on their system.

We'd like to see the change in 5.3z-whatever and 6.1z-whatever and beyond

See also KCS #7026124, (https://access.redhat.com/solutions/7026124)

This same customer commented if this feature hampers performance this much and there is no intention on fixing it, the default value should be changed.


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 12 errata-xmlrpc 2024-06-13 14:20:43 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Critical: Red Hat Ceph Storage 7.1 security, enhancements, and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:3925


Note You need to log in before you can comment on or make changes to this bug.