Bug 1714814

Summary: MDS may try trimming all of its journal at once after recovery
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Patrick Donnelly <pdonnell>
Component: CephFSAssignee: Yan, Zheng <zyan>
Status: CLOSED ERRATA QA Contact: subhash <vpoliset>
Severity: high Docs Contact:
Priority: high    
Version: 3.1CC: ceph-eng-bugs, ceph-qe-bugs, edonnell, sweil, tchandra, tserlin, zyan
Target Milestone: rc   
Target Release: 3.3   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: RHEL: ceph-12.2.12-22.el7cp Ubuntu: ceph_12.2.12-18redhat1 Doc Type: Bug Fix
Doc Text:
.The MDS no longer tries many log segments after restart Previously, the Ceph Metadata Server (MDS) would sometimes try many log segments after restart. The MDS would then send too many OSD requests in a short period of time which could harm the Ceph cluster. This update limits the number of log segments, and the cluster is no longer harmed.
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-08-21 15:11:09 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1726135    

Description Patrick Donnelly 2019-05-28 23:19:09 UTC
Description of problem:

"If mds was behind on trim before failover, the new mds may trim too many log segments at the same time, and cause unhealthy heartbeat."

Version-Release number of selected component (if applicable):

3.1

How reproducible:

Needs synthetic test case with a lot of journal segments.

Comment 8 Yan, Zheng 2019-08-01 03:29:38 UTC
LGTM

Comment 11 errata-xmlrpc 2019-08-21 15:11:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2019:2538