Bug 1627553

Summary: MDS leaks file descriptors across respawn
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Patrick Donnelly <pdonnell>
Component: CephFS Assignee: Patrick Donnelly <pdonnell>
Status: CLOSED ERRATA QA Contact: ceph-qe-bugs <ceph-qe-bugs>
Severity: low Docs Contact:
Priority: medium    
Version: 3.0 CC: ceph-eng-bugs, hnallurv, john.spray, kdreyer
Target Milestone: z1   
Target Release: 3.1   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: RHEL: ceph-12.2.5-51.el7cp Ubuntu: ceph_12.2.5-36redhat1xenial Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-11-09 00:59:32 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Patrick Donnelly 2018-09-10 20:52:36 UTC
Description of problem:

The MDS leaks file descriptors across exec, so it runs out of file descriptors after several respawns.
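
The leak mechanism can be illustrated outside Ceph. The following is a minimal Python sketch, not the MDS code: a descriptor left inheritable stays open in the program started by exec, while one marked close-on-exec does not. The helper name `fd_survives_exec` and the child script are hypothetical, and POSIX semantics are assumed. This report does not state how the actual fix was implemented.

```python
import os
import subprocess
import sys

# Hypothetical child program: try to read one byte from the descriptor
# number passed as argv[1] and report whether it is the parent's pipe.
CHILD = """
import os, sys
try:
    data = os.read(int(sys.argv[1]), 1)
    print("leaked" if data == b"x" else "closed")
except OSError:
    print("closed")
"""


def fd_survives_exec(inheritable: bool) -> bool:
    """Return True if a pipe descriptor is still usable after fork + exec."""
    r, w = os.pipe()  # Python creates these non-inheritable (close-on-exec)
    try:
        os.write(w, b"x")  # marker byte the child looks for
        os.set_inheritable(r, inheritable)
        result = subprocess.run(
            [sys.executable, "-c", CHILD, str(r)],
            close_fds=False,  # do not let subprocess close extra descriptors
            capture_output=True,
            text=True,
            check=True,
        )
        return result.stdout.strip() == "leaked"
    finally:
        os.close(r)
        os.close(w)


if __name__ == "__main__":
    print("inheritable fd survives exec:  ", fd_survives_exec(True))
    print("close-on-exec fd survives exec:", fd_survives_exec(False))
```

A process that re-executes itself repeatedly and opens new inheritable descriptors each time accumulates them in the same way, until it reaches its descriptor limit.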

Version-Release number of selected component (if applicable):

3.0

How reproducible:

100%. Respawn the MDS a few dozen times via `ceph mds fail 0`. Use a single-MDS cluster (no standby) to reproduce it more easily.

Steps to Reproduce:
1. Run `while sleep 0.5; do ceph mds fail 0; done`.
2. The MDS will eventually fail to create an event file descriptor, as noted in its log, and then quit.
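
To watch the leak while the loop above runs, the descriptor count of the MDS process can be sampled. This is a rough sketch that assumes Linux with `/proc` mounted and that the respawned MDS keeps its PID (consistent with a respawn via exec, but an assumption here). The helper name `count_open_fds` is hypothetical.

```python
import os


def count_open_fds(pid: int) -> int:
    """Return the number of open file descriptors of process `pid` (Linux only)."""
    # Each entry in /proc/<pid>/fd is one open descriptor. Inspecting a
    # process owned by another user requires sufficient privileges.
    return len(os.listdir(f"/proc/{pid}/fd"))


if __name__ == "__main__":
    # For the current process the count includes the directory handle that
    # os.listdir() holds open while it reads /proc/<pid>/fd.
    print(count_open_fds(os.getpid()))
```

Calling `count_open_fds` with the PID of the `ceph-mds` process between respawns should show a steadily growing count on an affected build and a roughly constant one on a fixed build.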

Actual results:

The MDS exits with failure.

Expected results:

The MDS continues respawning indefinitely.

Comment 10 errata-xmlrpc 2018-11-09 00:59:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:3530