Description of problem: The MDS is not resetting the heartbeat while processing imported caps. The mons interpret this as the MDS being stuck and consequently removes it from the MDSMap. This may cause the MDSs to "flap" when there are large numbers of inodes to be loaded into cache. Version-Release number of selected component (if applicable): 3.0 How reproducible: Potentially difficult. It is necessary to have many clients with caps and millions of inodes in cache before testing failover.
Automation regression runs passed http://cistatus.ceph.redhat.com/ui/#cephci/launches/all%7Cpage.page=1&page.size=50&page.sort=start_time,number%2CDESC/5bc8bb4e36d1a000016d7470?page.page=1&page.size=50&page.sort=start_time%2CASC username: ceph, passwd: ceph Moving this bug to verified state.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:3530