Description of problem: When two or more active MDS repeatedly and concurrently fail over, it's possible for one MDS to become stuck in up:resolve state. Version-Release number of selected component (if applicable): 3.0 How reproducible: Unknown yet. Steps to Reproduce: Reproducer unknown yet.
Cherry-picked https://github.com/ceph/ceph/pull/23169 and ran on downstream, looks good http://pulpito.ceph.redhat.com/vasu-2018-07-27_19:33:34-fs-luminous-distro-basic-argo/
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:2375