Description of problem: standby-replay mds is removed from MDSMap unexpectedly. Change https://github.com/ceph/ceph/commit/20509bb6c82e872127ab838d45402be0d0b91b5f evicts MDSs when a garbage beacon or an invalid state transition is seen by the monitor. To reproduce this, the standby-replay daemon needs to be laggy and then when it resumes back to normal operation, the monitor would remove the standby-replay MDS.
Please specify the severity of this bug. Severity is defined here: https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: Red Hat Ceph Storage 5.3 security update and Bug Fix), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2023:0076