Description of problem: - Customer is running with 3 ceph-mgr daemons. - "ceph -s" was reporting no active mgr even though the services were up and running in 3 nodes. - Later customer restarted the ceph-mgr services in two nodes and that could fix the issue for two nodes."ceph -s" reported one active and one standby mgr in MGR group. - For the third mgr node the restart is yet to be done. - Probably the restart of the ceph-mgr will fix the issue for the third node also. Version-Release number of selected component (if applicable): - Red Hat Ceph Storage 4.3 - 4.3 ceph version 14.2.22-110.el8cp
@akraj Hi Akash, yes, it will be added to 7.0, we can mention it in the RN, not too important, but it will affect operators "Clusters can lose connection with MGR when there are some network issue and monclient failed to authenticate, in that situation, MGR could disconnect from the cluster without retries. The fix will add retry when hunting and connection are both failed"
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat Ceph Storage 7.0 Bug Fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2023:7780
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days