Bug 2244873
Summary: | [RDR]Ceph reports "no active mgr" after workload deployment | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | kmanohar |
Component: | ceph | Assignee: | Nitzan mordechai <nmordech> |
ceph sub component: | RADOS | QA Contact: | Elad <ebenahar> |
Status: | CLOSED WORKSFORME | Docs Contact: | |
Severity: | high | ||
Priority: | unspecified | CC: | amagrawa, bhubbard, bniver, edonnell, kramdoss, muagarwa, nojha, pdhange, prsurve, sagrawal, sheggodu, sostapov, srangana, tnielsen |
Version: | 4.14 | ||
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Known Issue | |
Doc Text: |
.Ceph reports "no active mgr" after workload deployment
After workload deployment, Ceph manager loses connectivity to MONs or is unable to respond to its liveness probe.
As a result, the ODF cluster status reports that there is "no active mgr", and operations that use the Ceph manager for request processing fail, for example volume provisioning and creating CephFS snapshots.
To check the status of the ODF cluster, use the command `oc get cephcluster -n openshift-storage`. In the status output, the `status.ceph.details.MGR_DOWN` field will have the message "no active mgr" if your cluster has this issue.
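The status check can also be scripted. The following is a minimal sketch, not a supported procedure. It assumes that the health detail is exposed under `status.ceph.details.MGR_DOWN` with a `message` subfield (the subfield name is an assumption). The helper names `mgr_down_message` and `has_no_active_mgr` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print the MGR_DOWN health message (if any) for the
# CephCluster resources in the openshift-storage namespace.
# Assumption: the detail entry carries its text in a "message" subfield.
mgr_down_message() {
    oc get cephcluster -n openshift-storage \
        -o jsonpath='{.items[*].status.ceph.details.MGR_DOWN.message}'
}

# Hypothetical helper: succeeds (exit status 0) only when the reported
# message contains "no active mgr".
has_no_active_mgr() {
    case "$(mgr_down_message)" in
        *"no active mgr"*) return 0 ;;
        *) return 1 ;;
    esac
}
```

For example, `has_no_active_mgr && echo "MGR_DOWN detected"` prints a notice only when the cluster reports the issue.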
To work around this issue, restart the Ceph manager pods using the following commands:
+
----
# oc scale deployment -n openshift-storage rook-ceph-mgr-a --replicas=0
----
+
----
# oc scale deployment -n openshift-storage rook-ceph-mgr-a --replicas=1
----
After running these commands, the ODF cluster status reports a healthy cluster, with no warnings or errors regarding `MGR_DOWN`.
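The two scale commands above can be wrapped in a small function. This is a minimal sketch under stated assumptions: the function name `restart_ceph_mgr` is hypothetical, and the final `oc rollout status` wait with a 300-second timeout is an addition that is not part of the documented workaround.

```shell
#!/bin/sh
# Hypothetical wrapper around the documented workaround: scale the Ceph
# manager deployment down to zero replicas, then back up to one.
restart_ceph_mgr() {
    ns=openshift-storage
    deploy=rook-ceph-mgr-a

    oc scale deployment -n "$ns" "$deploy" --replicas=0 || return 1
    oc scale deployment -n "$ns" "$deploy" --replicas=1 || return 1

    # Assumption (not in the original workaround): wait until the
    # deployment reports its replica as rolled out, or give up after 300s.
    oc rollout status deployment/"$deploy" -n "$ns" --timeout=300s
}
```

After calling `restart_ceph_mgr`, rerun `oc get cephcluster -n openshift-storage` to confirm that `MGR_DOWN` is no longer reported.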
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2024-06-24 04:58:25 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 2244409 |
Description
kmanohar
2023-10-18 16:58:34 UTC
Shyam, can you please help with documenting the workaround here?

We have seen this issue in 4.14 testing. With 4.15, we have 2 MGRs, so we might need to see the behaviour with 4.15 testing.

(In reply to krishnaram Karthick from comment #15)
> We have seen this issue in 4.14 testing.
> With 4.15, we have 2 MGRs, so we might need to see the behaviour with 4.15
> testing.

Hi Karthick, I don't understand the reason behind making it a blocker. In 4.15 we will have 2 MGRs by default, so according to https://bugzilla.redhat.com/show_bug.cgi?id=2255616#c3 we will have extra cushion in case this issue is hit. Please correct me if my understanding is wrong.

*** Bug 2255616 has been marked as a duplicate of this bug. ***

Could this be related to https://bugzilla.redhat.com/show_bug.cgi?id=2171847?

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days