Bug 2239802
| Summary: | [External mode]: Failed to run rbd commands from rook ceph operator pod | | |
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Joy John Pinto <jopinto> |
| Component: | rook | Assignee: | Subham Rai <srai> |
| Status: | CLOSED ERRATA | QA Contact: | Joy John Pinto <jopinto> |
| Severity: | urgent | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.14 | CC: | ebenahar, kramdoss, mrajanna, muagarwa, odf-bz-bot, tnielsen |
| Target Milestone: | --- | | |
| Target Release: | ODF 4.14.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | 4.14.0-157 | Doc Type: | No Doc Update |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2023-11-08 18:54:58 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
Description
Joy John Pinto
2023-09-20 09:41:23 UTC
With ODF 4.14.0-156, client.healthchecker does not have sufficient permissions (https://bugzilla.redhat.com/show_bug.cgi?id=2246484), and network fences are not created after marking the node as unschedulable (oc adm taint nodes <node-name> node.kubernetes.io/out-of-service=nodeshutdown:NoExecute). Hence marking the bug as FailedQA.

(venv) [jopinto@jopinto new]$ oc get networkfences.csiaddons.openshift.io
No resources found
(venv) [jopinto@jopinto new]$

Verified in OCP 4.14.0-0.nightly-2023-10-31-145859 and odf-operator.v4.14.0-158.

1. Created an external mode cluster and created an app pod on compute-1.

2. Powered off compute-1 and tainted the node:
[jopinto@jopinto new]$ oc adm taint nodes compute-1 node.kubernetes.io/out-of-service=nodeshutdown:NoExecute
node/compute-1 tainted

3. The NetworkFence and the CIDR blocklist entry were created, and the pod was running on a new node:
[jopinto@jopinto new]$ oc get networkfences.csiaddons.openshift.io
NAME        DRIVER                               CIDRS                 FENCESTATE   AGE   RESULT
compute-1   openshift-storage.rbd.csi.ceph.com   ["10.1.160.199/32"]   Fenced       53s   Succeeded
sh-5.1# ceph osd blocklist ls
....
10.0.211.1:6801/577936614 2023-11-03T07:05:05.293134+0000
cidr:10.1.160.199:0/32 2028-11-02T11:51:57.039577+0000
listed 18 entries

4. Untainted the node:
oc adm taint nodes compute-1 node.kubernetes.io/out-of-service=nodeshutdown:NoExecute-

5. The NetworkFence and the CIDR blocklist entry were removed:
[jopinto@jopinto new]$ oc get networkfences.csiaddons.openshift.io
No resources found
sh-5.1# ceph osd blocklist ls
...
listed 17 entries

It is working as expected on an external mode cluster, hence closing the bug.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.14.0 security, enhancement & bug fix update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHSA-2023:6832
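
The verification above compares the CIDRs reported by the NetworkFence resource with the entries in the Ceph OSD blocklist. A minimal sketch of that comparison follows, working only on captured text output. It assumes IPv4 and the `cidr:<ip>:<port>/<prefix>` line layout seen in the `ceph osd blocklist ls` output in this report. The function names `blocklisted_cidrs` and `fence_is_applied` are hypothetical helpers for illustration and are not part of rook, csi-addons, or Ceph.

```python
import re
from typing import Iterable, Set

# Assumed line layout, taken from the output quoted in this report:
#   cidr:10.1.160.199:0/32 2028-11-02T11:51:57.039577+0000
_CIDR_LINE = re.compile(r"\s*cidr:(\d+\.\d+\.\d+\.\d+):\d+/(\d+)\b")


def blocklisted_cidrs(blocklist_output: str) -> Set[str]:
    """Return CIDR blocklist entries as 'ip/prefix' strings (IPv4 only)."""
    cidrs = set()
    for line in blocklist_output.splitlines():
        match = _CIDR_LINE.match(line)
        if match:
            cidrs.add(f"{match.group(1)}/{match.group(2)}")
    return cidrs


def fence_is_applied(fence_cidrs: Iterable[str], blocklist_output: str) -> bool:
    """True if every CIDR listed by the NetworkFence is in the blocklist."""
    return set(fence_cidrs) <= blocklisted_cidrs(blocklist_output)


# Text copied from the verification steps in this report.
FENCED_OUTPUT = """\
10.0.211.1:6801/577936614 2023-11-03T07:05:05.293134+0000
cidr:10.1.160.199:0/32 2028-11-02T11:51:57.039577+0000
listed 18 entries
"""
UNFENCED_OUTPUT = "listed 17 entries\n"

if __name__ == "__main__":
    fence = ["10.1.160.199/32"]
    print(fence_is_applied(fence, FENCED_OUTPUT))
    print(fence_is_applied(fence, UNFENCED_OUTPUT))
```

The sketch parses saved output rather than invoking `oc` or `ceph`, so it can be run against logs attached to a bug without cluster access.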