Bug 2319372 - [4.18]: Deployment failed with "Failed to set node Topology Map for StorageCluster"
Summary: [4.18]: Deployment failed with "Failed to set node Topology Map for StorageC...
Keywords:
Status: ON_QA
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: ocs-operator
Version: 4.18
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: urgent
Target Milestone: ---
Target Release: ---
Assignee: Oded
QA Contact: Vijay Avuthu
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2024-10-17 13:41 UTC by Vijay Avuthu
Modified: 2024-10-18 05:51 UTC

Fixed In Version: 4.18.0-35
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:



Links
System                 ID                                      Private  Priority  Status  Summary                           Last Updated
Github                 red-hat-storage ocs-operator pull 2855  0        None      Merged  Add the verb patch to core group  2024-10-17 13:52:27 UTC
Red Hat Issue Tracker  OCSBZM-9404                             0        None      None    None                              2024-10-17 13:43:00 UTC

Description Vijay Avuthu 2024-10-17 13:41:51 UTC
Description of problem (please be as detailed as possible and provide log
snippets):

On the vSphere platform, no Ceph-related pods are created.


Version of all relevant components (if applicable):
openshift installer (4.18.0-0.nightly-2024-10-16-094159)
ocs-registry:4.18.0-34


Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Yes

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1

Is this issue reproducible?
1/1

Can this issue be reproduced from the UI?
Not tried

If this is a regression, please provide more details to justify this:
Yes

Steps to Reproduce:
1. Install ODF 4.18 using ocs-ci.
2. Check that all pods are created.


Actual results:

$ oc get pods
NAME                                               READY   STATUS    RESTARTS      AGE
ceph-csi-controller-manager-b57b45fd5-ss74x        2/2     Running   0             83m
csi-addons-controller-manager-5c6998fcd6-gms8s     2/2     Running   0             83m
noobaa-operator-67f9cfc66f-vblmc                   1/1     Running   0             83m
ocs-operator-675d5b87f6-8bsbb                      1/1     Running   3 (83m ago)   84m
ocs-provider-server-6665fddf55-rdbgw               1/1     Running   0             80m
odf-console-6965b9774f-t7hlr                       1/1     Running   0             84m
odf-operator-controller-manager-6fd77bb6c4-9m9xr   2/2     Running   0             84m
rook-ceph-operator-6d6b755fc4-ftj8r                1/1     Running   0             83m
ux-backend-server-5c8999c8d8-dpsmc                 2/2     Running   0             84m
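
The listing above can be checked mechanically for Ceph daemon pods. The following is a minimal sketch, not part of ocs-ci: the helper names (`pod_names`, `ceph_daemon_pods`) are hypothetical, and the prefixes `rook-ceph-mon`, `rook-ceph-mgr` and `rook-ceph-osd` are an assumption about how Rook names the daemon pods.

```python
# Assumption: Ceph daemon pods deployed by Rook start with one of these prefixes.
CEPH_PREFIXES = ("rook-ceph-mon", "rook-ceph-mgr", "rook-ceph-osd")


def pod_names(oc_get_pods_output: str) -> list[str]:
    """Return the NAME column from `oc get pods` text output, skipping the header."""
    lines = [line for line in oc_get_pods_output.splitlines() if line.strip()]
    return [line.split()[0] for line in lines[1:]]


def ceph_daemon_pods(names: list[str]) -> list[str]:
    """Keep only the names that look like Ceph daemon pods."""
    return [name for name in names if name.startswith(CEPH_PREFIXES)]


# Pod list copied from the report (columns other than NAME are ignored).
observed = """\
NAME                                               READY   STATUS    RESTARTS      AGE
ceph-csi-controller-manager-b57b45fd5-ss74x        2/2     Running   0             83m
csi-addons-controller-manager-5c6998fcd6-gms8s     2/2     Running   0             83m
noobaa-operator-67f9cfc66f-vblmc                   1/1     Running   0             83m
ocs-operator-675d5b87f6-8bsbb                      1/1     Running   3 (83m ago)   84m
ocs-provider-server-6665fddf55-rdbgw               1/1     Running   0             80m
odf-console-6965b9774f-t7hlr                       1/1     Running   0             84m
odf-operator-controller-manager-6fd77bb6c4-9m9xr   2/2     Running   0             84m
rook-ceph-operator-6d6b755fc4-ftj8r                1/1     Running   0             83m
ux-backend-server-5c8999c8d8-dpsmc                 2/2     Running   0             84m
"""

found = ceph_daemon_pods(pod_names(observed))
print("Ceph daemon pods:", found)
```

With the listing from this report the filter finds nothing: only operator and controller pods are present, which matches the reported symptom.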



Expected results:

All Ceph-related pods should be created.


Additional info:

ocs-operator log

{"level":"error","ts":"2024-10-17T13:28:43Z","logger":"controllers.StorageCluster","msg":"Failed to set node Topology Map for StorageCluster.","Request.Namespace":"openshift-storage","Request.Name":"ocs-storagecluster","StorageCluster":{"name":"ocs-storagecluster","namespace":"openshift-storage"},"error":"nodes \"compute-0\" is forbidden: User \"system:serviceaccount:openshift-storage:ocs-operator\" cannot patch resource \"nodes\" in API group \"\" at the cluster scope","stacktrace":"github.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*ocsTopologyMap).ensureCreated\n\t/remote-source/app/controllers/storagecluster/topology.go:29\ngithub.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*StorageClusterReconciler).reconcilePhases\n\t/remote-source/app/controllers/storagecluster/reconcile.go:455\ngithub.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*StorageClusterReconciler).Reconcile\n\t/remote-source/app/controllers/storagecluster/reconcile.go:190\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Reconcile\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:116\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).reconcileHandler\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:303\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).processNextWorkItem\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:263\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Start.func2.2\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:224"}
{"level":"error","ts":"2024-10-17T13:28:43Z","msg":"Reconciler error","controller":"storagecluster","controllerGroup":"ocs.openshift.io","controllerKind":"StorageCluster","StorageCluster":{"name":"ocs-storagecluster","namespace":"openshift-storage"},"namespace":"openshift-storage","name":"ocs-storagecluster","reconcileID":"d340653b-16a9-45b0-bf51-6251bcdb650a","error":"nodes \"compute-0\" is forbidden: User \"system:serviceaccount:openshift-storage:ocs-operator\" cannot patch resource \"nodes\" in API group \"\" at the cluster scope","stacktrace":"sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).reconcileHandler\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:316\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).processNextWorkItem\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:263\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Start.func2.2\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:224"}


job: https://url.corp.redhat.com/ab707c7
must-gather: https://url.corp.redhat.com/e53cca8

