Bug 2319372

Summary: [4.18]: Deployemnt failed with "Failed to set node Topology Map for StorageCluster"
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Vijay Avuthu <vavuthu>
Component: ocs-operatorAssignee: Oded <oviner>
Status: ON_QA --- QA Contact: Vijay Avuthu <vavuthu>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 4.18CC: mparida, muagarwa, nigoyal, odf-bz-bot
Target Milestone: ---Keywords: Automation, Regression
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: 4.18.0-35 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Vijay Avuthu 2024-10-17 13:41:51 UTC
Description of problem (please be detailed as possible and provide log
snippests):

In vSphere platform, no ceph related pods are created 


Version of all relevant components (if applicable):
openshift installer (4.18.0-0.nightly-2024-10-16-094159)
ocs-registry:4.18.0-34


Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Yes

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1

Can this issue reproducible?
1/1

Can this issue reproduce from the UI?
Not tried

If this is a regression, please provide more details to justify this:
Yes

Steps to Reproduce:
1. install ODF 4.18 using ocs-ci
2. check all pod are created
3.


Actual results:

$ oc get pods
NAME                                               READY   STATUS    RESTARTS      AGE
ceph-csi-controller-manager-b57b45fd5-ss74x        2/2     Running   0             83m
csi-addons-controller-manager-5c6998fcd6-gms8s     2/2     Running   0             83m
noobaa-operator-67f9cfc66f-vblmc                   1/1     Running   0             83m
ocs-operator-675d5b87f6-8bsbb                      1/1     Running   3 (83m ago)   84m
ocs-provider-server-6665fddf55-rdbgw               1/1     Running   0             80m
odf-console-6965b9774f-t7hlr                       1/1     Running   0             84m
odf-operator-controller-manager-6fd77bb6c4-9m9xr   2/2     Running   0             84m
rook-ceph-operator-6d6b755fc4-ftj8r                1/1     Running   0             83m
ux-backend-server-5c8999c8d8-dpsmc                 2/2     Running   0             84m



Expected results:

all ceph related pods should be created


Additional info:

ocs-operator log

{"level":"error","ts":"2024-10-17T13:28:43Z","logger":"controllers.StorageCluster","msg":"Failed to set node Topology Map for StorageCluster.","Request.Namespace":"openshift-storage","Request.Name":"ocs-storagecluster","StorageCluster":{"name":"ocs-storagecluster","namespace":"openshift-storage"},"error":"nodes \"compute-0\" is forbidden: User \"system:serviceaccount:openshift-storage:ocs-operator\" cannot patch resource \"nodes\" in API group \"\" at the cluster scope","stacktrace":"github.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*ocsTopologyMap).ensureCreated\n\t/remote-source/app/controllers/storagecluster/topology.go:29\ngithub.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*StorageClusterReconciler).reconcilePhases\n\t/remote-source/app/controllers/storagecluster/reconcile.go:455\ngithub.com/red-hat-storage/ocs-operator/v4/controllers/storagecluster.(*StorageClusterReconciler).Reconcile\n\t/remote-source/app/controllers/storagecluster/reconcile.go:190\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Reconcile\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:116\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).reconcileHandler\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:303\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).processNextWorkItem\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:263\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Start.func2.2\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:224"}
{"level":"error","ts":"2024-10-17T13:28:43Z","msg":"Reconciler error","controller":"storagecluster","controllerGroup":"ocs.openshift.io","controllerKind":"StorageCluster","StorageCluster":{"name":"ocs-storagecluster","namespace":"openshift-storage"},"namespace":"openshift-storage","name":"ocs-storagecluster","reconcileID":"d340653b-16a9-45b0-bf51-6251bcdb650a","error":"nodes \"compute-0\" is forbidden: User \"system:serviceaccount:openshift-storage:ocs-operator\" cannot patch resource \"nodes\" in API group \"\" at the cluster scope","stacktrace":"sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).reconcileHandler\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:316\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).processNextWorkItem\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:263\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller[...]).Start.func2.2\n\t/remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:224"}


job: https://url.corp.redhat.com/ab707c7
must-gather: https://url.corp.redhat.com/e53cca8