Seems like we are not looking at the right error...
```
2023-04-20 10:17:06.055843 E | ceph-file-controller: failed to reconcile failed to detect running and desired ceph version: failed to detect ceph image version: failed to complete ceph version job: failed to run CmdReporter ceph-file-controller-detect-version successfully. failed to delete existing results ConfigMap ceph-file-controller-detect-version. failed to delete ConfigMap ceph-file-controller-detect-version. client rate limiter Wait returned an error: context canceled
2023-04-20 10:17:06.055874 E | ceph-file-controller: failed to reconcile CephFilesystem "openshift-storage/ocs-storagecluster-cephfilesystem". failed to detect running and desired ceph version: failed to detect ceph image version: failed to complete ceph version job: failed to run CmdReporter ceph-file-controller-detect-version successfully. failed to delete existing results ConfigMap ceph-file-controller-detect-version. failed to delete ConfigMap ceph-file-controller-detect-version. client rate limiter Wait returned an error: context canceled
2023-04-20 10:17:06.055903 E | ceph-csi: failed to reconcile failed to get csi ceph.conf configmap: failed to get csi ceph.conf configmap "csi-ceph-conf-override" (in "openshift-storage"): client rate limiter Wait returned an error: context canceled
2023-04-20 10:17:06.055921 E | ceph-nodedaemon-controller: node reconcile failed: failed to create ceph-exporter metrics service: failed to update service rook-ceph-exporter. client rate limiter Wait returned an error: context canceled
```
@athakkar is looking at this.
*** Bug 2187951 has been marked as a duplicate of this bug. ***
*** Bug 2190413 has been marked as a duplicate of this bug. ***
Upstream tracker: https://github.com/rook/rook/issues/12331
Let's track this with the upstream issue, as there is no longer a downstream repro now that the OCS operator has fixed the frequent CephCluster CR update issue.