Description of problem (please be as detailed as possible and provide log snippets):
We need to add a 'managedBy' label to all the metrics generated by rook-ceph-exporter. This label is needed for displaying results/alerts in the OpenShift console UI.

Version of all relevant components (if applicable):
ODF 4.14 and above

Does this issue impact your ability to continue to work with the product (please explain in detail what is the user impact)?
Not a show-stopper, but it will hinder the alert information provided to the customer in a multi-storagecluster scenario: the customer won't be able to tell which cluster an alert came from.

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this bug (1 - very simple, 5 - very complex)?

Is this issue reproducible?
Yes

Can this issue be reproduced from the UI?

If this is a regression, please provide more details to justify this:
No

Steps to Reproduce:
1.
2.
3.

Actual results:

Expected results:

Additional info:
Is this a must for 4.15?
Yes Mudit, it is a blocker for 4.15
PR added: https://github.com/red-hat-storage/ocs-operator/pull/2433
The fix adds a `managedBy` label to all the metrics generated by the `rook-ceph-exporter` pod.

To verify, confirm these TWO things:

a. ServiceMonitor endpoint check
The `rook-ceph-exporter` ServiceMonitor should have the following 'spec' entry:
spec -> endpoints -> relabeling -> {action: replace, replacement: ocs-storagecluster, targetLabel: managedBy}

b. Check the metrics generated by `rook-ceph-exporter`
In the ODF Console, open the 'Observe' tab (left side) and click 'Metrics'. One of the metrics generated by ceph-exporter is "ceph_mon_num_elections". Enter this metric name in the query field and run the query. The query result should include the 'managedBy' label, with the name of the current storagesystem as its value.

If both checks pass, the fix is verified.
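Both checks above can also be scripted. The following is only a rough sketch, assuming the ServiceMonitor has already been loaded into a Python dict (for example from `oc get servicemonitor ... -o json`) and that the query result is in the standard Prometheus HTTP API JSON shape. The function names and the sample data are hypothetical and used purely for illustration. The comment above writes the key as `relabeling`, while the Prometheus Operator CRD spells it `relabelings`, so the sketch accepts either.

```python
from typing import Any, Dict, List


def has_managed_by_relabeling(service_monitor: Dict[str, Any], expected_value: str) -> bool:
    """Check (a): True if any endpoint has the managedBy 'replace' rule."""
    endpoints = service_monitor.get("spec", {}).get("endpoints", [])
    for endpoint in endpoints:
        # Accept both spellings of the key (assumption, see lead-in).
        rules = endpoint.get("relabelings") or endpoint.get("relabeling") or []
        for rule in rules:
            if (
                rule.get("action") == "replace"
                and rule.get("targetLabel") == "managedBy"
                and rule.get("replacement") == expected_value
            ):
                return True
    return False


def series_without_managed_by(
    query_response: Dict[str, Any], expected_value: str
) -> List[Dict[str, str]]:
    """Check (b): label sets of series whose managedBy is missing or different."""
    results = query_response.get("data", {}).get("result", [])
    return [
        series.get("metric", {})
        for series in results
        if series.get("metric", {}).get("managedBy") != expected_value
    ]


# Hypothetical sample data, not taken from a real cluster.
sample_monitor = {
    "spec": {
        "endpoints": [
            {
                "relabelings": [
                    {
                        "action": "replace",
                        "replacement": "ocs-storagecluster",
                        "targetLabel": "managedBy",
                    }
                ]
            }
        ]
    }
}

sample_response = {
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {
                "metric": {
                    "__name__": "ceph_mon_num_elections",
                    "ceph_daemon": "mon.a",
                    "managedBy": "ocs-storagecluster",
                },
                "value": [1700000000, "12"],
            },
            {
                "metric": {"__name__": "ceph_mon_num_elections", "ceph_daemon": "mon.b"},
                "value": [1700000000, "12"],
            },
        ],
    },
}

if __name__ == "__main__":
    print(has_managed_by_relabeling(sample_monitor, "ocs-storagecluster"))
    print(series_without_managed_by(sample_response, "ocs-storagecluster"))
```

An empty list from `series_without_managed_by` would mean every returned series carries the expected label. In the sample data the second series deliberately lacks it, so it is reported.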
Adding the doc text, please take a look. Thanks
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.15.0 security, enhancement, & bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2024:1383