Bug 2283959 - prometheus not scraping metrics from service monitors, hence no metrics are avaialble on HCP AWS/ROSA like cluster
Summary: prometheus not scraping metrics from service monitors, hence no metrics are a...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: ocs-operator
Version: 4.16
Hardware: Unspecified
OS: Unspecified
high
urgent
Target Milestone: ---
: ODF 4.16.3
Assignee: Kaustav Majumder
QA Contact: Daniel Osypenko
URL:
Whiteboard:
Depends On:
Blocks: 2295944
TreeView+ depends on / blocked
 
Reported: 2024-05-30 08:56 UTC by suchita
Modified: 2024-10-15 08:52 UTC (History)
8 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
: 2295944 (view as bug list)
Environment:
Last Closed: 2024-10-15 08:52:41 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github red-hat-storage ocs-operator pull 2637 0 None open [release-4.16] Adding empty label selector for prom service monitor selector 2024-05-30 09:02:49 UTC
Red Hat Product Errata RHSA-2024:8113 0 None None None 2024-10-15 08:52:50 UTC

Description suchita 2024-05-30 08:56:10 UTC
Description of problem (please be detailed as possible and provide log
snippests):
prometheus not scraping metrics from service monitors, hence no metrics are avaialble on HCP AWS/ROSA like cluster.

Version of all relevant components (if applicable):
$ oc get clusterversion
NAME      VERSION       AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.16.0-rc.2   True        False         41h     Cluster version is 4.16.0-rc.2
[jenkins@temp-jagent-sgatfane-hr12 auth]$ oc get csv
NAME                                         DISPLAY                            VERSION             REPLACES   PHASE
mcg-operator.v4.16.0-108.stable              NooBaa Operator                    4.16.0-108.stable              Succeeded
ocs-client-operator.v4.16.0-108.stable       OpenShift Data Foundation Client   4.16.0-108.stable              Succeeded
ocs-operator.v4.16.0-108.stable              OpenShift Container Storage        4.16.0-108.stable              Succeeded
odf-csi-addons-operator.v4.16.0-108.stable   CSI Addons                         4.16.0-108.stable              Succeeded
odf-operator.v4.16.0-108.stable              OpenShift Data Foundation          4.16.0-108.stable              Succeeded
odf-prometheus-operator.v4.16.0-108.stable   Prometheus Operator                4.16.0-108.stable              Succeeded
recipe.v4.16.0-108.stable                    Recipe                             4.16.0-108.stable              Succeeded
rook-ceph-operator.v4.16.0-108.stable        Rook-Ceph                          4.16.0-108.stable              Succeeded


Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Partially

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
2

Can this issue reproducible?
yes

Can this issue reproduce from the UI?
yes

If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1. Deploy the AWS HCP cluster with Rosa Tagging as guided by doc 
https://docs.google.com/document/d/1YzmzNAP4sEsH3x4FVDcCUVupgR7Q3Tf6CN4JpDjxXZU/edit?usp=sharing

2. oc port-forward prometheus-odf-prometheus-0 9090 -n odf-storage
3. open a web with "localhost:9090"
4. check the matrics graphs or brows for ceph related matrics


Actual results:
no metrics are avaialble on HCP AWS/ROSA like cluster.

Expected results:
Metrics should be avaialble.

Additional info:

Comment 4 Kaustav Majumder 2024-05-30 09:03:45 UTC
Fix present in main but not backported to 4.16.
Attached backport pr

Comment 8 Sunil Kumar Acharya 2024-06-18 06:45:26 UTC
Please update the RDT flag/text appropriately.

Comment 27 errata-xmlrpc 2024-10-15 08:52:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.16.3 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:8113


Note You need to log in before you can comment on or make changes to this bug.