Bug 2283959

Summary: prometheus not scraping metrics from service monitors, hence no metrics are avaialble on HCP AWS/ROSA like cluster
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: suchita <sgatfane>
Component: ocs-operatorAssignee: Kaustav Majumder <kmajumde>
Status: CLOSED ERRATA QA Contact: Daniel Osypenko <dosypenk>
Severity: urgent Docs Contact:
Priority: high    
Version: 4.16CC: asriram, kmajumde, kramdoss, muagarwa, nberry, nigoyal, odf-bz-bot, sheggodu
Target Milestone: ---   
Target Release: ODF 4.16.3   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of:
: 2295944 (view as bug list) Environment:
Last Closed: 2024-10-15 08:52:41 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2295944    

Description suchita 2024-05-30 08:56:10 UTC
Description of problem (please be detailed as possible and provide log
snippests):
prometheus not scraping metrics from service monitors, hence no metrics are avaialble on HCP AWS/ROSA like cluster.

Version of all relevant components (if applicable):
$ oc get clusterversion
NAME      VERSION       AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.16.0-rc.2   True        False         41h     Cluster version is 4.16.0-rc.2
[jenkins@temp-jagent-sgatfane-hr12 auth]$ oc get csv
NAME                                         DISPLAY                            VERSION             REPLACES   PHASE
mcg-operator.v4.16.0-108.stable              NooBaa Operator                    4.16.0-108.stable              Succeeded
ocs-client-operator.v4.16.0-108.stable       OpenShift Data Foundation Client   4.16.0-108.stable              Succeeded
ocs-operator.v4.16.0-108.stable              OpenShift Container Storage        4.16.0-108.stable              Succeeded
odf-csi-addons-operator.v4.16.0-108.stable   CSI Addons                         4.16.0-108.stable              Succeeded
odf-operator.v4.16.0-108.stable              OpenShift Data Foundation          4.16.0-108.stable              Succeeded
odf-prometheus-operator.v4.16.0-108.stable   Prometheus Operator                4.16.0-108.stable              Succeeded
recipe.v4.16.0-108.stable                    Recipe                             4.16.0-108.stable              Succeeded
rook-ceph-operator.v4.16.0-108.stable        Rook-Ceph                          4.16.0-108.stable              Succeeded


Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Partially

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
2

Can this issue reproducible?
yes

Can this issue reproduce from the UI?
yes

If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1. Deploy the AWS HCP cluster with Rosa Tagging as guided by doc 
https://docs.google.com/document/d/1YzmzNAP4sEsH3x4FVDcCUVupgR7Q3Tf6CN4JpDjxXZU/edit?usp=sharing

2. oc port-forward prometheus-odf-prometheus-0 9090 -n odf-storage
3. open a web with "localhost:9090"
4. check the matrics graphs or brows for ceph related matrics


Actual results:
no metrics are avaialble on HCP AWS/ROSA like cluster.

Expected results:
Metrics should be avaialble.

Additional info:

Comment 4 Kaustav Majumder 2024-05-30 09:03:45 UTC
Fix present in main but not backported to 4.16.
Attached backport pr

Comment 8 Sunil Kumar Acharya 2024-06-18 06:45:26 UTC
Please update the RDT flag/text appropriately.

Comment 27 errata-xmlrpc 2024-10-15 08:52:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.16.3 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:8113