Bug 2227781

Summary: ceph_rbd_* metrics are missing
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Filip Balák <fbalak>
Component: ceph-monitoringAssignee: avan <athakkar>
Status: CLOSED CURRENTRELEASE QA Contact: Filip Balák <fbalak>
Severity: high Docs Contact:
Priority: unspecified    
Version: 4.13CC: jolmomar, kramdoss, ocs-bugs, odf-bz-bot
Target Milestone: ---Keywords: Regression
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-07-31 14:44:12 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Filip Balák 2023-07-31 12:44:10 UTC
Description of problem (please be detailed as possible and provide log
snippests):
There are missing rbd metrics:

ceph_rbd_write_ops
ceph_rbd_read_ops
ceph_rbd_write_bytes
ceph_rbd_read_bytes
ceph_rbd_write_latency_sum
ceph_rbd_write_latency_count


Version of all relevant components (if applicable):
ODF 4.13.1-9
OCP 4.13


Steps to Reproduce:
1. Install OCP/ODF cluster
2. After installation, check whether Prometheus provides values for ceph_rbd_* metrics listed above.

Actual results:
OCP Prometheus provides no values for any of the ceph_rbd_* metrics listed above.

Expected results:
OCP Prometheus provides values for all ceph_rbd_* metrics listed above.

Additional info:
This was discovered as part of regression runs:
https://reportportal-ocs4.apps.ocp-c1.prod.psi.redhat.com/ui/#ocs/launches/465/12972/594265/594266/594269/log?item0Params=filter.eq.hasStats%3Dtrue%26filter.eq.hasChildren%3Dfalse%26filter.in.issueType%3Dti001%252Cti_1h7tquhpjupuu%252Cti_u7ukrfvrt1yu%252Cti_qxkzvw4t6ipf%252Cti_1h7u8s8jf8tvb
https://reportportal-ocs4.apps.ocp-c1.prod.psi.redhat.com/ui/#ocs/launches/465/12961/593770/593771/593774/log?item0Params=filter.eq.hasStats%3Dtrue%26filter.eq.hasChildren%3Dfalse%26filter.in.issueType%3Dti001%252Cti_1h7tquhpjupuu%252Cti_u7ukrfvrt1yu%252Cti_qxkzvw4t6ipf%252Cti_1h7u8s8jf8tvb
https://reportportal-ocs4.apps.ocp-c1.prod.psi.redhat.com/ui/#ocs/launches/465/12961/593770/593771/593774/log?item0Params=filter.eq.hasStats%3Dtrue%26filter.eq.hasChildren%3Dfalse%26filter.in.issueType%3Dti001%252Cti_1h7tquhpjupuu%252Cti_u7ukrfvrt1yu%252Cti_qxkzvw4t6ipf%252Cti_1h7u8s8jf8tvb

https://bugzilla.redhat.com/show_bug.cgi?id=1779336 was closed as won't fix but this test used to pass with previous version where those metrics were present. (e.g. https://reportportal-ocs4.apps.ocp-c1.prod.psi.redhat.com/ui/#ocs/launches/465/13057/597442/597443/597446/log)

Comment 3 Filip Balák 2023-07-31 14:44:12 UTC
Findings in this bug are not relevant because this failed only in external mode where a different version of ceph was used (16.2.10-172). -> CLOSED