Bug 2144532

Summary:	Alert 'ClusterObjectStoreState' is not triggered when RGW interface is unavailable
Product:	[Red Hat Storage] Red Hat OpenShift Data Foundation	Reporter:	Filip Balák <fbalak>
Component:	ceph-monitoring	Assignee:	Juan Miguel Olmo <jolmomar>
Status:	CLOSED CURRENTRELEASE	QA Contact:	Mahesh Shetty <mashetty>
Severity:	high	Docs Contact:
Priority:	unspecified
Version:	4.8	CC:	ebenahar, jolmomar, muagarwa, nthomas, ocs-bugs, odf-bz-bot, prasriva
Target Milestone:	---	Keywords:	Regression
Target Release:	ODF 4.12.0
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	No Doc Update
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2023-02-08 14:06:28 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Filip Balák 2022-11-21 15:21:21 UTC

Description of problem (please be as detailed as possible and provide log
snippets):
During the ocs-ci tier4a tests, the following test fails because the "ClusterObjectStoreState" alert is not generated when the RGW interface is unavailable:

"tests/manage/monitoring/prometheus/test_rgw.py::test_rgw_unavailable"

Version of all relevant components (if applicable):
OCP 4.8
OCS 4.8.16-1

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?

The tier4a test execution results in failures

Is there any workaround available to the best of your knowledge?
NA

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1

Is this issue reproducible?
?

Can this issue be reproduced from the UI?
Yes

If this is a regression, please provide more details to justify this:
This was already fixed in https://bugzilla.redhat.com/show_bug.cgi?id=1948378

Steps to Reproduce:
1. Install ODF 4.8.16-1 
2. Scale the deployment rook-ceph-rgw-ocs-storagecluster-cephobjectstore-a down to zero replicas:

oc -n openshift-storage scale --replicas=0 deployment/rook-ceph-rgw-ocs-storagecluster-cephobjectstore-a

3. Check for the alert "ClusterObjectStoreState", which should be generated when the RGW interface is unavailable.
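
The check in step 3 could be scripted roughly as follows. This is only a minimal sketch, not the ocs-ci implementation: it assumes the alert list has already been fetched from the cluster Prometheus HTTP API endpoint /api/v1/alerts and decoded from JSON, and the helper name find_alerts and the sample payload are hypothetical.

```python
def find_alerts(payload, alertname, states=("pending", "firing")):
    """Return alerts named `alertname` whose state is one of `states`.

    `payload` is the decoded JSON body of a Prometheus
    GET /api/v1/alerts response (assumed to be fetched elsewhere).
    """
    alerts = payload.get("data", {}).get("alerts", [])
    return [
        alert
        for alert in alerts
        if alert.get("labels", {}).get("alertname") == alertname
        and alert.get("state") in states
    ]


# Hypothetical payload mimicking the behavior reported in this bug:
# RGW is scaled down, but only an unrelated alert is present.
sample_payload = {
    "status": "success",
    "data": {
        "alerts": [
            {
                "labels": {"alertname": "SomeOtherAlert", "severity": "warning"},
                "state": "firing",
            }
        ]
    },
}

matches = find_alerts(sample_payload, "ClusterObjectStoreState")
print("alert present" if matches else "alert missing")
```

With the sample payload the helper returns an empty list, which corresponds to the "Actual results" below. On a cluster without this bug, an entry with alertname "ClusterObjectStoreState" would be expected once the RGW deployment has been scaled down.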

Actual results:
No alert is generated.

Expected results:
The "ClusterObjectStoreState" alert should be generated when the RGW interface is unavailable.

Additional info:
Polarion link: https://reportportal-ocs4.apps.ocp-c1.prod.psi.redhat.com/ui/#ocs/launches/270/5882/239312/239332/239333/log?item0Params=filter.eq.hasStats%3Dtrue%26filter.eq.hasChildren%3Dfalse%26filter.in.issueType%3Dti001%252Cti_1h7tquhpjupuu%252Cti_u7ukrfvrt1yu%252Cti_qxkzvw4t6ipf%252Cti_1h7u8s8jf8tvb
Test logs: http://magna002.ceph.redhat.com/ocsci-jenkins/openshift-clusters/j-044vue1cslv33-t4c/j-044vue1cslv33-t4c_20221028T025045/logs/ocs-ci-logs-1666928754/by_outcome/failed/tests/manage/monitoring/prometheus/test_rgw.py/test_rgw_unavailable/logs

Comment 19 Red Hat Bugzilla 2023-12-08 04:31:27 UTC

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days.