Bug 2006342

Summary: CephMdsMissingReplicas alert is not propagated into PagerDuty
Product: [Red Hat Storage] Red Hat OpenShift Container Storage Reporter: Filip Balák <fbalak>
Component: odf-managed-serviceAssignee: Dhruv Bindra <dbindra>
Status: CLOSED CURRENTRELEASE QA Contact: Elena Bondarenko <ebondare>
Severity: high Docs Contact:
Priority: unspecified    
Version: 4.8CC: aeyal, dbindra, ebondare, ocs-bugs, omitrani, sabose
Target Milestone: ---Keywords: TestBlocker
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-12-16 19:49:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Filip Balák 2021-09-21 14:02:27 UTC
Description of problem:
When one of rook-ceph-mds-ocs-storagecluster-cephfilesystem deployments doesn't have replica, the alert CephMdsMissingReplicas is not shown in PagerDuty.

Version-Release number of selected component (if applicable):
ocs-operator.v4.8.1
ocs-osd-deployer-qe.v1.1.0

How reproducible:
2/2

Steps to Reproduce:
1. Scale down deployment rook-ceph-mds-ocs-storagecluster-cephfilesystem-a to 0
2. Check alerts in PagerDuty

Actual results:
The alert is not propagated. In internal prometheus it is visible but not in PagerDuty.

Expected results:
The alert should be propagated to PagerDuty when using ODF Managed Service addon.

Additional info:
To access internal prometheus locally, the port must be forwarded:
 $ oc port-forward svc/prometheus-operated 9090 -n openshift-storage

Comment 1 Elena Bondarenko 2021-12-14 15:37:52 UTC
Filip ran all pagerduty tests with 
ocs-operator.v4.8.5
ocs-osd-deployer-qe.v1.1.2
ocp 4.9.9

Alerts are working correctly