Bug 2004478

Summary: MGR related alerts are not working
Product: [Red Hat Storage] Red Hat OpenShift Container Storage Reporter: Filip Balák <fbalak>
Component: odf-managed-serviceAssignee: Dhruv Bindra <dbindra>
Status: CLOSED CURRENTRELEASE QA Contact: Filip Balák <fbalak>
Severity: urgent Docs Contact:
Priority: high    
Version: 4.8CC: aeyal, ebenahar, ocs-bugs, omitrani, sabose
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-12-16 19:49:18 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2006323    
Bug Blocks:    

Description Filip Balák 2021-09-15 12:21:16 UTC
Description of problem:
When Ceph MGR is missing, there is no alert sent to PagerDuty.

Version-Release number of selected component (if applicable):
ocs-operator.v4.8.1
ocs-osd-deployer-qe.v1.1.0

How reproducible:
2/2

Steps to Reproduce:
1. Scale deployment of rook-ceph-mgr-a to 0.
2. Check PagerDuty system

Actual results:
No alert is sent within 10 minutes

Expected results:
Alert is sent according to alerting rule (after 5 minutes)

Additional info:
Tested on ROSA cluster.

Comment 1 Filip Balák 2021-12-14 15:17:25 UTC
Alert is propagated correctly and when the deployment is up again, the alert is cleared correctly. --> VERIFIED

Tested with:
ocs-operator.v4.8.5
ocs-osd-deployer-qe.v1.1.2
ocp 4.9.9