Bug 2207713

Summary: Create RBD mirror monitoring related alerts in Ceph mixins
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Juan Miguel Olmo <jolmomar>
Component: RBD-MirrorAssignee: arun kumar mohan <amohan>
Status: CLOSED ERRATA QA Contact: Sunil Angadi <sangadi>
Severity: high Docs Contact:
Priority: high    
Version: 6.1CC: amohan, ceph-eng-bugs, cephqe-warriors, idryomov, jdurgin, sangadi, saraut, sostapov, tserlin
Target Milestone: ---   
Target Release: 7.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-18.2.1-145.el9cp Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-06-13 14:20:26 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Juan Miguel Olmo 2023-05-16 15:39:31 UTC

Comment 1 RHEL Program Management 2023-05-16 15:39:39 UTC
Please specify the severity of this bug. Severity is defined here:
https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.

Comment 2 Juan Miguel Olmo 2023-05-16 15:42:23 UTC
As monitoring user, we will need a new set of alert to reflect any problem in the rbd mirroring process. The alert should raise any time than an image is not being replicated properly, and inform about the problem and possible solution.

The list of alerts and description is:
CephRBDMirrorImagesPerDaemonHigh:
Number of image replications are now above 100

CephRBDMirrorImagesNotInSync
Some of the RBD mirror images are not in sync with the remote counter parts.

CephRBDMirrorImagesNotInSyncVeryHigh
Number of unsynchronized images are very high

CephRBDMirrorImageTransferBandwidthHigh:
The replication network usage has been increased over 80% in the last 30 minutes.

Comment 9 Ilya Dryomov 2024-03-22 11:06:05 UTC
*** Bug 2270945 has been marked as a duplicate of this bug. ***

Comment 11 arun kumar mohan 2024-03-28 09:28:18 UTC
Backport PR: https://github.com/ceph/ceph/pull/56552 (for reef branch) has submitted.
PS: please let me know if this has to be further backported to older ceph releases

Comment 18 errata-xmlrpc 2024-06-13 14:20:26 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Critical: Red Hat Ceph Storage 7.1 security, enhancements, and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:3925