Bug 2147346

Summary: [RFE] New metric to provide rbd mirror image status and snapshot replication information
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Juan Miguel Olmo <jolmomar>
Component: Ceph-Mgr PluginsAssignee: Divyansh Kamboj <dkamboj>
Ceph-Mgr Plugins sub component: prometheus QA Contact: Sunil Angadi <sangadi>
Status: CLOSED ERRATA Docs Contact: ceph-docs <ceph-docs>
Severity: high    
Priority: high CC: kdreyer, pdiazbou, rzarzyns, tserlin
Version: 6.0Keywords: FutureFeature
Target Milestone: ---   
Target Release: 6.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-17.2.6-54.el9cp Doc Type: Enhancement
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-06-15 09:16:25 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2147345, 2147348    
Bug Blocks:    

Description Juan Miguel Olmo 2022-11-23 18:40:30 UTC
As a monitoring user i need a new metric with information about the replication status for each image and detailed information about sync schedule and status. 

The "productor" of this metric will  be the prometheus manager module

The source of the information is the RADOS information stored in the ceph cluster that can be obtained using the CLI commands:
rbd mirror image status POOL_NAME/IMAGE_NAME

The information about last snapshot synchronization must stored by the RBD mirror daemon

Example:

ceph_rbd_mirror_image_state

{  image: 
   pool: 
   local_site: 
   peer_site: 
   mode: (journal/snapshot) 
   snap_last_sync: timestamp of the last sync operation
   snap_last_sync_duration: time used to execute the last sync
   snap_last_bytes_transferred: number of bytes transfer on last sync operation
   snap_next_sync: timestamp of the next sync operation }
 = (0 = Unknown,1 = Error,2 = Syncing,3 = Starting Replay,4 = Replaying,5 = Stopping Replay,6 = Stopped)
Observations: 

"last_sync_duration", "last_sync_duration", "last_bytes_transferred" are information that must be updated by the RBD mirror daemon when it updates the "last_sync" field after each sync operation.

Comment 18 errata-xmlrpc 2023-06-15 09:16:25 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:3623