Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 2298621

Summary: [RFE] multisite sync observability: tracking sync deltas over time(in Grafana)
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Ankush Behl <anbehl>
Component: Ceph-DashboardAssignee: Ankush Behl <anbehl>
Status: CLOSED ERRATA QA Contact: Chaithra <ckulal>
Severity: medium Docs Contact: Akash Raj <akraj>
Priority: unspecified    
Version: 8.0CC: aasharma, akraj, ceph-eng-bugs, cephqe-warriors, ckulal, epuertat, rpollack
Target Milestone: ---Keywords: FutureFeature
Target Release: 8.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-19.1.0-3 Doc Type: Enhancement
Doc Text:
.New RGW Sync overview dashboard in Grafana With this release, you can now track replication differences over a time per shard from within the new RGW Sync overview dashboard in Grafana.
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-11-25 09:03:09 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2317218    

Description Ankush Behl 2024-07-18 08:10:19 UTC
This bug was initially created as a copy of Bug #2247183

I am copying this bug because: 



Description of problem:

Currently, there is no easy way for an administrator to check the sync replication status between zones.  

Goal:

multisite sync observability: tracking sync deltas over time

Our proposed feature will increase the observability of the RGW multisite sync operations. It will provide administrators with real-time information about the replication health between zones. This will enable the admin to assess if the pending sync replication work is converging as expected or diverging. If it diverges and increases beyond a certain threshold, an alert can be configured in the alert manager to fire a warning.

To present this information to the user, we will use Prometheus to gather data and create a Grafana dashboard with data points representing the oldest incremental change not applied from the sync status command to populate the graph over time. 

The Grafana dashboard will display a slope to help us assess if the pending sync deltas are reducing or increasing over time. The ‘deltas’ will be sent from all the zones replicated in the zone group to Prometheus via the node-exporter.

Ideally, further down the line, we will be able to do similar work to have per-bucket granularity sync information in Prometheus so we can adhere to the bucket sync policy granularity that provides the user with a way to enable/disable bucket sync through the S3 API.

Comment 1 Storage PM bot 2024-07-18 08:10:27 UTC
Please specify the severity of this bug. Severity is defined here:
https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.

Comment 12 errata-xmlrpc 2024-11-25 09:03:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 8.0 security, bug fix, and enhancement updates), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:10216

Comment 13 Red Hat Bugzilla 2025-03-26 04:25:43 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days