Bug 1878777 - [RFE][GSS]HAproxy/keepalived setup for Grafana & Prometheus on Ceph cluster
Summary: [RFE][GSS]HAproxy/keepalived setup for Grafana & Prometheus on Ceph cluster
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Cephadm
Version: 4.0
Hardware: x86_64
OS: Linux
unspecified
medium
Target Milestone: ---
: 8.0
Assignee: Redouane Kachach Elhichou
QA Contact: Vinayak Papnoi
URL:
Whiteboard:
Depends On:
Blocks: 2317218
TreeView+ depends on / blocked
 
Reported: 2020-09-14 13:49 UTC by Lijo Stephen Thomas
Modified: 2024-11-25 08:58 UTC (History)
14 users (show)

Fixed In Version: ceph-19.1.1-94.el9cp
Doc Type: Enhancement
Doc Text:
.High Availability can now be deployed for the Grafana, Prometheus, and Alertmanager monitoring stacks With this enhancement, the cephadm `mgmt-gateway` service offers better reliability and ensures uninterrupted monitoring by allowing these critical services to function seamlessly, even during the event of an individual instance failure. High availability is crucial for maintaining visibility into the health and performance of the Ceph cluster and responding promptly to any issues. Use High Availability for continuous, uninterrupted operations to improve the stability and resilience of the Ceph cluster. For more information, see data-security/dash_using_the_ceph_management_gateway.
Clone Of:
Environment:
Last Closed: 2024-11-25 08:58:42 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHCEPH-4328 0 None None None 2022-05-18 11:22:20 UTC
Red Hat Product Errata RHBA-2024:10216 0 None None None 2024-11-25 08:58:49 UTC

Description Lijo Stephen Thomas 2020-09-14 13:49:26 UTC
Description of problem:

Customer would like to have HA setup for Grafana & Prometheus on Ceph cluster. 
Because if a single node hosting prometheus + alertmanager is down or pods are not running, we will lose metrics & alerting , which is not very critical but it's better to have metrics and alerts all the time.


Version-Release number of selected component (if applicable):
RHCS 4.x


Additional info:

We already have HA-proxy setup for dashboard in upstream...[1] Can we have something similar for prometheus and grafana.


Let me know if any additional details is required around the same.

[1] https://docs.ceph.com/docs/master/mgr/dashboard/

Comment 18 errata-xmlrpc 2024-11-25 08:58:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 8.0 security, bug fix, and enhancement updates), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:10216


Note You need to log in before you can comment on or make changes to this bug.