Bug 1792225

Summary:	prometheus cluster is not configured correctly when deploying multiple instances
Product:	[Red Hat Storage] Red Hat Ceph Storage	Reporter:	Giulio Fidente <gfidente>
Component:	Ceph-Ansible	Assignee:	Dimitri Savineau <dsavinea>
Status:	CLOSED ERRATA	QA Contact:	Nathan Weinberg <nweinber>
Severity:	high	Docs Contact:
Priority:	medium
Version:	4.0	CC:	aschoen, ceph-eng-bugs, ceph-qe-bugs, dsavinea, epuertat, fpantano, gabrioux, gmeno, hyelloji, nthomas, nweinber, pasik, tserlin, ykaul
Target Milestone:	rc	Flags:	hyelloji: needinfo-
Target Release:	4.1
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:	ceph-ansible-4.0.15-1.el8, ceph-ansible-4.0.15-1.el7	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2020-05-19 17:32:06 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks:	1760354, 1806495

Description Giulio Fidente 2020-01-17 10:32:22 UTC

when deploying multiple prometheus instances, for ha purposes, we need to use the --cluster.* options [1] for them to visualize data correctly if a node is down

currently if a node goes down there will be gaps in the monitoring data on that node only

1. https://prometheus.io/docs/alerting/alertmanager/#high-availability

Comment 10 errata-xmlrpc 2020-05-19 17:32:06 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:2231