Bug 1792225

Summary: prometheus cluster is not configured correctly when deploying multiple instances
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Giulio Fidente <gfidente>
Component: Ceph-AnsibleAssignee: Dimitri Savineau <dsavinea>
Status: CLOSED ERRATA QA Contact: Nathan Weinberg <nweinber>
Severity: high Docs Contact:
Priority: medium    
Version: 4.0CC: aschoen, ceph-eng-bugs, ceph-qe-bugs, dsavinea, epuertat, fpantano, gabrioux, gmeno, hyelloji, nthomas, nweinber, pasik, tserlin, ykaul
Target Milestone: rcFlags: hyelloji: needinfo-
Target Release: 4.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-ansible-4.0.15-1.el8, ceph-ansible-4.0.15-1.el7 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-05-19 17:32:06 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1760354, 1806495    

Description Giulio Fidente 2020-01-17 10:32:22 UTC
when deploying multiple prometheus instances, for ha purposes, we need to use the --cluster.* options [1] for them to visualize data correctly if a node is down

currently if a node goes down there will be gaps in the monitoring data on that node only

1. https://prometheus.io/docs/alerting/alertmanager/#high-availability

Comment 10 errata-xmlrpc 2020-05-19 17:32:06 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:2231