Bug 1940882 - Incorrect number of alerts in prometheus and in console
Keywords:
Status: CLOSED DUPLICATE of bug 1926598
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Monitoring
Version: 4.8
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 4.8.0
Assignee: Sergiusz Urbaniak
QA Contact: Junqi Zhao
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2021-03-19 13:23 UTC by Pawel Krupa
Modified: 2021-03-23 09:22 UTC

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-03-23 09:22:12 UTC
Target Upstream Version:
Embargoed:

Description Pawel Krupa 2021-03-19 13:23:56 UTC
Description of problem:

The number of alerting rules shown in the Console is higher than the number reported by Prometheus. Some rules are duplicated.
The number of rules in the Console is the same as the number returned by thanos-querier.

Version-Release number of selected component (if applicable): 4.8


How reproducible: Always


Steps to Reproduce:
1. Launch cluster
2. Port-forward to in-cluster prometheus
3. curl -s http://localhost:9090/api/v1/rules | jq '[.data.groups[].rules[] | select(.type=="alerting")] | length'
4. Port-forward to thanos-querier
5. curl -s http://localhost:9090/api/v1/rules | jq '[.data.groups[].rules[] | select(.type=="alerting")] | length'
6. Compare numbers from #3 and #5
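
The comparison in the steps above can also be scripted. The following is a minimal Python sketch, not part of the original report. It assumes the standard /api/v1/rules response shape (data.groups[].rules[] with a "type" field) and mirrors the jq filter. The helper names and the two sample payloads are hypothetical and made up for illustration.

```python
import json
import urllib.request


def count_alerting_rules(payload):
    """Count rules with type == "alerting" across all groups (same as the jq filter)."""
    return sum(
        1
        for group in payload["data"]["groups"]
        for rule in group["rules"]
        if rule.get("type") == "alerting"
    )


def fetch_rules(base_url):
    """GET <base_url>/api/v1/rules and decode the JSON body.

    base_url is whatever the port-forward exposes, e.g. http://localhost:9090.
    """
    with urllib.request.urlopen(base_url + "/api/v1/rules") as resp:
        return json.load(resp)


# Made-up payloads for illustration only; not captured from a real cluster.
prometheus_sample = {
    "status": "success",
    "data": {"groups": [{"name": "example-group", "rules": [
        {"name": "ExampleAlert", "query": "up == 0", "type": "alerting"},
        {"name": "example:record", "query": "sum(up)", "type": "recording"},
    ]}]},
}
querier_sample = {
    "status": "success",
    "data": {"groups": [{"name": "example-group", "rules": [
        {"name": "ExampleAlert", "query": "up == 0", "type": "alerting"},
        {"name": "ExampleAlert", "query": "up == 0", "type": "alerting"},
        {"name": "example:record", "query": "sum(up)", "type": "recording"},
    ]}]},
}

if __name__ == "__main__":
    a = count_alerting_rules(prometheus_sample)
    b = count_alerting_rules(querier_sample)
    print("prometheus:", a, "thanos-querier:", b, "match:", a == b)
```

Against a live cluster, count_alerting_rules(fetch_rules("http://localhost:9090")) would be run once per port-forward, and the two results compared.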

Actual results: Numbers don't match


Expected results: Numbers should match


Additional info: 
The primary suspect is incorrect alert deduplication in thanos-querier.
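
To see which rules are duplicated, the rules response can be grouped by rule identity. This is a rough Python sketch and not from the original report. Treating (group name, rule name, query) as the identity of a rule is an assumption; real duplicates may differ in labels or other fields. The function name and sample payload are hypothetical.

```python
from collections import Counter


def duplicated_alerting_rules(payload):
    """Return {(group, rule name, query): count} for alerting rules seen more than once.

    Assumption: (group name, rule name, query) identifies a rule.
    """
    counts = Counter(
        (group["name"], rule["name"], rule["query"])
        for group in payload["data"]["groups"]
        for rule in group["rules"]
        if rule.get("type") == "alerting"
    )
    return {key: n for key, n in counts.items() if n > 1}


# Made-up payload for illustration only: the same group appears twice,
# as it might if two replicas' results were merged without deduplication.
sample = {
    "status": "success",
    "data": {"groups": [
        {"name": "example-group", "rules": [
            {"name": "ExampleAlert", "query": "up == 0", "type": "alerting"},
        ]},
        {"name": "example-group", "rules": [
            {"name": "ExampleAlert", "query": "up == 0", "type": "alerting"},
        ]},
        {"name": "other-group", "rules": [
            {"name": "OtherAlert", "query": "up == 1", "type": "alerting"},
        ]},
    ]},
}

if __name__ == "__main__":
    for key, n in duplicated_alerting_rules(sample).items():
        print(n, "x", key)
```

Running this against the thanos-querier response and then the in-cluster Prometheus response would show whether the extra rules exist only on the querier side.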

Initial slack thread: https://coreos.slack.com/archives/G79AW9Q7R/p1616158687116100

Comment 4 hongyan li 2021-03-23 09:13:10 UTC
Duplicate of bug 1926598.

Comment 5 hongyan li 2021-03-23 09:22:12 UTC

*** This bug has been marked as a duplicate of bug 1926598 ***