Bug 1890808 - New etcd alerts need to be added to the monitoring stack
Summary: New etcd alerts need to be added to the monitoring stack
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Etcd
Version: 4.6
Hardware: Unspecified
OS: Linux
unspecified
low
Target Milestone: ---
: 4.7.0
Assignee: Sam Batschelet
QA Contact: ge liu
URL:
Whiteboard: aos-scalability-46
Depends On:
Blocks: 1960465
TreeView+ depends on / blocked
 
Reported: 2020-10-22 21:35 UTC by Naga Ravi Chaitanya Elluri
Modified: 2021-06-08 12:58 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: Enhancement
Doc Text:
Feature: improved etcd alerting - critical alert when the etcd database quota is 95% full. - warning alert when there is a sudden surge in etcd writes leading to increase in the etcd database quota size. - critical alert when 99th percentile of wal fsync duration is greater than 1 second. Reason: cluster admin should have accurate observability regarding operand health. Result: alerting will more accurately reflect actual observed health of etcd
Clone Of:
: 1960465 (view as bug list)
Environment:
Last Closed: 2021-02-24 15:27:53 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift cluster-monitoring-operator pull 963 0 None closed Bug 1890808: bump mixins to include new etcd alerts 2021-02-20 09:20:50 UTC
Red Hat Product Errata RHSA-2020:5633 0 None None None 2021-02-24 15:28:20 UTC

Description Naga Ravi Chaitanya Elluri 2020-10-22 21:35:28 UTC
Description of problem:
We have a couple of new alerts around etcd: https://github.com/etcd-io/etcd/pull/12249, https://github.com/etcd-io/etcd/pull/12266. Cluster monitoring operator need to be modified to pick them up.

Actual results:
New etcd alerts are missing.

Expected results:
New etcd alerts are present and active.

Comment 8 errata-xmlrpc 2021-02-24 15:27:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:5633


Note You need to log in before you can comment on or make changes to this bug.