Bug 1866020

Summary: Operator alerts for error count and should alert for error rate
Product: OpenShift Container Platform Reporter: Brett Jones <brejones>
Component: LoggingAssignee: Brett Jones <brejones>
Status: CLOSED ERRATA QA Contact: Anping Li <anli>
Severity: low Docs Contact:
Priority: unspecified    
Version: 4.6CC: aos-bugs, bdonahue
Target Milestone: ---   
Target Release: 4.6.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-10-27 15:09:34 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Brett Jones 2020-08-04 16:44:05 UTC
Description of problem:

The current alert for CLO error rate logs at an arbitrary limit of 10rps. It should be based on a percentage because message counts can increase and decrease over time where 10 errors might be a very small amount of logs. 

Version-Release number of selected component (if applicable):

4.6

How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 5 Anping Li 2020-08-18 08:13:27 UTC
Verified on clusterlogging.4.6.0-202008130129.p0 by making the kafka failed for 'error_class=Kafka::MessageSizeTooLarge'

Comment 7 errata-xmlrpc 2020-10-27 15:09:34 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.6.1 extras update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4198