Description of problem: The current alert for CLO error rate logs at an arbitrary limit of 10rps. It should be based on a percentage because message counts can increase and decrease over time where 10 errors might be a very small amount of logs. Version-Release number of selected component (if applicable): 4.6 How reproducible: Steps to Reproduce: 1. 2. 3. Actual results: Expected results: Additional info:
Verified on clusterlogging.4.6.0-202008130129.p0 by making the kafka failed for 'error_class=Kafka::MessageSizeTooLarge'
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6.1 extras update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:4198