Description of problem: PrometheusOperatorListErrors fires despite list errors gone. We should reduce the range to be less than the for. Version-Release number of selected component (if applicable): 4.6+ How reproducible: Steps to Reproduce: 1. 2. 3. Actual results: Expected results: Additional info:
Tested with 4.6.0-0.nightly-2020-08-12-155346, range is 10m for PrometheusOperatorListErrors/PrometheusOperatorWatchErrors - alert: PrometheusOperatorListErrors annotations: message: Errors while performing List operations in controller {{$labels.controller}} in {{$labels.namespace}} namespace. expr: | (sum by (controller,namespace) (rate(prometheus_operator_list_operations_failed_total{job="prometheus-operator",namespace="openshift-monitoring"}[10m])) / sum by (controller,namespace) (rate(prometheus_operator_list_operations_total{job="prometheus-operator",namespace="openshift-monitoring"}[10m]))) > 0.4 for: 15m labels: severity: warning - alert: PrometheusOperatorWatchErrors annotations: message: Errors while performing Watch operations in controller {{$labels.controller}} in {{$labels.namespace}} namespace. expr: | (sum by (controller,namespace) (rate(prometheus_operator_watch_operations_failed_total{job="prometheus-operator",namespace="openshift-monitoring"}[10m])) / sum by (controller,namespace) (rate(prometheus_operator_watch_operations_total{job="prometheus-operator",namespace="openshift-monitoring"}[10m]))) > 0.4 for: 15m labels: severity: warning
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:4196