Description of problem:
After upgrading to RHOCP 3.11.404, KubeAPILatencyHigh alerts are triggered. Below is a sample alert:

Labels
alertname = KubeAPILatencyHigh
cluster = abc.example.com
endpoint = https
job = apiserver
namespace = default
prometheus = openshift-monitoring/k8s
resource = controlplanes
scope = namespace
service = kubernetes
severity = critical
verb = DELETECOLLECTION

Annotations
message = The API server has an abnormal latency of 18812.224137931036 seconds for DELETECOLLECTION controlplanes.

The etcd, API server, and controller logs were checked and found to be clean.

Version-Release number of selected component (if applicable):
3.11.404

How reproducible:
NA

Steps to Reproduce:
1.
2.
3.

Actual results:
KubeAPILatencyHigh alerts are triggered.

Expected results:
The RHOCP cluster should not trigger KubeAPILatencyHigh alerts.

Additional info:
Did the alert clear out after some time?
The alert is not getting cleared.

Regards
Dhruv Gautam
Tested with ose-cluster-monitoring-operator/images/v3.11.445: DELETECOLLECTION is excluded from the list of verbs taken into account by the KubeAPILatencyHigh alert.
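For readers unfamiliar with how a verb exclusion affects the alert, here is a minimal sketch in Python. It is an illustration only, not the shipped rule: the exclusion list, the 4-second threshold, and the function names are assumptions, since the rule expression itself is not quoted in this report. It relies on one known fact, that Prometheus regex label matchers are fully anchored, so excluding DELETECOLLECTION does not also exclude DELETE.

```python
import re

# Assumed exclusion list, for illustration only. The actual list in the
# shipped alert rule is not quoted in this bug report.
EXCLUDED_VERBS = ["LIST", "WATCH", "WATCHLIST", "PROXY", "CONNECT", "DELETECOLLECTION"]
EXCLUDED_RE = re.compile("|".join(EXCLUDED_VERBS))


def verb_is_evaluated(verb: str) -> bool:
    """Return True if a series with this verb would pass a verb!~"..." matcher.

    fullmatch mirrors Prometheus, where regex matchers are fully anchored.
    """
    return EXCLUDED_RE.fullmatch(verb) is None


def firing_series(samples, threshold_seconds):
    """Keep samples whose verb is evaluated and whose latency exceeds the threshold."""
    return [
        s for s in samples
        if verb_is_evaluated(s["verb"]) and s["latency"] > threshold_seconds
    ]


# The sample alert from this report, plus a hypothetical slow DELETE for contrast.
samples = [
    {"verb": "DELETECOLLECTION", "resource": "controlplanes", "latency": 18812.224137931036},
    {"verb": "DELETE", "resource": "pods", "latency": 5.0},
]

# 4.0 seconds is a hypothetical threshold, not taken from the report.
remaining = firing_series(samples, threshold_seconds=4.0)
```

Under these assumptions, the DELETECOLLECTION series from the report is dropped before the threshold is applied, while the hypothetical DELETE series is still evaluated.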
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 3.11.452 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2150