Bug 2058517

Summary: SNO cluster: poddisruptionbudget ovn-raft-quorum-guard caused alert PodDisruptionBudgetAtLimit
Product: OpenShift Container Platform Reporter: hongyan li <hongyli>
Component: Networking Assignee: Ben Bennett <bbennett>
Networking sub component: ovn-kubernetes QA Contact: Anurag saxena <anusaxen>
Status: CLOSED DUPLICATE Docs Contact:
Severity: medium    
Priority: unspecified CC: ffernand, juzhao
Version: 4.9   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Last Closed: 2022-02-28 14:50:29 UTC Type: Bug

Description hongyan li 2022-02-25 08:19:11 UTC
Description of problem:
On an SNO cluster with networkType "OVNKubernetes", the poddisruptionbudget ovn-raft-quorum-guard in namespace openshift-ovn-kubernetes triggers the alert PodDisruptionBudgetAtLimit.

% oc -n openshift-ovn-kubernetes get poddisruptionbudget
NAME                    MIN AVAILABLE   MAX UNAVAILABLE   ALLOWED DISRUPTIONS   AGE
ovn-raft-quorum-guard   1               N/A               0                     5h27m
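
The 0 under ALLOWED DISRUPTIONS follows from simple arithmetic. A minimal sketch, assuming an integer minAvailable and exactly one healthy matching pod on the single node (the alert expression further down implies current healthy equals desired healthy, so 1 here). The helper name allowed_disruptions is hypothetical and not part of oc or the cluster:

```shell
# Sketch: allowed disruptions for a PDB with an integer minAvailable.
# allowed = current healthy pods - minAvailable, never below zero.
allowed_disruptions() {
  current_healthy=$1
  min_available=$2
  allowed=$((current_healthy - min_available))
  if [ "$allowed" -lt 0 ]; then
    allowed=0
  fi
  echo "$allowed"
}

allowed_disruptions 1 1   # SNO case from the table above: prints 0
allowed_disruptions 3 1   # hypothetical three-pod case for contrast: prints 2
```

With one pod and minAvailable 1, no voluntary disruption is ever allowed, so the budget sits at its limit for the whole lifetime of the cluster.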

% oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- curl -k -H "Authorization: Bearer $token" 'https://alertmanager-main.openshift-monitoring.svc:9094/api/v1/alerts' | jq | grep -A20 PodDisruptionBudgetAtLimit
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  4258    0  4258    0     0   346k      0 --:--:-- --:--:-- --:--:--  346k
        "alertname": "PodDisruptionBudgetAtLimit",
        "namespace": "openshift-ovn-kubernetes",
        "poddisruptionbudget": "ovn-raft-quorum-guard",
        "prometheus": "openshift-monitoring/k8s",
        "severity": "warning"
      },
      "annotations": {
        "message": "The pod disruption budget is preventing further disruption to pods because it is at the minimum allowed level."
      },
      "startsAt": "2022-02-25T05:56:16.083Z",
      "endsAt": "2022-02-25T08:18:16.083Z",
      "generatorURL": "https://prometheus-k8s-openshift-monitoring.apps.qeci-34154.qe.devcluster.openshift.com/graph?g0.expr=max+by%28namespace%2C+poddisruptionbudget%29+%28kube_poddisruptionbudget_status_current_healthy+%3D%3D+kube_poddisruptionbudget_status_desired_healthy%29&g0.tab=1",
      "status": {
        "state": "active",
        "silencedBy": null,
        "inhibitedBy": null
      },
      "receivers": [
        "Default"
      ],
      "fingerprint": "d5593bc9daeaa692"

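The g0.expr parameter in the generatorURL above, once URL-decoded, shows the condition behind the alert. The sketch below only mirrors that comparison for illustration. The helper pdb_at_limit is hypothetical, and the values passed to it are assumptions (1 and 1 inferred from MIN AVAILABLE 1 and the firing alert, 3 and 1 invented for contrast):

```shell
# Decoded from g0.expr in the generatorURL:
#   max by(namespace, poddisruptionbudget)
#     (kube_poddisruptionbudget_status_current_healthy
#        == kube_poddisruptionbudget_status_desired_healthy)
#
# pdb_at_limit is a hypothetical helper that mirrors the equality check.
pdb_at_limit() {
  current_healthy=$1
  desired_healthy=$2
  if [ "$current_healthy" -eq "$desired_healthy" ]; then
    echo "at-limit"
  else
    echo "ok"
  fi
}

pdb_at_limit 1 1   # SNO values inferred from this report: prints at-limit
pdb_at_limit 3 1   # hypothetical multi-pod case: prints ok
```

Because the expression is a plain equality, a budget whose desired healthy count equals the only possible pod count matches it permanently.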

Version-Release number of selected component (if applicable):
4.9.17-assembly.art3819

How reproducible:
always

Steps to Reproduce:
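1. Install an SNO cluster with networkType "OVNKubernetes".
2. List the poddisruptionbudgets in namespace openshift-ovn-kubernetes.
3. Query Alertmanager for active alerts.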

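Actual results:
The alert PodDisruptionBudgetAtLimit is active for poddisruptionbudget ovn-raft-quorum-guard.

Expected results:
The alert PodDisruptionBudgetAtLimit is not raised for ovn-raft-quorum-guard on an SNO cluster.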

Additional info:
The poddisruptionbudget ovn-raft-quorum-guard exists for these profiles:
aos-4_9/upi-on-baremetal/versioned-installer-packet-sno_bip-disk_encryption-etcd_encryption
aos-4_9/ipi-on-aws/versioned-installer-customer_vpc-disconnected-sno-ci

It does not exist for this profile:
aos-4_9/ipi-on-aws/versioned-installer

Presumably the poddisruptionbudget exists only when networkType is "OVNKubernetes".

Comment 2 ffernand 2022-02-28 14:50:29 UTC

*** This bug has been marked as a duplicate of bug 2058515 ***