Bug 1809719
Summary: | During rolling update, OCM doesn't release its lease | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Clayton Coleman <ccoleman> |
Component: | openshift-controller-manager | Assignee: | Gabe Montero <gmontero> |
Status: | CLOSED ERRATA | QA Contact: | wewang <wewang> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.3.z | CC: | aos-bugs, gmontero, mfojtik |
Target Milestone: | --- | ||
Target Release: | 4.5.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | devex | ||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: the openshift controller manager was not using the ReleaseOnCancel option on its kube leader election configuration.
Consequence: leader establishment during a rolling update could be delayed as it could take more time for the new leader to obtain a lease since to old leader did not proactively release it prior to its shutdown
Fix: ReleaseOnCancel is not set
Result: leader establishment during a rolling update of the openshift controller manager deployment should proceed more consistenly
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2020-07-13 17:17:45 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Clayton Coleman
2020-03-03 18:18:06 UTC
@gabe if it's ok to verfiy the bug with follow steps, it cost about 51 seconds to new pod running Steps: [wewang@wangwen work]$ oc get configmap openshift-master-controllers -oyaml -n openshift-controller-manager apiVersion: v1 kind: ConfigMap metadata: annotations: control-plane.alpha.kubernetes.io/leader: '{"holderIdentity":"controller-manager-srqzr","leaseDurationSeconds":60,"acquireTime":"2020-05-26T07:58:09Z","renewTime":"2020-05-26T08:03:09Z","leaderTransitions":3}' creationTimestamp: "2020-05-26T06:37:10Z" name: openshift-master-controllers namespace: openshift-controller-manager resourceVersion: "56364" selfLink: /api/v1/namespaces/openshift-controller-manager/configmaps/openshift-master-controllers uid: 34654c77-4b7d-4004-a61c-84bc584d0024 [wewang@wangwen work]$ date ; oc delete pod controller-manager-srqzr -n openshift-controller-manager ; date; oc get pods -n openshift-controller-manager Tue May 26 16:08:50 CST 2020 pod "controller-manager-srqzr" deleted Tue May 26 16:09:41 CST 2020 NAME READY STATUS RESTARTS AGE controller-manager-k5ckk 1/1 Running 0 91m controller-manager-lxsj8 1/1 Running 0 91m controller-manager-xgt6h 1/1 Running 0 7s Perfect @Wen ... looks good marking verified Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:2409 |