Bug 1843877
Summary: | daemonset, deployment, and replicaset status can permafail | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Maciej Szulik <maszulik> |
Component: | kube-controller-manager | Assignee: | Maciej Szulik <maszulik> |
Status: | CLOSED ERRATA | QA Contact: | zhou ying <yinzhou> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.3.0 | CC: | aos-bugs, bparees, deads, mfojtik, yinzhou |
Target Milestone: | --- | Keywords: | UpcomingSprint |
Target Release: | 4.3.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause:
In certain cases NotFound error was swallowed by controller logic.
Consequence:
Missing NotFound event was causing the controller not be aware of missing pods.
Fix:
Properly react to NotFound events, which indicate that the pod was already removed by a different actor.
Result:
Controller (deployment, daemonset, replicaset and others) will properly react to pod NotFound event.
|
Story Points: | --- |
Clone Of: | 1843876 | Environment: | |
Last Closed: | 2020-08-05 10:54:06 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1843876 | ||
Bug Blocks: |
Description
Maciej Szulik
2020-06-04 11:16:26 UTC
This waiting to be merged in the queue. This waiting to be merged in the queue. Confirmed with payload: 4.3.0-0.nightly-2020-07-12-052232 , this issue has fixed: Delete one pod at the first terminal , at the same time scale down the deploy, no new pod created . [zhouying@dhcp-140-138 ~]$ oc get po NAME READY STATUS RESTARTS AGE mydeploy-6cb778bf69-6jngn 1/1 Running 0 5m7s mydeploy-6cb778bf69-ftl6v 1/1 Running 0 5m7s mydeploy-6cb778bf69-sgkwc 1/1 Running 0 15s [zhouying@dhcp-140-138 ~]$ oc delete po/mydeploy-6cb778bf69-sgkwc pod "mydeploy-6cb778bf69-sgkwc" deleted [zhouying@dhcp-140-138 ~]$ oc get po NAME READY STATUS RESTARTS AGE mydeploy-6cb778bf69-6jngn 1/1 Running 0 5m30s mydeploy-6cb778bf69-ftl6v 1/1 Running 0 5m30s [zhouying@dhcp-140-138 ~]$ oc scale deploy/mydeploy --replicas=2 deployment.extensions/mydeploy scaled [zhouying@dhcp-140-138 ~]$ oc get po NAME READY STATUS RESTARTS AGE mydeploy-6cb778bf69-6jngn 1/1 Running 0 5m20s mydeploy-6cb778bf69-ftl6v 1/1 Running 0 5m20s mydeploy-6cb778bf69-sgkwc 0/1 Terminating 0 28s Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.3.31 bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:3180 |