Bug 1888041
Summary: | non-terminating pods are going from running to pending | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | David Eads <deads> | |
Component: | Node | Assignee: | Ryan Phillips <rphillips> | |
Node sub component: | Kubelet | QA Contact: | Sunil Choudhary <schoudha> | |
Status: | CLOSED ERRATA | Docs Contact: | ||
Severity: | high | |||
Priority: | high | CC: | aos-bugs, jokerman | |
Version: | 4.6 | |||
Target Milestone: | --- | |||
Target Release: | 4.7.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | Bug Fix | ||
Doc Text: |
Cause: The Kubelet did not handle pod state transitions properly when container statuses were missing.
Consequence: Terminated pods were sometimes not restarted.
Fix: The Kubelet now adds a ContainerStatus of failed so that the container can be restarted if needed.
Result: Kubelet pod handling no longer results in an invalid state transition.
|
Story Points: | --- | |
Clone Of: | ||||
: | 1960291 | Environment: | |
Last Closed: | 2021-02-24 15:25:41 UTC | Type: | Bug | |
Bug Depends On: | ||||
Bug Blocks: | 1888847, 1960291 |
Description
David Eads
2020-10-13 20:54:25 UTC
The Node team had a Slack conversation with David, and he suggested the patch at https://github.com/kubernetes/kubernetes/pull/95561 which we will test. If the patch works, we will target a 4.6.z backport. The issue stems from reboots and the way CRI-O wipes the container statuses upon reboot.

The fix was in master when https://bugzilla.redhat.com/show_bug.cgi?id=1884035#c15 was verified, and the same check applies to both bugs. Three fixes were involved, and the test verified them together.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2020:5633
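
To make the failure mode concrete, here is a minimal Go sketch of the idea described in the Doc Text: when the runtime no longer reports a status for a container that was last seen running (for example after a reboot wiped the statuses), record a failed, terminated status instead of letting the pod fall back to pending. This is not the kubelet code and not the content of the linked pull request. The types `State` and `ContainerStatus`, the function `reconcile`, and the `ContainerStatusUnknown` reason and exit code 137 used here are all hypothetical, simplified stand-ins chosen for illustration.

```go
package main

import "fmt"

// State is a simplified, hypothetical stand-in for a container state.
type State string

const (
	Running    State = "Running"
	Waiting    State = "Waiting"
	Terminated State = "Terminated"
)

// ContainerStatus is a hypothetical, trimmed-down status record.
type ContainerStatus struct {
	Name     string
	State    State
	ExitCode int
	Reason   string
}

// reconcile merges the previously reported statuses with what the runtime
// reports now. The runtime map may be missing entries, for example when the
// runtime wiped its container statuses on reboot.
func reconcile(previous []ContainerStatus, runtime map[string]ContainerStatus) []ContainerStatus {
	out := make([]ContainerStatus, 0, len(previous))
	for _, prev := range previous {
		if cur, ok := runtime[prev.Name]; ok {
			// The runtime still knows this container: trust its status.
			out = append(out, cur)
			continue
		}
		if prev.State == Running {
			// The status is missing for a container that was running.
			// Record it as failed so it can be restarted if the restart
			// policy asks for it. Reason and exit code are illustrative
			// assumptions, not values taken from the bug report.
			out = append(out, ContainerStatus{
				Name:     prev.Name,
				State:    Terminated,
				ExitCode: 137,
				Reason:   "ContainerStatusUnknown",
			})
			continue
		}
		// Not running before and still unknown: keep the old status.
		out = append(out, prev)
	}
	return out
}

func main() {
	previous := []ContainerStatus{
		{Name: "app", State: Running},
		{Name: "sidecar", State: Running},
	}
	// After a simulated reboot, the runtime only reports the sidecar.
	runtime := map[string]ContainerStatus{
		"sidecar": {Name: "sidecar", State: Running},
	}
	for _, s := range reconcile(previous, runtime) {
		fmt.Printf("%s: %s (reason=%q, exit=%d)\n", s.Name, s.State, s.Reason, s.ExitCode)
	}
}
```

In this sketch the `app` container, whose status went missing, is reported as terminated with a failure exit code rather than as waiting, which is the kind of transition that keeps a pod from appearing to move from running back to pending.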