Bug 1541476
| Summary: | Pods in crash loop backoff can't be deleted until the crash loop backoff period expires | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Clayton Coleman <ccoleman> |
| Component: | Node | Assignee: | Robert Krawitz <rkrawitz> |
| Status: | CLOSED ERRATA | QA Contact: | weiwei jiang <wjiang> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | 3.9.0 | CC: | aos-bugs, avagarwa, dma, jokerman, mmccomas, sjenning |
| Target Milestone: | --- | Keywords: | NeedsTestCase |
| Target Release: | 3.10.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | No Doc Update | |
| Doc Text: |
undefined
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2018-07-30 19:09:00 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Clayton Coleman
2018-02-02 16:44:58 UTC
Still working to run this down upstream https://github.com/kubernetes/kubernetes/issues/57865 The delay is only a factor of the terminationGracePeriod (30s), not the backoff timeout (up to 5m). So we are looking at a delay in the 10s of seconds, not minutes. Trying to figure out why the kubelet does not clean up the failed container once the pod gets its deletionTimestamp set. This has been an issue since at least 1.6 according to the upstream issue so it isn't a regression. It is annoying and effects pods in general, not just StatefulSet pods. Not a blocker in my mind though, so deferring to z-stream. Clayton, if you really want this to be a blocker, feel free to move it back. WIP upstream PR: https://github.com/kubernetes/kubernetes/pull/62170 Previous upstream PR abandon. New upstream PR: https://github.com/kubernetes/kubernetes/pull/63321 Origin PR: https://github.com/openshift/origin/pull/19580 Checked on # oc version oc v3.10.0-0.46.0 kubernetes v1.10.0+b81c8f8 features: Basic-Auth GSSAPI Kerberos SPNEGO Server https://ip-172-18-14-127.ec2.internal:8443 openshift v3.10.0-0.46.0 kubernetes v1.10.0+b81c8f8 And the terminating pods is deleted immediately now. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:1816 |