Bug 1893360
| Summary: | 4.7 node reboots are slower due to DNS containers not terminating cleanly | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Yu Qi Zhang <jerzhang> | ||||||
| Component: | Networking | Assignee: | Miciah Dashiel Butler Masters <mmasters> | ||||||
| Networking sub component: | DNS | QA Contact: | Hongan Li <hongli> | ||||||
| Status: | CLOSED DUPLICATE | Docs Contact: | |||||||
| Severity: | high | ||||||||
| Priority: | unspecified | CC: | aos-bugs, kgarriso, sdodson, skumari, walters, wking | ||||||
| Version: | 4.7 | Keywords: | Upgrades | ||||||
| Target Milestone: | --- | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | Unspecified | ||||||||
| OS: | Unspecified | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2020-11-04 21:55:18 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Attachments: |
|
||||||||
|
Description
Yu Qi Zhang
2020-10-30 20:39:05 UTC
Created attachment 1725421 [details]
dns logs in crio
As a note: these slowdowns may not seem like a lot but when combined with the sdn bz (https://bugzilla.redhat.com/show_bug.cgi?id=1893362) they result in 50% longer reboots per node. This means that they break MCO CI(e2e-gcp-op) which already has a built-in cushion and will cause visible perf hits for customers rolling out updates to large clusters. Following git blame here leads to https://github.com/openshift/cluster-dns-operator/issues/65 Relevant commits: https://github.com/openshift/cluster-dns-operator/commit/52362da4821655981eabd64c111c71698e30e3d4 Annoyingly, this problem would mostly go away if Kubernetes supported a proper init system (like podman run --init). Ah it's probably not the bash code, it's likely fallout from https://github.com/openshift/cluster-dns-operator/commit/f094ddf7edc95dad8398179482687bc2a7a0c15b *** This bug has been marked as a duplicate of bug 1884053 *** |