Bug 2028153
Summary: | Unable to ensure pod container exists: failed to create container for [kubepods burstable ...] : Argument list too long | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Mridul Markandey <mmarkand> | |
Component: | Node | Assignee: | Kir Kolyshkin <kir> | |
Node sub component: | CRI-O | QA Contact: | Sunil Choudhary <schoudha> | |
Status: | CLOSED WONTFIX | Docs Contact: | ||
Severity: | high | |||
Priority: | high | CC: | aos-bugs, bhoppus, bsmitley, cshepher, dgautam, hgomes, jcrumple, kir, kyankovi, mfiedler, nagrawal, pehunt, sbelmasg, vwalek, wrussell | |
Version: | 4.7 | Keywords: | Reopened | |
Target Milestone: | --- | |||
Target Release: | --- | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | If docs needed, set a value | ||
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 2036853 (view as bug list) | Environment: | ||
Last Closed: | 2024-04-30 18:04:53 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 2039326 | |||
Bug Blocks: |
Description
Mridul Markandey
2021-12-01 15:42:45 UTC
In addition to > sudo systemctl daemon-reload suggested by Neelesh earlier, you can also try > sudo systemctl reset-failed which may help. In any case, please collect the output of > sudo systemctl list-units --all BEFORE trying any workarounds. In case there are many failed .mount units in "systemctl list-units --all" output, backporting of https://github.com/systemd/systemd/pull/10980 may help. > The majority of those To be specific, 121587 out of 129434 > are from runc CVE fix Again, to be specific, I meant CVE-2019-5736, for which the default mitigation is to bind mount /proc/self/exe (see https://github.com/opencontainers/runc/blob/master/libcontainer/nsenter/cloned_binary.c#L399). This mount is performed every time runc needs to enter the container (i.e. whenever runc start/run/exec is run). Systemd sees the new mount and creates a mount unit. Due to a bug in systemd this unit is never removed. The bug is presumably fixed upstream in https://github.com/systemd/systemd/pull/10980; I am working on a backport. Side note: opened a PR to add "systemctl list-units --all" to sosreport: https://github.com/sosreport/sos/pull/2809 systemd backport: https://github.com/redhat-plumbers/systemd-rhel8/pull/244 > Can you please guide us about the next steps to follow? Are we planning for a permanent fix for this issue or running cronjob is the only available workaround at the moment? Yes, a permanent fix is in progress (see comment 31 above). Hello Kir Kolyshin, Could you please confirm in what particular version of OpenShift 4.7, is this current Bug #2028153 is resolved? Which I attached with case no. 03126124. Best Regards, Amarjit Das Hello Neeleah, Thank you for your update in comment 38. As the current bug #2028153 was fixed, that means in OpenShift 4.7.41 we don't have to stick to a workaround. Please confirm. Best Regards, Amarjit Das The systemd fix landed in systemd-239-45.el8_4.7, which is still not released. Releasing it is the subject of #2039326. Once released, RHCOS should pick it up. systemd-239-45.el8_4.8 with the fixes is available in 4.11 nightly 4.10.4 4.9.24 4.8.35 4.7.45 or later builds. I see this bug was fixed in https://bugzilla.redhat.com/show_bug.cgi?id=1984406 with systemd-239-45.el8. Marking it verified. OCP is no longer using Bugzilla and this bug appears to have been left in an orphaned state. If the bug is still relevant, please open a new issue in the OCPBUGS Jira project: https://issues.redhat.com/projects/OCPBUGS/summary The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days |