Bug 1948441
| Summary: | ImagePullBackOff: Source image rejected: Too many open files | |||
|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Andy Bartlett <andbartl> | |
| Component: | Node | Assignee: | Peter Hunt <pehunt> | |
| Node sub component: | CRI-O | QA Contact: | Sunil Choudhary <schoudha> | |
| Status: | CLOSED ERRATA | Docs Contact: | ||
| Severity: | urgent | |||
| Priority: | urgent | CC: | aos-bugs, bjarolim, dwalsh, fiezzi, jhou, jokerman, moddi, openshift-bugs-escalate, tsweeney | |
| Version: | 4.6 | |||
| Target Milestone: | --- | |||
| Target Release: | 4.6.z | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1953071 | Environment: | |
| Last Closed: | 2021-05-12 12:18:10 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| Bug Depends On: | 1953071 | |||
| Bug Blocks: | ||||
Description
Andy Bartlett
2021-04-12 07:51:28 UTC
The configured ulimits should be able to handle the number of open FDs CRI-O has, but I've also discovered a leak in CRI-O whose fix we forgot to backport to 4.6: https://github.com/cri-o/cri-o/pull/4800

This should mitigate the situation (these connections would have been cleaned up, but it takes a while). I believe upgrading to a version of CRI-O with this patch will make this situation not happen anymore (or be *much* harder to reproduce). As such, moving this to POST.

Here's another PR that *may* help once integrated (and is a leak regardless, so worth picking up).

Both attached PRs have merged and will be in the next z-stream.

Tried to trigger the issue locally by setting the ulimit on a node just above what was currently being used and then pulling an image. Could not reproduce the issue. Also, from the bug description I see the issue happened randomly. I will mark it verified based on comments 18 and 19.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6.28 bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:1487
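For anyone checking whether a node is close to the "Too many open files" condition discussed in this bug, the comparison between a process's open file descriptors and its limit can be sketched as below. This is a minimal illustration, not something taken from the bug or from CRI-O itself: it assumes a Linux node with `/proc` mounted, and the helper names (`parse_nofile_soft_limit`, `count_open_fds`, `fd_headroom`) are hypothetical. The demo at the end inspects the script's own PID as a stand-in; on a real node you would pass the PID of the CRI-O process and run with enough privileges to read its `/proc` entries.

```python
import os
from typing import Optional


def parse_nofile_soft_limit(limits_text: str) -> Optional[int]:
    """Extract the soft 'Max open files' limit from /proc/<pid>/limits text.

    Returns None if the row is missing or the limit is 'unlimited'.
    """
    prefix = "Max open files"
    for line in limits_text.splitlines():
        if line.startswith(prefix):
            # Remaining columns are: soft limit, hard limit, units.
            fields = line[len(prefix):].split()
            if fields and fields[0].isdigit():
                return int(fields[0])
            return None
    return None


def count_open_fds(pid: int) -> int:
    """Count the entries in /proc/<pid>/fd (one entry per open descriptor)."""
    return len(os.listdir(f"/proc/{pid}/fd"))


def fd_headroom(open_fds: int, soft_limit: int) -> int:
    """Number of descriptors the process can still open before hitting the limit."""
    return soft_limit - open_fds


if __name__ == "__main__":
    pid = os.getpid()  # stand-in PID (assumption); use the CRI-O PID on a node
    limits_path = f"/proc/{pid}/limits"
    if os.path.exists(limits_path):
        with open(limits_path) as f:
            soft = parse_nofile_soft_limit(f.read())
        used = count_open_fds(pid)
        if soft is None:
            print(f"pid {pid}: {used} open FDs, no finite soft limit")
        else:
            print(f"pid {pid}: {used} open FDs, soft limit {soft}, "
                  f"headroom {fd_headroom(used, soft)}")
```

Sampling the open FD count repeatedly over time is what would distinguish a leak (a steadily growing count) from a limit that is simply set too low for normal load.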