Bug 1335941
| Summary: | Openshift should clean up containers from jobs no longer in the system | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Robert Rati <rrati> |
| Component: | Node | Assignee: | Andy Goldstein <agoldste> |
| Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | DeShuai Ma <dma> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | 3.2.0 | CC: | agoldste, aos-bugs, jokerman, jvyas, mmccomas, rmeggins, tstclair |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-06-03 19:52:36 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1335939 | ||
| Bug Blocks: | |||
|
Description
Robert Rati
2016-05-13 15:09:54 UTC
Is this specific to jobs, pods, or both? Kubernetes will remove containers when their pod is deleted. If a pod is still around but its containers have died and been restarted, the old dead containers are preserved until the container GC thresholds are hit. I believe the defaults are 100 total dead containers, and up to 2 per pod. What this means is that until you have 101 dead containers, nothing will get GC'd (or maybe the 2 per pod cap is applied, I can't remember offhand). It would be useful to have a specific reproducer if possible. I am unable to reproduce based on the Steps to Reproduce listed above. openshift also needs to clean logs as well. See https://bugzilla.redhat.com/show_bug.cgi?id=1335951 and https://github.com/kubernetes/kubernetes/compare/master...jayunit100:LoggingSoak to systematically reproduce/test logging strain at scale. Ok... but this bz is about data remaining in /var/lib/docker/containers after the job has been deleted. I can't reproduce. Can you? updated https://bugzilla.redhat.com/show_bug.cgi?id=1335951 with details regarding the logging soak portion. that ticket also his details regarding oom exceptions when logging has no breaks. I'm sorry for being a stickler here, but please provide details on whether or not you can reproduce this. Under normal conditions, the container data is removed from /var/lib/docker/containers when the job (and its underlying pod) is deleted. Otherwise I'm going to close this. I have not ran any jobs in openshift, ive only reproduced similar errors using raw pod spinups with highly verbose logging.. What happened when you had a pod with verbose logging and you tried to delete it? Was the pod itself successfully deleted? Were the containers for the pod deleted? I did witness the issue, however I'm going to temporarily close this until we can find a valid reproducer. I've been unable to reproduce it. +1 to close i dont think its reproducible anymore on new openshift/docker versions. |