Bug 1952137
| Summary: | Observing lot of Defunct processes | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Anandhu B Raj <abraj> |
| Component: | Node | Assignee: | Sascha Grunert <sgrunert> |
| Node sub component: | CRI-O | QA Contact: | Sunil Choudhary <schoudha> |
| Status: | CLOSED DEFERRED | Docs Contact: | |
| Severity: | urgent | ||
| Priority: | high | CC: | acardena, aos-bugs, avoigtma, bhershbe, bleanhar, bverschu, deliedit, ealcaniz, eminguez, fsimonce, gdiotte, hgomes, jteagno+bugzilla, mchebbi, miminar, minmli, nagrawal, ofalk, openshift-bugs-escalate, palshure, pducai, pehunt, pratshar, prdeshpa, pweil, rcarrier, rphillips, rupatel, schoudha, sgrunert, sparpate, wking |
| Version: | 4.6 | Flags: | prdeshpa:
needinfo-
|
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-08-17 09:46:27 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Anandhu B Raj
2021-04-21 15:12:42 UTC
The sosreport indicates that there are other defunct processes other than conmon, so I suspect this issue is related to: https://bugzilla.redhat.com/show_bug.cgi?id=1932832 I really would like to give the runc fix a try which got mentioned in the first comment. Anandhu, do you think we can test it this way? Peter, referring to https://bugzilla.redhat.com/show_bug.cgi?id=1942375#c10, which OpenShift version ships the runc-1.0.0-85.rhaos4.6.git77a6f3c package? Since it was bumped in https://releases-rhcos-art.cloud.privileged.psi.redhat.com/contents.html?stream=releases%2Frhcos-4.6&release=46.82.202104281641-0 I would guess 2.6.28 will have it Anandhu, can we update the customer version to 2.6.28 to test the fix? (In reply to Sascha Grunert from comment #15) > Anandhu, can we update the customer version to 2.6.28 to test the fix? I think Peter meant 4.6.28 :) *** Bug 1932832 has been marked as a duplicate of this bug. *** Unsetting the target release because this issue affects 4.6. The upstream PR has been merged and should automatically land in the next OpenShift release. Package references: - https://brewweb.engineering.redhat.com/brew/buildinfo?buildID=1631695 - https://brewweb.engineering.redhat.com/brew/buildinfo?buildID=1631692 Hey Anandhu, 4.6.35 should contain the fix. FYI CRI-O needs the attached fix as well. for 4.6, it made it in 4.6.36 *** Bug 1980522 has been marked as a duplicate of this bug. *** test on 4.6.42, create some pods with liveness exec probe, don't find any defunct process. according to Comment 82, this issue still exist in production environment, so set assigned again. Thanks Peter, as discussed yesterday I'm closing this bug in favor of BZ#1994444 to focus our observations on the currently open issue. |