Bug 1781044
| Summary: | [must gather] oc adm must-gather failed to generate the directory, gather never finished: timed out waiting for the condition. | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | Container Native Virtualization (CNV) | Reporter: | Ying Cui <ycui> | ||||||||||
| Component: | Providers | Assignee: | Avram Levitter <alevitte> | ||||||||||
| Status: | CLOSED ERRATA | QA Contact: | Ying Cui <ycui> | ||||||||||
| Severity: | high | Docs Contact: | |||||||||||
| Priority: | high | ||||||||||||
| Version: | 2.2.0 | CC: | alevitte, cnv-qe-bugs, danken, fdeutsch, maszulik, ncredi, pkliczew | ||||||||||
| Target Milestone: | --- | Keywords: | Regression | ||||||||||
| Target Release: | 2.2.0 | Flags: | maszulik:
needinfo-
|
||||||||||
| Hardware: | Unspecified | ||||||||||||
| OS: | Unspecified | ||||||||||||
| Whiteboard: | |||||||||||||
| Fixed In Version: | cnv-must-gather-container-v2.2.0-7 | Doc Type: | If docs needed, set a value | ||||||||||
| Doc Text: | Story Points: | --- | |||||||||||
| Clone Of: | Environment: | ||||||||||||
| Last Closed: | 2020-01-30 16:27:33 UTC | Type: | Bug | ||||||||||
| Regression: | --- | Mount Type: | --- | ||||||||||
| Documentation: | --- | CRM: | |||||||||||
| Verified Versions: | Category: | --- | |||||||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||||||
| Embargoed: | |||||||||||||
| Attachments: |
|
||||||||||||
|
Description
Ying Cui
2019-12-09 07:30:35 UTC
Created attachment 1643190 [details]
screen_messages_output_mustgather
Created attachment 1643191 [details]
ocgetpods
Created attachment 1643204 [details]
ocdescribepod
Maciej, I remember you wanted to investigate this one. We agreed that no matter what happens some gathered logs should be collected. It doesn't look like regression. In my opinion it never worked. Let's wait on Maciej to reply but I think he or anyone else from the platform should fix it. Piotr, what is "it" that never worked? I though that Ying was attempting a very basic use case which was tested before. What am I missing? Dan this issue was reported before as BZ #1755714. Maciej closed it as works on my machine and promised to investigate which seems like it never happened. Created attachment 1643655 [details]
mustgather_withoutimage_successful
It seems that it's failing specifically because of the 10 minute timeout built into `oc adm must-gather`. When I used the `--keep` flag (which will not delete the pod and namespace after execution), the pod finished after 13 minutes. The problem seems to be specifically in the gathering of the packagemanifests. That section has been taking close to 10 minutes. It takes around 3 seconds to execute `oc get packagemanifest $name -n $NS -o yaml >> ${NAMESPACE_PATH}/${NS}/packagemanifests` and on a test cluster there were 185 packagemanifests.
There's a pending pull request that should fix this in upstream: https://github.com/kubevirt/must-gather/pull/60 (In reply to Avram Levitter from comment #18) > There's a pending pull request that should fix this in upstream: > https://github.com/kubevirt/must-gather/pull/60 That's exactly the reason to move a bz to the POST state. VERIFIED this bug on cnv-must-gather-container-v2.2.0-7 Test Steps: $ oc adm must-gather --image=registry-proxy.engineering.redhat.com/rh-osbs/container-native-virtualization-cnv-must-gather-rhel8:v2.2.0-7 --dest-dir=/tmp The output directory generated, the issue is fixed. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2020:0307 |