Bug 1965423

Summary: ocs must-gather times out when using request-timeout=20m succeeds when using timeout=1200
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Dan Seals <dseals>
Component: must-gatherAssignee: Gobinda Das <godas>
Status: CLOSED WORKSFORME QA Contact: Raz Tamir <ratamir>
Severity: low Docs Contact:
Priority: unspecified    
Version: 4.6CC: aos-bugs, bkunal, godas, jokerman, maszulik, mfojtik, muagarwa, nberry, ocs-bugs, odf-bz-bot, resoni, sabose, tdesala
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-08-03 11:43:05 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Dan Seals 2021-05-27 17:27:43 UTC
Description of problem:
OCS must-gather times out when using --request-timeout=20m
The must-gather completes successfully when using --timeout=1200

The cluster is in a disconnected env

Command used that fails
oc adm must-gather --image=hub.fbond:5000/ocs4/ocs-must-gather-rhel8@sha256:e60eb1de328655fe484eccfaab307b277987c61dc2cf885a69c65416bad9fa24 --keep --request-timeout=20m 

The must-gather starts at 6:10:45 then times out at 6:25:39.
I0527 06:10:45.898687  206304 loader.go:375] Config loaded from file:  /root/ignition/auth/kubeconfig
[must-gather      ] OUT Using must-gather plugin-in image: hub.fbond:5000/ocs4/ocs-must-gather-rhel8@sha256:e60eb1de328655fe484eccfaab307b277987c61dc2cf885a69c65416bad9fa24
.......
.......
[must-gather-b8xzw] OUT gather never finished: timed out waiting for the condition
F0527 06:25:39.141702  206304 helpers.go:115] error: gather never finished for pod must-gather-b8xzw: timed out waiting for the condition




Command used that completes:
oc adm must-gather --image=hub.fbond:5000/ocs4/ocs-must-gather-rhel8@sha256:e60eb1de328655fe484eccfaab307b277987c61dc2cf885a69c65416bad9fa24 --keep --timeout=1200


Version-Release number of selected component (if applicable):
4.6.16


Additional info:
They have tested multiple times with the same results

Comment 2 Maciej Szulik 2021-05-31 15:15:17 UTC
Sending this over to OCS team to investigate.

Comment 16 Rewant 2021-07-22 06:56:09 UTC
I created my own image from master and it works with --request-timeout=20m