Bug 1703883

Summary:	[ci] must-gather pod test causes test case failures
Product:	OpenShift Container Platform	Reporter:	Clayton Coleman <ccoleman>
Component:	Master	Assignee:	Luis Sanchez <sanchezl>
Status:	CLOSED ERRATA	QA Contact:	zhou ying <yinzhou>
Severity:	high	Docs Contact:
Priority:	medium
Version:	4.1.0	CC:	aos-bugs, gblomqui, jokerman, mmccomas, sanchezl, xxia
Target Milestone:	---
Target Release:	4.1.0
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2019-06-04 10:48:10 UTC	Type:	Bug

Description Clayton Coleman 2019-04-29 02:01:46 UTC

The must-gather pod launched by CI is causing the crashlooping pods test to fail. It may be that the test should ignore pods that are run once and have no failing containers, but this should be investigated.

https://openshift-gce-devel.appspot.com/build/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-4.1/340

[Feature:Platform] Managed cluster should have no crashlooping pods in core namespaces over two minutes [Suite:openshift/conformance/parallel] 2m2s

fail [github.com/openshift/origin/test/extended/operators/cluster.go:109]: Expected
    <[]string | len:1, cap:1>: [
        "Pod openshift-must-gather-hzg6c/must-gather-tnvmr was pending entire time: unknown error",
    ]
to be empty
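
A minimal sketch of the filter suggested in the description: skip pods that are run once and have no failing containers before applying the crashloop check. This is not the code in test/extended/operators/cluster.go and not necessarily what the eventual fix does. The `Pod` and `Container` types, the `isRunOnce`, `hasFailingContainer`, and `podsToCheck` names, and the reading of "run once" as restart policy `Never` are all assumptions made for illustration; the real test works with Kubernetes API objects.

```go
package main

// Container is a hypothetical, simplified stand-in for a container status.
type Container struct {
	Name         string
	RestartCount int
	Terminated   bool
	ExitCode     int // meaningful only when Terminated is true
}

// Pod is a hypothetical, simplified stand-in for a Kubernetes pod.
type Pod struct {
	Namespace     string
	Name          string
	RestartPolicy string // "Always", "OnFailure", or "Never"
	Containers    []Container
}

// isRunOnce assumes "run once" means the pod is never restarted.
func isRunOnce(p Pod) bool {
	return p.RestartPolicy == "Never"
}

// hasFailingContainer reports whether any container has restarted
// or has terminated with a non-zero exit code.
func hasFailingContainer(p Pod) bool {
	for _, c := range p.Containers {
		if c.RestartCount > 0 || (c.Terminated && c.ExitCode != 0) {
			return true
		}
	}
	return false
}

// podsToCheck drops run-once pods that have no failing containers and
// returns the pods the crashloop check should still examine.
func podsToCheck(pods []Pod) []Pod {
	var kept []Pod
	for _, p := range pods {
		if isRunOnce(p) && !hasFailingContainer(p) {
			continue
		}
		kept = append(kept, p)
	}
	return kept
}
```

Under these assumptions, a pending must-gather pod with restart policy `Never` and no container failures would be skipped, while a run-once pod whose container exited non-zero would still be examined.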

Comment 2 Luis Sanchez 2019-05-01 16:26:18 UTC

Occurs when the crashlooping pods test and the must-gather test run concurrently.

Fixed by: https://github.com/openshift/origin/pull/22712

Comment 3 Luis Sanchez 2019-05-01 16:26:51 UTC

*** Bug 1704470 has been marked as a duplicate of this bug. ***

Comment 4 Luis Sanchez 2019-05-13 15:07:42 UTC

Fixed in 4.1.0

Comment 5 zhou ying 2019-05-15 03:37:23 UTC

Confirmed with the latest payload; the issue has been fixed:
Payload: 4.1.0-0.nightly-2019-05-14-202907

Open two terminals and run both commands at the same time: in one, run `oc adm must-gather`; in the other, run `openshift-tests run-test "[Feature:Platform] Managed cluster should have no crashlooping pods in core namespaces over two minutes [Suite:openshift/conformance/parallel]"`. No errors found.
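
The two-terminal check above could also be scripted. The following is a rough sketch only: the `runConcurrently` and `verificationCommands` helpers are hypothetical names introduced here, and it assumes `oc` and `openshift-tests` are on the PATH and already pointed at the cluster under test. The command lines themselves are taken from the verification steps.

```go
package main

import (
	"os/exec"
	"sync"
)

// runConcurrently starts every command at roughly the same time, waits for
// all of them, and returns one error per command (nil on success) in input
// order. Each argv must be non-empty.
func runConcurrently(cmds [][]string) []error {
	errs := make([]error, len(cmds))
	var wg sync.WaitGroup
	for i, argv := range cmds {
		wg.Add(1)
		go func(i int, argv []string) {
			defer wg.Done()
			// Each goroutine writes only its own slot, so no lock is needed.
			errs[i] = exec.Command(argv[0], argv[1:]...).Run()
		}(i, argv)
	}
	wg.Wait()
	return errs
}

// verificationCommands returns the two commands from the verification steps.
func verificationCommands() [][]string {
	return [][]string{
		{"oc", "adm", "must-gather"},
		{"openshift-tests", "run-test",
			"[Feature:Platform] Managed cluster should have no crashlooping pods in core namespaces over two minutes [Suite:openshift/conformance/parallel]"},
	}
}
```

Calling `runConcurrently(verificationCommands())` and checking that both returned errors are nil would correspond to the "no errors found" result reported here.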

Comment 7 errata-xmlrpc 2019-06-04 10:48:10 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2019:0758