Bug 2021324

Summary: oc explain tests are flaky in e2e-metal-ipi-ovn-ipv6 jobs
Product: OpenShift Container Platform Reporter: sdasu
Component: Installer Assignee: sdasu
Installer sub component: OpenShift on Bare Metal IPI QA Contact: Amit Ugol <augol>
Status: CLOSED NOTABUG Docs Contact:
Severity: medium    
Priority: medium CC: wking
Version: 4.8 Keywords: Triaged
Target Milestone: ---   
Target Release: 4.10.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-12-17 16:33:04 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description sdasu 2021-11-08 20:35:27 UTC
The "oc explain" tests fail frequently, and each failure involves a different resource.
For example:
In https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.8-e2e-metal-ipi-ovn-ipv6/1449206905837719552, it is:
failed to explain ["clusterroles.rules" "--api-version=authorization.openshift.io/v1"]
In https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.8-e2e-metal-ipi-ovn-ipv6/1449992496774909952, it is:
failed to explain ["deploymentconfigs.status.replicas" "--api-version=apps.openshift.io/v1"]
In https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.8-e2e-metal-ipi-ovn-ipv6/1453388863056646144, it is:
failed to explain ["installplans" "--api-version=operators.coreos.com/v1alpha1"]
and so on.
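
To check whether the failures really are spread across different resources rather than concentrated on a few, the failure messages can be tallied. The following is only a rough sketch, assuming the messages keep the `failed to explain ["<resource>" "--api-version=<group/version>"]` shape quoted above. The helper names `parse_failure` and `tally_failures` are made up for illustration and are not part of openshift/origin.

```python
import re
from collections import Counter

# Assumed message shape, taken from the failures quoted in this report:
#   failed to explain ["<resource>" "--api-version=<group/version>"]
# Only the unescaped form is matched, so the Go-struct dump of the same
# error (which contains \" escapes) is not counted a second time.
FAILURE_RE = re.compile(
    r'failed to explain \["(?P<resource>[^"]+)" "--api-version=(?P<api>[^"]+)"\]'
)


def parse_failure(line):
    """Return (resource, api_version) for a failure line, or None if no match."""
    match = FAILURE_RE.search(line)
    if match is None:
        return None
    return match.group("resource"), match.group("api")


def tally_failures(lines):
    """Count how often each (resource, api_version) pair appears in the lines."""
    counts = Counter()
    for line in lines:
        parsed = parse_failure(line)
        if parsed is not None:
            counts[parsed] += 1
    return counts


if __name__ == "__main__":
    # Sample lines copied from the three job runs listed above.
    sample = [
        'failed to explain ["clusterroles.rules" "--api-version=authorization.openshift.io/v1"]',
        'failed to explain ["deploymentconfigs.status.replicas" "--api-version=apps.openshift.io/v1"]',
        'failed to explain ["installplans" "--api-version=operators.coreos.com/v1alpha1"]',
    ]
    for (resource, api), count in sorted(tally_failures(sample).items()):
        print(f"{count}  {resource}  ({api})")
```

If every pair shows up about once across many runs, that points toward a cause outside the individual resources.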

We see similar messages in the logs:
[AfterEach] [sig-cli] oc explain
  github.com/openshift/origin/test/extended/util/client.go:141
STEP: Destroying namespace "e2e-test-oc-explain-hx9f4" for this suite.
fail [github.com/openshift/origin/test/extended/cli/explain.go:450]: Unexpected error:
    <*errors.errorString | 0xc002e7c7a0>: {
        s: "failed to explain [\"installplans\" \"--api-version=operators.coreos.com/v1alpha1\"]: exit status 1",
    }
    failed to explain ["installplans" "--api-version=operators.coreos.com/v1alpha1"]: exit status 1
occurred

This indicates that the namespace required to run the test is deleted before the test has finished executing.

Comment 1 sdasu 2021-12-17 16:33:04 UTC
As a result of the fix added for https://bugzilla.redhat.com/show_bug.cgi?id=2018208, these tests are no longer failing frequently. At this point it is safe to close this BZ and the attached PR.