Bug 1826448

Summary: [Feature:DeploymentConfig] deploymentconfigs with test deployments [Conformance] [It] should run a deployment to completion and then scale to zero [Suite:openshift/conformance/parallel/minimal]
Product: OpenShift Container Platform Reporter: Russell Teague <rteague>
Component: openshift-apiserverAssignee: Stefan Schimanski <sttts>
Status: CLOSED DEFERRED QA Contact: Xingxing Xia <xxia>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.11.0CC: aos-bugs, mfojtik, xtian
Target Milestone: ---   
Target Release: 3.11.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-05-19 14:10:53 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1808068    

Description Russell Teague 2020-04-21 17:36:00 UTC
Description of problem:
This e2e test seems to continually fail on release-3.11 across multiple PRs.

https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/pr-logs/pull/openshift_openshift-ansible/12119/pull-ci-openshift-openshift-ansible-release-3.11-e2e-aws/2649

/tmp/openshift/build-rpms/rpm/BUILD/origin-3.11.0/_output/local/go/src/github.com/openshift/origin/test/extended/deployments/deployments.go:320
Expected error:
    <*util.ExitError | 0xc422202a20>: {
        Cmd: "oc scale --config=/tmp/configfile647220998 --namespace=e2e-test-cli-deployment-jkvsh dc/deployment-test --replicas=1",
        StdErr: "Unable to connect to the server: dial tcp: lookup 3.235.106.45.xip.io on 10.142.0.102:53: no such host",
        ExitError: {
            ProcessState: {
                pid: 6494,
                status: 256,
                rusage: {
                    Utime: {Sec: 0, Usec: 257334},
                    Stime: {Sec: 0, Usec: 79784},
                    Maxrss: 98816,
                    Ixrss: 0,
                    Idrss: 0,
                    Isrss: 0,
                    Minflt: 14588,
                    Majflt: 0,
                    Nswap: 0,
                    Inblock: 0,
                    Oublock: 0,
                    Msgsnd: 0,
                    Msgrcv: 0,
                    Nsignals: 0,
                    Nvcsw: 1352,
                    Nivcsw: 12,
                },
            },
            Stderr: nil,
        },
    }
    exit status 1
not to have occurred
/tmp/openshift/build-rpms/rpm/BUILD/origin-3.11.0/_output/local/go/src/github.com/openshift/origin/test/extended/deployments/deployments.go:348


Version-Release number of selected component (if applicable):


How reproducible:

https://deck-ci.svc.ci.openshift.org/?job=*3.11-e2e-aws


Actual results:


Expected results:


Additional info:

Comment 2 Michal Fojtik 2020-05-19 13:12:25 UTC
This bug hasn't had any engineering activity in the last ~30 days. Maybe the problem got resolved, was a duplicate of something else, or became less pressing for some reason - or maybe it's still relevant but just hasn't been looked at yet.

As such, we're marking this bug as "LifecycleStale".

If you have further information on the current state of the bug, please update it and remove the "LifecycleStale" keyword, otherwise this bug will be automatically closed in 7 days. The information can be, for example, that the problem still occurs, that you still want the feature, that more information is needed, or that the bug is (for whatever reason) no longer relevant.

Comment 3 Russell Teague 2020-05-19 14:10:53 UTC
I'm no longer seeing continual failures of this job on 3.11 PRs.  It is still occurring, most recently as 4 days ago, but only matching 4% of failures over the last 14 days.

https://search.apps.build01.ci.devcluster.openshift.com/?search=Fail.*deploymentconfigs+with+test+deployments.*should+run+a+deployment+to+completion+and+then+scale+to+zero&maxAge=336h&context=1&type=build-log&name=3.11-e2e-aws&maxMatches=5&maxBytes=20971520&groupBy=job

"pull-ci-openshift-openshift-ansible-release-3.11-e2e-aws - 129 runs, 70% failed, 4% of failures match"

If this becomes a bigger problem I'll bring it back up.