Bug 1789440

Summary: [4.1.z] Upgrade tests timeout reaching AWS Load Balancer
Product: OpenShift Container Platform Reporter: Adam Kaplan <adam.kaplan>
Component: NetworkingAssignee: Casey Callendrello <cdc>
Networking sub component: openshift-sdn QA Contact: zhaozhanqi <zzhao>
Status: CLOSED DUPLICATE Docs Contact:
Severity: urgent    
Priority: unspecified CC: bbennett, wking
Version: 4.1.z   
Target Milestone: ---   
Target Release: 4.4.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: buildcop
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-01-10 03:34:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Adam Kaplan 2020-01-09 15:25:33 UTC
Description of problem:

release-openshift-origin-installer-e2e-aws-upgrade-4.1 jobs timeout trying to get the cluster operators through the AWS ELB:

```
Jan  9 14:41:10.825: INFO: unable to retrieve cluster version during upgrade: Get https://api.ci-op-s2rmnisg-281c8.origin-ci-int-aws.dev.rhcloud.com:6443/apis/config.openshift.io/v1/clusterversions/version: dial tcp 54.161.229.250:6443: connect: connection refused
Jan  9 14:42:01.114: INFO: Got error testing for reachability of http://ac343a9f832e911eabdcd12189ace398-13018433.us-east-1.elb.amazonaws.com:80/echo?msg=hello: Get http://ac343a9f832e911eabdcd12189ace398-13018433.us-east-1.elb.amazonaws.com:80/echo?msg=hello: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)
```

Version-Release number of selected component (if applicable): 4.1.z

How Reproducible: 60% of last 3 hours


Additional info:

[1] https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade-4.1/1113

Comment 1 Adam Kaplan 2020-01-09 18:14:11 UTC
This is potentially an AWS infra issue (unclear if it is temporary or needs a work-around). 4.1 upgrade tests are passing again.

Comment 2 W. Trevor King 2020-01-10 03:34:56 UTC
E.g. see the error message quoted in [1].

[1]: https://bugzilla.redhat.com/show_bug.cgi?id=1690087#c1

*** This bug has been marked as a duplicate of bug 1690087 ***