Bug 1753472

Summary: e2e-azure: The HAProxy router should override the route host with a custom value
Product: OpenShift Container Platform Reporter: Kirsten Garrison <kgarriso>
Component: NetworkingAssignee: Miciah Dashiel Butler Masters <mmasters>
Networking sub component: router QA Contact: Hongan Li <hongli>
Status: CLOSED ERRATA Docs Contact:
Severity: medium    
Priority: medium CC: aos-bugs, bbennett, bperkins, dmace
Version: 4.2.0   
Target Milestone: ---   
Target Release: 4.3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-01-23 11:06:22 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Kirsten Garrison 2019-09-19 03:14:47 UTC
Description of problem: Flaky test in e2e-azure-serial:
[Conformance][Area:Networking][Feature:Router] The HAProxy router should override the route host with a custom value [Suite:openshift/conformance/parallel/minimal]:

fail [github.com/openshift/origin/test/extended/router/scoped.go:137]: Unexpected error:
    <*errors.errorString | 0xc003b17e10>: {
        s: "host command failed: error running &{/usr/bin/kubectl [kubectl --server=https://api.ci-op-8zqr6i6n-282fe.ci.azure.devcluster.openshift.com:6443 --kubeconfig=/tmp/admin.kubeconfig exec --namespace=e2e-test-router-scoped-wrv7w execpod -- /bin/sh -x -c \n\t\tset -e\n\t\tSTOP=$(($(date '+%s') + 180))\n\t\twhile [ $(date '+%s') -lt $STOP ]; do\n\t\t\tcode=$( curl -k -s -m 5 -o /dev/null -w '%{http_code}\\n' --header 'Host: 10.129.2.221' \"http://10.129.2.221:1936/healthz\" ) || rc=$?\n\t\t\tif [[ \"${rc:-0}\" -eq 0 ]]; then\n\t\t\t\techo $code\n\t\t\t\tif [[ $code -eq 200 ]]; then\n\t\t\t\t\texit 0\n\t\t\t\tfi\n\t\t\t\tif [[ $code -ne 503 ]]; then\n\t\t\t\t\texit 1\n\t\t\t\tfi\n\t\t\telse\n\t\t\t\techo \"error ${rc}\" 1>&2\n\t\t\tfi\n\t\t\tsleep 1\n\t\tdone\n\t\t] []  <nil> 500\n + set -e\n+ date +%s\n+ STOP=1568847433\n+ date +%s\n+ [ 1568847253 -lt 1568847433 ]\n+ curl -k -s -m 5 -o /dev/null -w %{http_code}\\n --header Host: 10.129.2.221 http://10.129.2.221:1936/healthz\n+ code=500\n+ [[ 0 -eq 0 ]]\n+ echo 500\n+ [[ 500 -eq 200 ]]\n+ [[ 500 -ne 503 ]]\n+ exit 1\ncommand terminated with exit code 1\n [] <nil> 0xc003128ae0 exit status 1 <nil> <nil> true [0xc00206e4d8 0xc00206e4f0 0xc00206e508] [0xc00206e4d8 0xc00206e4f0 0xc00206e508] [0xc00206e4e8 0xc00206e500] [0x95ade0 0x95ade0] 0xc00376f020 <nil>}:\nCommand stdout:\n500\n\nstderr:\n+ set -e\n+ date +%s\n+ STOP=1568847433\n+ date +%s\n+ [ 1568847253 -lt 1568847433 ]\n+ curl -k -s -m 5 -o /dev/null -w %{http_code}\\n --header Host: 10.129.2.221 http://10.129.2.221:1936/healthz\n+ code=500\n+ [[ 0 -eq 0 ]]\n+ echo 500\n+ [[ 500 -eq 200 ]]\n+ [[ 500 -ne 503 ]]\n+ exit 1\ncommand terminated with exit code 1\n\nerror:\nexit status 1\n\n",
    }
    host command failed: error running &{/usr/bin/kubectl [kubectl --server=https://api.ci-op-8zqr6i6n-282fe.ci.azure.devcluster.openshift.com:6443 --kubeconfig=/tmp/admin.kubeconfig exec --namespace=e2e-test-router-scoped-wrv7w execpod -- /bin/sh -x -c 
    		set -e
    		STOP=$(($(date '+%s') + 180))
    		while [ $(date '+%s') -lt $STOP ]; do
    			code=$( curl -k -s -m 5 -o /dev/null -w '%{http_code}\n' --header 'Host: 10.129.2.221' "http://10.129.2.221:1936/healthz" ) || rc=$?
    			if [[ "${rc:-0}" -eq 0 ]]; then
    				echo $code
    				if [[ $code -eq 200 ]]; then
    					exit 0
    				fi
    				if [[ $code -ne 503 ]]; then
    					exit 1
    				fi
    			else
    				echo "error ${rc}" 1>&2
    			fi
    			sleep 1
    		done
    		] []  <nil> 500
     + set -e
    + date +%s
    + STOP=1568847433
    + date +%s
    + [ 1568847253 -lt 1568847433 ]
    + curl -k -s -m 5 -o /dev/null -w %{http_code}\n --header Host: 10.129.2.221 http://10.129.2.221:1936/healthz
    + code=500
    + [[ 0 -eq 0 ]]
    + echo 500
    + [[ 500 -eq 200 ]]
    + [[ 500 -ne 503 ]]
    + exit 1
    command terminated with exit code 1
     [] <nil> 0xc003128ae0 exit status 1 <nil> <nil> true [0xc00206e4d8 0xc00206e4f0 0xc00206e508] [0xc00206e4d8 0xc00206e4f0 0xc00206e508] [0xc00206e4e8 0xc00206e500] [0x95ade0 0x95ade0] 0xc00376f020 <nil>}:
    Command stdout:
    500
    
    stderr:
    + set -e
    + date +%s
    + STOP=1568847433
    + date +%s
    + [ 1568847253 -lt 1568847433 ]
    + curl -k -s -m 5 -o /dev/null -w %{http_code}\n --header Host: 10.129.2.221 http://10.129.2.221:1936/healthz
    + code=500
    + [[ 0 -eq 0 ]]
    + echo 500
    + [[ 500 -eq 200 ]]
    + [[ 500 -ne 503 ]]
    + exit 1
    command terminated with exit code 1
    
    error:
    exit status 1
    
    
occurred

see: https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/canary-openshift-ocp-installer-e2e-azure-4.2/309

Comment 1 Kirsten Garrison 2019-09-19 03:21:24 UTC
Note this test is flakey, it failed ~ 25% runs today.

Comment 6 Hongan Li 2019-11-04 01:42:24 UTC
didn't see this issue since release-openshift-ocp-installer-e2e-azure-4.3 #198, so moving to verified.

Comment 8 errata-xmlrpc 2020-01-23 11:06:22 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0062