[sig-network] Services should respect internalTrafficPolicy=Local Pod and Node, to Pod (hostNetwork: true) [Feature:ServiceInternalTrafficPolicy] [Skipped:Network/OVNKubernetes] [Suite:openshift/conformance/parallel] [Suite:k8s] See the (to date) newest 4.10 parallel jobs: Jan 24 11:21:03: https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-shiftstack-shiftstack-ci-main-periodic-4.10-e2e-openstack-parallel/1485558245945774080 Jan 23 11:21:01: https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-shiftstack-shiftstack-ci-main-periodic-4.10-e2e-openstack-parallel/1485195853701320704 Jan 22 11:21:01: https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-shiftstack-shiftstack-ci-main-periodic-4.10-e2e-openstack-parallel/1484833462501249024 Jan 21 11:20:31: https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-shiftstack-shiftstack-ci-main-periodic-4.10-e2e-openstack-parallel/1484470930267705344
It might just be a short hostname vs FDQN thing, where openstack would behave differently than the other cloud platforms: fail [k8s.io/kubernetes.0/test/e2e/network/util.go:170]: Expected <string>: bl3gfvsx-1cd93-5v8rq-worker-0-5kmrj.novalocal to equal <string>: bl3gfvsx-1cd93-5v8rq-worker-0-5kmrj In that case the test would need to be fixed.
According to https://github.com/openshift/kubernetes/commit/752a532c3d0819759f98821a94f26193b494c3d5 the return from the agnhost "/hostname" endpoint should be the hostname and not the FQDN. Which would mean the is fine, and the "/hostname" endpoint is the one that needs fixing.
The test seems to be consistently failing the openstack 4.10 parallel job since Jan 21 11:20:31.
Continuing my exploration of the agnhost image. Looks like the "/hostname" endpoint implementation [1] simply return the output of os.Hostname() [2], which in turn gets the hostname from the uname syscall or /proc/sys/kernel/hostname [3] [1] https://github.com/kubernetes/kubernetes/blob/master/test/images/agnhost/netexec/netexec.go#L652 [2] https://pkg.go.dev/os#Hostname [3] https://cs.opensource.google/go/go/+/refs/tags/go1.17.6:src/os/sys_linux.go;drc=refs%2Ftags%2Fgo1.17.6;l=12 Which makes me think the assumption from https://github.com/openshift/kubernetes/commit/752a532c3d0819759f98821a94f26193b494c3d5 is incorrect.
Posted an upstream patch at https://github.com/kubernetes/kubernetes/pull/107786 to fix the failing test.
Setting blocker- because this bug has been triaged as sev LOW
Can we get this bug to verified so we can proceed with merging to 4.10?
Modified would also work according to bot, presumably we need to see https://github.com/openshift/origin/pull/26805 merge to origin so we can verify.
self-verifying, we don't see the issue anymore in CI.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:5069