Description of problem: When installing openshift 3.7 on AWS, the install fails during the openshift-hosted phase. Version-Release number of the following components: rpm -q openshift-ansible using latest from release-3.7 branch rpm -q ansible ansible-2.4.6.0-1.el7ae.noarch ansible --version ansible 2.4.6.0 How reproducible: This happens every time for me. Steps to Reproduce: 1. RHEL 7.6 image on AWS 2. Using all-in-one inventory 3. Latest installer on release-3.7 branch Actual results: STDOUT: Waiting for rollout to finish: 0 out of 1 new replicas have been updated... STDERR: error: replication controller "router-1" has failed progressing MSG: non-zero return code
The router-1 deployment log contains this: error: couldn't get deployment router-1: Get https://172.30.0.1:443/api/v1/namespaces/default/replicationcontrollers/router-1: dial tcp 172.30.0.1:443: getsockopt: no route to host
Created attachment 1509541 [details] ansible.log
Seems API is up (we verify that during master install), but routers can't reach it via SDN
There is no workaround? I'm using the aarch64 bistro and am running into the same issue. Any pointers would be super helpful. Thx! --Kurt (at Red Hat)
BTW same exact issue on a Centos-7.6 x86_64 VM with OpenShift 3.7.1 running on my local box.
The workaround can be found here: https://github.com/Project31/rh-middleware-on-arm/issues/10, downgrade iptables to the version that ships with 7.5.