Description of problem:
When installing openshift 3.7 on AWS, the install fails during the openshift-hosted phase.
Version-Release number of the following components:
rpm -q openshift-ansible
using latest from release-3.7 branch
rpm -q ansible
How reproducible: This happens every time for me.
Steps to Reproduce:
1. RHEL 7.6 image on AWS
2. Using all-in-one inventory
3. Latest installer on release-3.7 branch
STDOUT: Waiting for rollout to finish: 0 out of 1 new replicas have been updated...
STDERR: error: replication controller "router-1" has failed progressing
MSG: non-zero return code
The router-1 deployment log contains this:
error: couldn't get deployment router-1: Get https://172.30.0.1:443/api/v1/namespaces/default/replicationcontrollers/router-1: dial tcp 172.30.0.1:443: getsockopt: no route to host
Created attachment 1509541 [details]
Seems API is up (we verify that during master install), but routers can't reach it via SDN
There is no workaround? I'm using the aarch64 bistro and am running into the same issue. Any pointers would be super helpful. Thx! --Kurt (at Red Hat)
BTW same exact issue on a Centos-7.6 x86_64 VM with OpenShift 3.7.1 running on my local box.
The workaround can be found here: https://github.com/Project31/rh-middleware-on-arm/issues/10, downgrade iptables to the version that ships with 7.5.