Bug 1654343 - router deployment fails in openshift ansible installer 3.7
Summary: router deployment fails in openshift ansible installer 3.7
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Routing
Version: 3.7.1
Hardware: Unspecified
OS: Linux
Target Milestone: ---
: ---
Assignee: Dan Mace
QA Contact: Hongan Li
Depends On:
Blocks: 1644546
TreeView+ depends on / blocked
Reported: 2018-11-28 14:58 UTC by Paul Gier
Modified: 2019-04-22 01:37 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Last Closed: 2019-01-29 19:20:01 UTC
Target Upstream Version:

Attachments (Terms of Use)
ansible.log (118.30 KB, text/plain)
2018-11-28 15:09 UTC, Paul Gier
no flags Details

Description Paul Gier 2018-11-28 14:58:24 UTC
Description of problem:
When installing openshift 3.7 on AWS, the install fails during the openshift-hosted phase.

Version-Release number of the following components:

rpm -q openshift-ansible
  using latest from release-3.7 branch
rpm -q ansible
ansible --version

How reproducible:  This happens every time for me.

Steps to Reproduce:
1. RHEL 7.6 image on AWS
2. Using all-in-one inventory
3. Latest installer on release-3.7 branch

Actual results:

STDOUT: Waiting for rollout to finish: 0 out of 1 new replicas have been updated...
STDERR: error: replication controller "router-1" has failed progressing
MSG: non-zero return code

Comment 1 Paul Gier 2018-11-28 15:00:30 UTC
The router-1 deployment log contains this:

error: couldn't get deployment router-1: Get dial tcp getsockopt: no route to host

Comment 2 Paul Gier 2018-11-28 15:09:36 UTC
Created attachment 1509541 [details]

Comment 3 Vadim Rutkovsky 2018-11-28 15:10:13 UTC
Seems API is up (we verify that during master install), but routers can't reach it via SDN

Comment 5 Kurt Stam 2019-04-20 21:46:38 UTC
There is no workaround? I'm using the aarch64 bistro and am running into the same issue. Any pointers would be super helpful. Thx! --Kurt (at Red Hat)

Comment 6 Kurt Stam 2019-04-21 01:32:37 UTC
BTW same exact issue on a Centos-7.6 x86_64 VM with OpenShift 3.7.1 running on my local box.

Comment 7 Kurt Stam 2019-04-22 01:37:18 UTC
The workaround can be found here: https://github.com/Project31/rh-middleware-on-arm/issues/10, downgrade iptables to the version that ships with 7.5.

Note You need to log in before you can comment on or make changes to this bug.