Bug 1329521

Summary: Failed to install env when master and etcd are on the individual servers
Product: OpenShift Container Platform Reporter: Gan Huang <ghuang>
Component: InstallerAssignee: Andrew Butcher <abutcher>
Status: CLOSED ERRATA QA Contact: Ma xiaoqiang <xiama>
Severity: high Docs Contact:
Priority: high    
Version: 3.2.0CC: aos-bugs, bleanhar, gpei, jokerman, mmccomas, xtian
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-05-12 16:40:51 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Gan Huang 2016-04-22 07:16:51 UTC
Description of problem:
Failed to install when master isn't embedded with etcd

Version-Release number of selected component (if applicable):
openshift-ansible-3.0.83-1.git.0.2043786.el7.noarch

How reproducible:
Always

Steps to Reproduce:
1.Install a env which specifys another host (not master) as etcd
E.g:
[OSEv3:children]
masters
nodes
nfs
etcd

[OSEv3:vars]
ansible_ssh_user=root
deployment_type=openshift-enterprise

[masters]
x.x.172.101  openshift_ip=172.16.120.182 openshift_public_ip=x.x.172.101 openshift_hostname=172.16.120.182 openshift_public_hostname=x.x.172.101
[etcd]
x.x.172.93  openshift_ip=172.16.120.181 openshift_public_ip=x.x.172.93 openshift_hostname=172.16.120.181 openshift_public_hostname=x.x.172.93
[nodes]
x.x.172.101  openshift_ip=172.16.120.182 openshift_public_ip=x.x.172.101 openshift_hostname=172.16.120.182 openshift_public_hostname=x.x.172.101 openshift_schedulable=False
x.x.172.93  openshift_ip=172.16.120.181 openshift_public_ip=x.x.172.93 openshift_hostname=172.16.120.181 openshift_public_hostname=x.x.172.93

[nfs]
x.x.172.101  openshift_ip=172.16.120.182 openshift_public_ip=x.x.172.101 openshift_hostname=172.16.120.182 openshift_public_hostname=x.x.172.101



Actual results:

TASK: [etcd_certificates | Create the client csr] ***************************** 
fatal: [x.x.172.93] => One or more undefined variables: 'dict object' has no attribute 'etcd_hostname'

FATAL: all hosts have already failed -- aborting

PLAY RECAP ******************************************************************** 
           to retry, use: --limit @/root/config.retry

x.x.172.101               : ok=65   changed=14   unreachable=0    failed=0   
x.x.172.93                : ok=154  changed=31   unreachable=1    failed=0   
localhost                  : ok=21   changed=0    unreachable=0    failed=0  

Expected results:
Install successfully

Additional info:
It blocks to add new masters in pre-existing native ha env as the new master is not a etcd role.

Comment 1 Brenton Leanhardt 2016-04-22 11:42:04 UTC
I suspect this is a quick fix so if you have time try to get it in today.

Comment 2 Andrew Butcher 2016-04-22 14:18:51 UTC
Proposed fix: https://github.com/openshift/openshift-ansible/pull/1785

Comment 3 Gan Huang 2016-04-25 07:28:16 UTC
Verified with openshift-ansible-3.0.84-1.git.0.04b5245.el7.noarch

Launch a individual instance to be configured with etcd and the installation successfully.

Move it to verified.

Comment 5 errata-xmlrpc 2016-05-12 16:40:51 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2016:1065