Bug 1426936

Summary: Unable to get router replicas during installation when setting openshift_hosted_router_replicas
Product: OpenShift Container Platform
Reporter: Gaoyun Pei <gpei>
Component: Installer
Assignee: Russell Teague <rteague>
Status: CLOSED CURRENTRELEASE
QA Contact: Gaoyun Pei <gpei>
Severity: high
Priority: high
Version: 3.5.0
CC: aos-bugs, jialiu, jokerman, mmccomas, wmeng
Target Milestone: ---
Keywords: Regression
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Last Closed: 2017-03-08 13:43:35 UTC
Type: Bug

Description Gaoyun Pei 2017-02-26 14:18:40 UTC
Description of problem:
When setting up an OCP 3.5 cluster, the installer failed at the step "openshift_hosted : set_fact replicas":

...
TASK [openshift_hosted : Retrieve list of openshift nodes matching router selector] ***
Sunday 26 February 2017  08:44:04 +0000 (0:00:00.154)       0:22:12.466 ******* 
skipping: [host-x.host.redhat.com] => {
    "changed": false, 
    "skip_reason": "Conditional check failed", 
    "skipped": true
}

TASK [openshift_hosted : set_fact replicas] ************************************
Sunday 26 February 2017  08:44:04 +0000 (0:00:00.187)       0:22:12.654 ******* 
fatal: [host-x.host.redhat.com]: FAILED! => {
    "failed": true
}

MSG:

the field 'args' has an invalid value, which appears to include a variable that is undefined. The error was: 'dict object' has no attribute 'results'

The error appears to have been in 'openshift-ansible/roles/openshift_hosted/tasks/router/router.yml': line 11, column 3, but may
be elsewhere in the file depending on the exact syntax problem.

The offending line appears to be:


- name: set_fact replicas
  ^ here



Version-Release number of selected component (if applicable):
openshift-ansible-3.5.15-1.git.0.8d2a456.el7.noarch.rpm

How reproducible:
Always

Steps to Reproduce:
1. Prepare the Ansible inventory file and start the install playbook.


Actual results:
As described above.

Expected results:


Additional info:
This looks like a regression introduced by https://github.com/openshift/openshift-ansible/pull/3423
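
One possible reading of the failure, sketched in Python rather than Ansible/Jinja2: the log shows the node lookup task being skipped and reporting only "changed", "skip_reason" and "skipped", and the next task then fails because a dict has no 'results' attribute. The sketch below models that situation. The function names, the default value and the shape of the successful lookup are hypothetical and are not taken from router.yml or from the actual fix.

```python
# Hypothetical model of the variable registered by the skipped task.
# Only the keys visible in the log output above are included.
skipped_result = {
    "changed": False,
    "skip_reason": "Conditional check failed",
    "skipped": True,
}


def count_nodes_unguarded(registered):
    # Assumes the lookup task always ran and returned a 'results' list.
    # With a skipped task this raises KeyError, the Python analogue of
    # "'dict object' has no attribute 'results'" in Jinja2.
    return len(registered["results"])


def pick_replicas(configured, registered, default=1):
    """Prefer an explicitly configured replica count.

    Only fall back to counting nodes when the lookup actually ran;
    otherwise return a default (an assumption made for this sketch).
    """
    if configured is not None:
        return int(configured)
    if registered.get("skipped") or "results" not in registered:
        return default
    return len(registered["results"])
```

With an explicit replica count, pick_replicas never touches the lookup result, whereas count_nodes_unguarded(skipped_result) raises KeyError. This is the same class of error as the one reported above, assuming that reading of the log is right.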

Comment 2 Gaoyun Pei 2017-02-26 14:31:12 UTC
Hit a different error in another attempt with the same Ansible inventory, also a containerized installation on Atomic Host with openshift-ansible-3.5.15-1.git.0.8d2a456.el7.noarch.rpm:


TASK [openshift_hosted : include] **********************************************
Sunday 26 February 2017  14:11:40 +0000 (0:00:00.205)       0:22:31.469 ******* 
skipping: [host-x.redhat.com] => {
    "changed": false, 
    "skip_reason": "Conditional check failed", 
    "skipped": true
}

TASK [openshift_hosted : Set facts for persistent volume] **********************
Sunday 26 February 2017  14:11:40 +0000 (0:00:00.160)       0:22:31.630 ******* 
fatal: [host-x.redhat.com]: FAILED! => {
    "failed": true
}

MSG:

the field 'args' has an invalid value, which appears to include a variable that is undefined. The error was: 'pvc_volume_mounts' is undefined

The error appears to have been in 'openshift-ansible/roles/openshift_hosted/tasks/registry/registry.yml': line 67, column 3, but may
be elsewhere in the file depending on the exact syntax problem.

The offending line appears to be:


- name: Set facts for persistent volume
  ^ here

Comment 3 Gaoyun Pei 2017-02-27 04:49:10 UTC
After some more attempts, I found that this issue is triggered by setting the option openshift_hosted_router_replicas in the inventory, regardless of whether the install is RPM-based or containerized.

The error in Comment 2 is a separate issue related to the nfs/openstack registry storage kind; I filed BZ#1427009 to track it.

Comment 4 Russell Teague 2017-02-27 20:48:19 UTC
The original issue should be addressed by
https://github.com/openshift/openshift-ansible/pull/3503

Comment 6 Gaoyun Pei 2017-02-28 05:26:58 UTC
Verified this bug with openshift-ansible-3.5.17-1.git.0.561702e.el7.noarch.rpm

With openshift_hosted_router_replicas=1 set in the Ansible inventory, the router was created successfully.