Bug 1403864

Summary: OCP deployments failing Actions::Fusor::Deployment::OpenShift::Deploy with message undefined method 'join' for nil:NilClass
Product: Red Hat Quickstart Cloud Installer Reporter: Landon LaSmith <llasmith>
Component: Installation - OpenShiftAssignee: Jason Montleon <jmontleo>
Status: CLOSED ERRATA QA Contact: Antonin Pagac <apagac>
Severity: unspecified Docs Contact: Derek <dcadzow>
Priority: unspecified    
Version: 1.1CC: apagac, arubin, bthurber, jmatthew, jmontleo, joden, qci-bugzillas
Target Milestone: ---Keywords: Triaged
Target Release: 1.1   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-02-28 01:42:01 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Bug Depends On: 1370282    
Bug Blocks:    

Description Landon LaSmith 2016-12-12 13:59:08 UTC
Description of problem: QCI deployments have been failing with the during the Actions::Fusor::Deployment::OpenShift::Deploy task with the message "undefined method 'join' for nil:NilClass".

QCI Media Version: QCI-1.1-RHEL-7-20161209.t.0

How reproducible: We've seen multiple instances of this error across RHV and RHVSH deployments with CFME


Steps to Reproduce:
1. Install QCI from ISO
2. Create RHV + CFME + OCP deployment
3. Start the deployment

Actual results: Deployment fails


Expected results: Deployment succeeds


Additional info: 
This has been occuring in our automated deployments.  Right now this has been isolated to API deployments since the OCP webui has changed.  Issue might be a change in the deployment json.

Currently there are no logs from the automated runs but will try to recreate manually and follow up with more info

Comment 3 Landon LaSmith 2016-12-12 15:59:58 UTC
Deployment is failing during subtask Actions::Fusor::Deployment::OpenShift::SetupOSE. First few lines of the stacktrace

 | /opt/theforeman/tfm/root/usr/share/gems/gems/fusor_server-1.1.16/lib/modules/ose_installer/launch.rb:99:in `write_inventory'
 | /opt/theforeman/tfm/root/usr/share/gems/gems/fusor_server-1.1.16/lib/modules/ose_installer/launch.rb:399:in `prepare'                                                                                                      
 | /opt/theforeman/tfm/root/usr/share/gems/gems/fusor_server-1.1.16/app/lib/actions/fusor/deployment/open_shift/setup_ose.rb:34:in `run'
 | /opt/theforeman/tfm/root/usr/share/gems/gems/dynflow-0.8.13.1/lib/dynflow/action.rb:506:in `block (3 levels) in execute_run'
 | /opt/theforeman/tfm/root/usr/share/gems/gems/dynflow-0.8.13.1/lib/dynflow/middleware/stack.rb:26:in `call'
 | /opt/theforeman/tfm/root/usr/share/gems/gems/dynflow-0.8.13.1/lib/dynflow/middleware/stack.rb:26:in `pass'
 | /opt/theforeman/tfm/root/usr/share/gems/gems/dynflow-0.8.13.1/lib/dynflow/middleware.rb:17:in `pass'
...

Comment 5 James Olin Oden 2016-12-12 19:23:27 UTC
I just saw this in a manual deployment.   I was installing:

   RHV(self hosted, 4 hypervisors) + OCP(4 nodes) + CFME

And it dies while deploying OCP, and when I finally got to the failed sub task it showed:

   undefined method `join' for nil:NilClass

The subtask was named "Deploy OpenShift Enterprise {"locale"=>"en"}"

Comment 6 Jason Montleon 2016-12-12 19:57:31 UTC
ansible-ocp is broken. This is due to the HA work and should be fixed fairly soon. This actual issue along with two or three others are resolved, but installs will fail due to a final issue. Once dgao submits a PR we'll get a working package over.

Comment 8 Antonin Pagac 2016-12-15 14:20:13 UTC
Didn't see the issue in QCI-1.1-RHEL-7-20161214.t.0, marking as verified.

Comment 11 errata-xmlrpc 2017-02-28 01:42:01 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2017:0335