Bug 1377630

Summary: Bringing up the OpenShift cluster fails at node registration.
Product: OpenShift Container Platform
Reporter: Bhaskarakiran <byarlaga>
Component: Installer
Assignee: Scott Dodson <sdodson>
Status: CLOSED INSUFFICIENT_DATA
QA Contact: Johnny Liu <jialiu>
Severity: medium
Docs Contact:
Priority: medium
Version: 3.3.0
CC: aos-bugs, byarlaga, jokerman, mmccomas, mzywusko, wsun
Target Milestone: ---
Flags: sdodson: needinfo? (byarlaga)
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2016-11-01 15:14:55 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Attachments:
Description: ansible log
Flags: none

Description Bhaskarakiran 2016-09-20 08:55:39 UTC
Created attachment 1202780
ansible log

Description of problem:
======================

Bringing up an OpenShift cluster with the atomic-openshift-utils install command, after supplying the master and node details, fails at the node registration step. A snippet of the ansible log follows.

2016-09-20 13:14:43,527 p=31590 u=root |  TASK [openshift_manage_node : Wait for Node Registration] **********************
2016-09-20 13:14:43,993 p=31590 u=root |  ok: [10.70.43.60] => (item=dhcp43-60.lab.eng.blr.redhat.com)
2016-09-20 13:14:44,239 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (50 retries left).
2016-09-20 13:14:49,864 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (49 retries left).
2016-09-20 13:14:55,492 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (48 retries left).
2016-09-20 13:15:01,044 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (47 retries left).
2016-09-20 13:15:07,215 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (46 retries left).
2016-09-20 13:15:12,860 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (45 retries left).
2016-09-20 13:15:18,489 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (44 retries left).
2016-09-20 13:15:24,043 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (43 retries left).
2016-09-20 13:15:29,654 p=31590 u=root |  FAILED - RETRYING: TASK: openshift_manage_node : Wait for Node Registration (42 retries left).



2016-09-20 13:28:50,987 p=31590 u=root |  failed: [10.70.43.60] (item=dhcp42-85.lab.eng.blr.redhat.com) => {"changed": false, "cmd": ["oc", "get", "node", "dhcp42-85.lab.eng.blr.redhat.com"], "delta": "0:00:00.319734", "end": "2016-09-20 13:28:50.944550", "failed": true, "item": "dhcp42-85.lab.eng.blr.redhat.com", "rc": 1, "start": "2016-09-20 13:28:50.624816", "stderr": "Error from server: nodes \"dhcp42-85.lab.eng.blr.redhat.com\" not found", "stdout": "", "stdout_lines": [], "warnings": []}
2016-09-20 13:28:51,004 p=31590 u=root |        to retry, use: --limit @/usr/share/ansible/openshift-ansible/playbooks/byo/openshift-cluster/config.retry

2016-09-20 13:28:51,004 p=31590 u=root |  PLAY RECAP *********************************************************************
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.41.215               : ok=99   changed=14   unreachable=0    failed=1
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.41.243               : ok=99   changed=13   unreachable=0    failed=1
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.42.85                : ok=99   changed=14   unreachable=0    failed=1
2016-09-20 13:28:51,007 p=31590 u=root |  10.70.43.60                : ok=390  changed=93   unreachable=0    failed=1
2016-09-20 13:28:51,007 p=31590 u=root |  localhost                  : ok=14   changed=8    unreachable=0    failed=0
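
To make the PLAY RECAP above easier to read at a glance, the following is a minimal sketch that lists the hosts whose recap line reports failed > 0. The function name failed_hosts and the regular expression are illustrative assumptions based only on the line format shown in this snippet; they are not part of openshift-ansible.

```python
import re

# Assumed format, taken from the recap lines quoted in this report:
#   <timestamp> p=<pid> u=<user> |  <host> : ok=N changed=N unreachable=N failed=N
RECAP_RE = re.compile(
    r"\|\s+(?P<host>\S+)\s+:\s+ok=(?P<ok>\d+)\s+changed=(?P<changed>\d+)\s+"
    r"unreachable=(?P<unreachable>\d+)\s+failed=(?P<failed>\d+)"
)


def failed_hosts(log_text):
    """Return the hosts whose PLAY RECAP line reports failed > 0, in log order."""
    hosts = []
    for line in log_text.splitlines():
        match = RECAP_RE.search(line)
        if match and int(match.group("failed")) > 0:
            hosts.append(match.group("host"))
    return hosts


# Recap lines copied from the log snippet in this report.
SAMPLE = """\
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.41.215               : ok=99   changed=14   unreachable=0    failed=1
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.41.243               : ok=99   changed=13   unreachable=0    failed=1
2016-09-20 13:28:51,005 p=31590 u=root |  10.70.42.85                : ok=99   changed=14   unreachable=0    failed=1
2016-09-20 13:28:51,007 p=31590 u=root |  10.70.43.60                : ok=390  changed=93   unreachable=0    failed=1
2016-09-20 13:28:51,007 p=31590 u=root |  localhost                  : ok=14   changed=8    unreachable=0    failed=0
"""

print(failed_hosts(SAMPLE))
```

For the sample above this prints the four cluster hosts and omits localhost, which reports failed=0.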



Version-Release number of selected component (if applicable):
=============================================================
3.3.0.26-1

How reproducible:
=================
100%

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:
Attaching the ansible log.

Comment 1 Scott Dodson 2016-10-27 00:39:15 UTC
Bhaskarakiran, if this is still happening, can you check the logs from `journalctl -lu atomic-openshift-node` to see why the nodes aren't starting and registering properly?

If this is no longer an issue, let's close this.
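
For re-checking registration by hand once the node service logs have been reviewed, here is a rough sketch of a polling loop around the same `oc get node <name>` command that appears in the failed task output. The function names, the default of 50 retries, and the 5-second delay are assumptions loosely inferred from the log in the description; this is not the installer's actual implementation. The run and sleep parameters are only there so the loop can be exercised without a cluster.

```python
import subprocess
import time


def node_registered(name, run=subprocess.run):
    """Return True if `oc get node <name>` exits with status 0."""
    result = run(["oc", "get", "node", name], capture_output=True, text=True)
    return result.returncode == 0


def wait_for_registration(name, retries=50, delay=5.0,
                          run=subprocess.run, sleep=time.sleep):
    """Poll until the node shows up or the attempts are exhausted.

    One initial attempt is made, followed by up to `retries` more,
    sleeping `delay` seconds between attempts. Both defaults are
    assumptions based on the retry messages in the ansible log.
    """
    for attempt in range(retries + 1):
        if node_registered(name, run=run):
            return True
        if attempt < retries:
            sleep(delay)
    return False
```

On a master with a working `oc` login, calling wait_for_registration("dhcp42-85.lab.eng.blr.redhat.com") would return False if the node never registers, which is the situation the report describes.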