Bug 1504593 - Installer doesn't report the installer status correctly if openshift health checks failed
Summary: Installer doesn't report the installer status correctly if openshift health c...
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 3.7.0
Hardware: Unspecified
OS: Unspecified
Target Milestone: ---
: 3.8.0
Assignee: Russell Teague
QA Contact: Gan Huang
Depends On:
TreeView+ depends on / blocked
Reported: 2017-10-20 09:50 UTC by Gan Huang
Modified: 2017-11-28 22:18 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
The OpenShift Health Checker was not part of an Installer Phase and was not reported after playbook execution. The OpenShift Health Checker section of the primary installer path has been moved to its own section and an installer 'phase' has been added to report on installer status.
Clone Of:
Last Closed: 2017-11-28 22:18:47 UTC
Target Upstream Version:

Attachments (Terms of Use)

System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2017:3188 0 normal SHIPPED_LIVE Moderate: Red Hat OpenShift Container Platform 3.7 security, bug, and enhancement update 2017-11-29 02:34:54 UTC

Description Gan Huang 2017-10-20 09:50:27 UTC
Description of problem:
If a openshift health check failure occurs during execution, installer didn't report the failed playbook in `INSTALLER STATUS`, instead it indicated that the whole playbook run successfully

Version-Release number of the following components:

How reproducible:

Steps to Reproduce:
1. Trigger installation with openshift health check enabled

Actual results:
Installer failed at openshift health check, but `INSTALLER STATUS` didn't report the failure.

PLAY RECAP *********************************************************************
localhost                  : ok=27   changed=0    unreachable=0    failed=0   
qe-jliu-37rpm-lb-1.1020-wcd.qe.rhcloud.com : ok=42   changed=13   unreachable=0    failed=0   
qe-jliu-37rpm-master-etcd-1.1020-wcd.qe.rhcloud.com : ok=44   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-master-etcd-2.1020-wcd.qe.rhcloud.com : ok=43   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-master-etcd-3.1020-wcd.qe.rhcloud.com : ok=43   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-node-primary-1.1020-wcd.qe.rhcloud.com : ok=42   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-node-primary-2.1020-wcd.qe.rhcloud.com : ok=42   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-node-registry-router-1.1020-wcd.qe.rhcloud.com : ok=42   changed=6    unreachable=0    failed=1   
qe-jliu-37rpm-node-registry-router-2.1020-wcd.qe.rhcloud.com : ok=42   changed=6    unreachable=0    failed=1   

INSTALLER STATUS ***************************************************************
Initialization             : Complete
etcd Install               : Complete
NFS Install                : Not Started
Load balancer Install      : Complete
Master Install             : Complete
Master Additional Install  : Complete
Node Install               : Complete
GlusterFS Install          : Not Started
Hosted Install             : Complete
Metrics Install            : Not Started
Logging Install            : Not Started
Service Catalog Install    : Not Started

Failure summary:

  1. Hosts:    qe-jliu-37rpm-master-etcd-1.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-master-etcd-2.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-master-etcd-3.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-node-primary-1.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-node-primary-2.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-node-registry-router-1.1020-wcd.qe.rhcloud.com, qe-jliu-37rpm-node-registry-router-2.1020-wcd.qe.rhcloud.com
     Play:     Verify Requirements
     Task:     openshift_health_check
     Message:  One or more checks failed
     Details:  check "docker_image_availability":
               One or more required Docker images are not available:
               Configured registries: registry.reg-aws.openshift.com:443, registry.access.redhat.com
               Checked by: timeout 10 skopeo inspect --tls-verify=false docker://{registry}/{image}

Expected results:

Additional info:
Please attach logs from ansible-playbook with the -vvv flag

Comment 1 Russell Teague 2017-10-26 12:26:55 UTC
I already have a PR open to address this.
Proposed: https://github.com/openshift/openshift-ansible/pull/5742

Comment 2 Russell Teague 2017-10-27 16:21:08 UTC
Merged: https://github.com/openshift/openshift-ansible/pull/5742

Comment 3 Russell Teague 2017-10-31 18:53:18 UTC
$ git tag --contains c66536bc27db98232ba1e231cfdee48a72936d5b

Comment 5 Gan Huang 2017-11-01 06:00:21 UTC
Tested in openshift-ansible-3.7.0-0.188.0.git.0.aebb674.el7.noarch.rpm

Installer aborted and reported the solution.

INSTALLER STATUS ***************************************************************
Initialization             : Complete
Health Check               : In Progress
	This phase can be restarted by running: playbooks/byo/openshift-checks/pre-install.yml

Comment 8 errata-xmlrpc 2017-11-28 22:18:47 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.


Note You need to log in before you can comment on or make changes to this bug.