Bug 1504515

Summary: Upgrade fails because the installer tries to stop atomic-openshift-master-controllers on an etcd host
Product: OpenShift Container Platform
Reporter: liujia <jiajliu>
Component: Cluster Version Operator
Assignee: Russell Teague <rteague>
Status: CLOSED ERRATA
QA Contact: liujia <jiajliu>
Severity: high
Docs Contact:
Priority: high
Version: 3.7.0
CC: aos-bugs, jokerman, mmccomas
Target Milestone: ---
Target Release: 3.7.0
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text: Corrected host group to specify masters group for controller restart.
Story Points: ---
Clone Of:
Environment:
Last Closed: 2017-11-28 22:18:08 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description liujia 2017-10-20 05:11:17 UTC
Description of problem:
Upgrading OCP with a dedicated etcd host from v3.6 to v3.7 fails at the task [Stop atomic-openshift-master-controllers] because the play targets the wrong host group.

TASK [Stop atomic-openshift-master-controllers] *****************************************************************************************************************************
fatal: [x.x.x.x]: FAILED! => {"changed": false, "failed": true, "msg": "Could not find the requested service atomic-openshift-master-controllers: host"}
	to retry, use: --limit @/usr/share/ansible/openshift-ansible/playbooks/byo/openshift-cluster/upgrades/v3_7/upgrade.retry

Inventory (abbreviated; master A and etcd B are separate hosts):
[OSEv3:children]
masters
nodes
etcd
...
[masters]
A
..
[etcd]
B

Play that runs the failing task (it targets oo_etcd_to_config):
- name: Cycle all controller services to force new leader election mode
  hosts: oo_etcd_to_config
  gather_facts: no
  tasks:
  - name: Stop {{ openshift.common.service_type }}-master-controllers
    systemd:
      name: "{{ openshift.common.service_type }}-master-controllers"
      state: stopped
  - name: Start {{ openshift.common.service_type }}-master-controllers
    systemd:
      name: "{{ openshift.common.service_type }}-master-controllers"
      state: started
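
The play above runs against oo_etcd_to_config, so on a dedicated etcd host there is no master-controllers service to stop. A minimal sketch of the corrected play follows, assuming the fix only retargets the play at the masters group (oo_masters_to_config, the group openshift-ansible uses for masters). The exact change in the merged pull request is not reproduced here and may differ.

```yaml
# Sketch only: assumes the fix is limited to changing the target host group.
- name: Cycle all controller services to force new leader election mode
  hosts: oo_masters_to_config   # assumed replacement for oo_etcd_to_config
  gather_facts: no
  tasks:
  # Stop, then start, the controllers service on each master.
  - name: Stop {{ openshift.common.service_type }}-master-controllers
    systemd:
      name: "{{ openshift.common.service_type }}-master-controllers"
      state: stopped
  - name: Start {{ openshift.common.service_type }}-master-controllers
    systemd:
      name: "{{ openshift.common.service_type }}-master-controllers"
      state: started
```

With the inventory above, this play would run only on host A and skip host B.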


Version-Release number of the following components:
openshift-ansible-3.7.0-0.167.0.git.0.0e34535.el7.noarch
ansible-2.4.1.0-0.1.beta2.el7.noarch

How reproducible:
always

Steps to Reproduce:
1. Install OCP 3.6 with a dedicated etcd host
2. Upgrade from 3.6 to 3.7

Actual results:
Upgrade failed.

Expected results:
Upgrade succeeds.

Additional info:
Please attach logs from ansible-playbook with the -vvv flag

Comment 1 Russell Teague 2017-10-23 14:18:49 UTC
Merged: https://github.com/openshift/openshift-ansible/pull/5844

Comment 2 Russell Teague 2017-10-24 13:16:00 UTC
$ git tag --contains 27f062260c5717091786f638fcc31fcc2d4779af
openshift-ansible-3.7.0-0.177.0

Comment 4 liujia 2017-10-26 07:26:57 UTC
Version:
openshift-ansible-3.7.0-0.179.0.git.0.a2641b6.el7.noarch

The issue has been fixed.

Comment 7 errata-xmlrpc 2017-11-28 22:18:08 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2017:3188