Bug 1852801 - [osp16.1][update] Ansible update is not happening during undercloud update.
Summary: [osp16.1][update] Ansible update is not happening during undercloud update.
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-common
Version: 16.1 (Train)
Hardware: Unspecified
OS: Unspecified
Priority: urgent
Severity: urgent
Target Milestone: ga
Target Release: 16.1 (Train on RHEL 8.2)
Assignee: Adriano Petrich
QA Contact: Sofer Athlan-Guyot
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2020-07-01 10:47 UTC by Sofer Athlan-Guyot
Modified: 2020-07-29 07:54 UTC
CC List: 11 users

Fixed In Version: openstack-tripleo-common-11.3.3-0.20200611110656.f7715be
Doc Type: Known Issue
Doc Text:
When you update or upgrade `python3-tripleoclient`, Ansible does not receive the update or upgrade, and Ansible or `ceph-ansible` tasks fail. When you update or upgrade, ensure that Ansible also receives the update so that playbook tasks can run successfully.
Clone Of:
Environment:
Last Closed: 2020-07-29 07:53:30 UTC
Target Upstream Version:
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 738989 0 None ABANDONED Remove tht/tc/ta/tv update before UC upgrade. 2021-01-21 13:30:50 UTC
OpenStack gerrit 738990 0 None MERGED Refactor ceph-ansible upgrade 2021-01-21 13:30:50 UTC
RDO 28329 0 None None None 2020-07-02 08:57:37 UTC
RDO 28338 0 None None None 2020-07-08 14:59:05 UTC
Red Hat Product Errata RHBA-2020:3148 0 None None None 2020-07-29 07:54:24 UTC

Description Sofer Athlan-Guyot 2020-07-01 10:47:25 UTC
Description of problem: We recently switched to a newer version of ansible. That uncovered that ansible itself wasn't updated during the undercloud update.

We actually explicitly exclude it:

https://github.com/openstack/tripleo-heat-templates/blob/master/deployment/undercloud/undercloud-upgrade.yaml#L212-L217

The reason this was introduced:

https://github.com/ansible/ansible/issues/56636

That issue no longer happens on 16.1 (ansible-2.8.12-1.el8ae.noarch), as shown in https://bugs.launchpad.net/tripleo/+bug/1885637/comments/9

Nevertheless, the right place to update ansible is python-tripleoclient, so that the update is taken into account for the undercloud update as well.

As a matter of fact, the version on the overcloud doesn't matter, so we don't need to check it here.

The easy workaround is to *manually* update ansible before the undercloud update.
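
The manual workaround could look roughly like the sketch below. It assumes a RHEL 8 undercloud where `dnf` is the package manager and where `openstack undercloud upgrade` is the command used to run the undercloud update; neither command is quoted in this report. The `RUN` variable and the function name are hypothetical, added only so the sketch prints the commands by default instead of executing them.

```shell
#!/usr/bin/env bash
# Sketch only: update ansible by hand, then run the undercloud update.
# RUN is a hypothetical dry-run switch. It defaults to "echo", so the
# commands are printed, not executed. Export RUN="" to really run them.
RUN="${RUN-echo}"

update_ansible_then_undercloud() {
    # Step 1: update ansible explicitly, since the undercloud update
    # excludes it (assumption: dnf is the package manager on the node).
    $RUN sudo dnf update -y ansible || return 1
    # Step 2: run the regular undercloud update (assumed command).
    $RUN openstack undercloud upgrade
}

update_ansible_then_undercloud
```

Keeping the two steps in one function makes the ordering explicit: the undercloud update only starts if the ansible update succeeded.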

Comment 3 Sofer Athlan-Guyot 2020-07-01 14:25:17 UTC
This is an effective workaround:

Add this to the heat parameters:


parameter_defaults:
    CephAnsibleEnvironmentVariables:
      ANSIBLE_SSH_CONTROL_PATH_DIR: "/tmp/ceph_ansible_control_path"

Then re-run:

openstack overcloud prepare

with the above parameter file included.

Then re-run:

openstack overcloud external-update run \
    --stack qe-Cloud-0 \
    --tags ceph 2>&1


Then:

Wednesday 01 July 2020  14:18:30 +0000 (0:00:00.192)       0:18:48.582 ******** 
skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

TASK [generate ceph-ansible group vars osds] ***********************************
Wednesday 01 July 2020  14:18:30 +0000 (0:00:00.191)       0:18:48.774 ******** 
skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

PLAY RECAP *********************************************************************
ceph-0                     : ok=4    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
ceph-1                     : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
ceph-2                     : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
compute-0                  : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
compute-1                  : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
controller-0               : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
controller-1               : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
controller-2               : ok=3    changed=1    unreachable=0    failed=0    skipped=0    rescued=0    ignored=0   
undercloud                 : ok=61   changed=17   unreachable=0    failed=0    skipped=163  rescued=0    ignored=0   

Wednesday 01 July 2020  14:18:30 +0000 (0:00:00.059)       0:18:48.833 ******** 
=============================================================================== 

Updated nodes - None
Success

Comment 4 Sofer Athlan-Guyot 2020-07-01 14:49:40 UTC
Sorry for the confusion, the previous comment and commit were for another bugzilla. Waiting on the status of https://review.opendev.org/738774

Comment 7 Alex Schultz 2020-07-02 13:47:20 UTC
https://review.rdoproject.org/r/#/c/28338/ is needed for this as well. It needs to be downstreamed as I don't believe it's pulled in automatically.

Comment 13 Ronnie Rasouli 2020-07-21 14:11:37 UTC
2020-07-19 08:28:04 | TASK [Check if ceph-ansible is installed] **************************************
2020-07-19 08:28:04 | Sunday 19 July 2020  08:27:41 +0000 (0:00:00.100)       0:00:05.971 *********** 
2020-07-19 08:28:04 | ok: [undercloud] => {"changed": false, "cmd": "rpm -q ceph-ansible || true", "delta": "0:00:00.036752", "end": "2020-07-19 08:27:42.676088", "rc": 0, "start": "2020-07-19 08:27:42.639336", "stderr": "", "stderr_lines": [], "stdout": "ceph-ansible-4.0.25-1.el8cp.noarch", "stdout_lines": ["ceph-ansible-4.0.25-1.el8cp.noarch"]}
2020-07-19 08:28:04 | 
2020-07-19 08:28:04 | TASK [Warn about missing ceph-ansible] *****************************************
2020-07-19 08:28:04 | Sunday 19 July 2020  08:27:42 +0000 (0:00:00.841)       0:00:06.812 *********** 
2020-07-19 08:28:04 | skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

The job is passing and ceph-ansible is at the latest version.
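
A check like the one in the log above can also be scripted. The following is a minimal sketch, not part of the job: the function names are hypothetical, the version comparison relies on `sort -V` from GNU coreutils, and the wanted version in the usage comment is only the ansible version mentioned in the description, not a requirement stated by the errata.

```shell
#!/usr/bin/env bash
# Sketch only: check that an installed package is at least a wanted version.

# version_at_least HAVE WANT: succeed when HAVE >= WANT in version order.
# Relies on `sort -V` (GNU coreutils) for the comparison.
version_at_least() {
    local have="$1" want="$2" lowest
    lowest="$(printf '%s\n%s\n' "$have" "$want" | sort -V | head -n 1)"
    [ "$lowest" = "$want" ]
}

# check_pkg NAME WANT: ask rpm for the installed version of NAME and
# compare it with WANT. Returns 2 when the package is not installed.
check_pkg() {
    local name="$1" want="$2" have
    have="$(rpm -q --queryformat '%{VERSION}' "$name")" || return 2
    version_at_least "$have" "$want"
}

# Usage example (the wanted version is an illustrative placeholder):
# check_pkg ansible 2.8.12 && echo "ansible is recent enough"
```

Running the same check before and after the undercloud update shows whether ansible actually received the update.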

Comment 15 errata-xmlrpc 2020-07-29 07:53:30 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3148

