Description of problem:

We recently switched to a newer version of ansible. This uncovered that ansible itself is not updated during the undercloud update. We explicitly exclude it:
https://github.com/openstack/tripleo-heat-templates/blob/master/deployment/undercloud/undercloud-upgrade.yaml#L212-L217

The exclusion was introduced because of:
https://github.com/ansible/ansible/issues/56636

That issue no longer happens on 16.1 (ansible-2.8.12-1.el8ae.noarch), as shown here:
https://bugs.launchpad.net/tripleo/+bug/1885637/comments/9

Nevertheless, the right way to update ansible is in python-tripleoclient, so that it is taken into account for the undercloud update as well. The version on the overcloud doesn't matter, so we don't need to check it here.

The easy workaround is to *manually* update ansible before the undercloud update.
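After the manual update it may be worth confirming which ansible build is installed before starting the undercloud update. The following is only a minimal sketch of such a check, not part of tripleoclient: it parses a package string as printed by `rpm -q` and compares the dotted version against a threshold. The function names are hypothetical, the 2.8.12 threshold is used only because it is the version cited above, and the comparison is a simplification that handles purely numeric versions and ignores the release field.

```python
def parse_nvra(package):
    """Split an RPM 'name-version-release.arch' string into its four parts."""
    nvr, arch = package.rsplit(".", 1)
    name, version, release = nvr.rsplit("-", 2)
    return name, version, release, arch


def version_tuple(version):
    """Turn a purely numeric dotted version such as '2.8.12' into (2, 8, 12)."""
    return tuple(int(part) for part in version.split("."))


def is_at_least(package, minimum):
    """True if the package version is >= minimum (the release is ignored)."""
    _, version, _, _ = parse_nvra(package)
    return version_tuple(version) >= version_tuple(minimum)


if __name__ == "__main__":
    # Package string taken from this report; MINIMUM is an illustrative threshold.
    installed = "ansible-2.8.12-1.el8ae.noarch"
    MINIMUM = "2.8.12"
    print(is_at_least(installed, MINIMUM))
```

In practice the `installed` string would come from running `rpm -q ansible` on the undercloud.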
This is an effective workaround. Add this to the heat parameters:

    parameter_defaults:
      CephAnsibleEnvironmentVariables:
        ANSIBLE_SSH_CONTROL_PATH_DIR: "/tmp/ceph_ansible_control_path"

Then re-run openstack overcloud prepare with the above parameter file included.

Then re-run:

    openstack overcloud external-update run \
      --stack qe-Cloud-0 \
      --tags ceph 2>&1

Then:

    Wednesday 01 July 2020 14:18:30 +0000 (0:00:00.192) 0:18:48.582 ********
    skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

    TASK [generate ceph-ansible group vars osds] ***********************************
    Wednesday 01 July 2020 14:18:30 +0000 (0:00:00.191) 0:18:48.774 ********
    skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

    PLAY RECAP *********************************************************************
    ceph-0 : ok=4 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    ceph-1 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    ceph-2 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    compute-0 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    compute-1 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    controller-0 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    controller-1 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    controller-2 : ok=3 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
    undercloud : ok=61 changed=17 unreachable=0 failed=0 skipped=163 rescued=0 ignored=0

    Wednesday 01 July 2020 14:18:30 +0000 (0:00:00.059) 0:18:48.833 ********
    ===============================================================================
    Updated nodes - None
    Success
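When the same run is checked repeatedly, reading the PLAY RECAP by eye gets tedious. The sketch below is one possible way to do the check in a script, assuming the recap lines keep the `host : key=value ...` shape shown above. The function names are hypothetical and the sample text is copied from the output in this comment.

```python
import re

# Matches one PLAY RECAP host line, for example:
# "ceph-0 : ok=4 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0"
RECAP_LINE = re.compile(r"^\s*(?P<host>\S+)\s*:\s*(?P<counters>(?:\w+=\d+\s*)+)$")


def parse_play_recap(text):
    """Return {host: {counter: int}} for every host line found in the text."""
    results = {}
    for line in text.splitlines():
        match = RECAP_LINE.match(line)
        if not match:
            continue  # not a recap host line (task header, timestamp, ...)
        pairs = (pair.split("=") for pair in match.group("counters").split())
        results[match.group("host")] = {key: int(value) for key, value in pairs}
    return results


def failed_hosts(recap):
    """List hosts that have failed tasks or were unreachable."""
    return sorted(
        host for host, counters in recap.items()
        if counters.get("failed", 0) > 0 or counters.get("unreachable", 0) > 0
    )


SAMPLE = """\
PLAY RECAP *********************************************************************
ceph-0 : ok=4 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
undercloud : ok=61 changed=17 unreachable=0 failed=0 skipped=163 rescued=0 ignored=0
"""

if __name__ == "__main__":
    recap = parse_play_recap(SAMPLE)
    print(failed_hosts(recap))
```

An empty list from `failed_hosts` corresponds to the all-green recap above. Any host with `failed` or `unreachable` above zero would be listed.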
Sorry for the confusion: the previous comment and commit were for another bugzilla. Waiting on the status of https://review.opendev.org/738774
https://review.rdoproject.org/r/#/c/28338/ is needed for this as well. It needs to be downstreamed as I don't believe it's pulled in automatically.
2020-07-19 08:28:04 | TASK [Check if ceph-ansible is installed] **************************************
2020-07-19 08:28:04 | Sunday 19 July 2020 08:27:41 +0000 (0:00:00.100) 0:00:05.971 ***********
2020-07-19 08:28:04 | ok: [undercloud] => {"changed": false, "cmd": "rpm -q ceph-ansible || true", "delta": "0:00:00.036752", "end": "2020-07-19 08:27:42.676088", "rc": 0, "start": "2020-07-19 08:27:42.639336", "stderr": "", "stderr_lines": [], "stdout": "ceph-ansible-4.0.25-1.el8cp.noarch", "stdout_lines": ["ceph-ansible-4.0.25-1.el8cp.noarch"]}
2020-07-19 08:28:04 |
2020-07-19 08:28:04 | TASK [Warn about missing ceph-ansible] *****************************************
2020-07-19 08:28:04 | Sunday 19 July 2020 08:27:42 +0000 (0:00:00.841) 0:00:06.812 ***********
2020-07-19 08:28:04 | skipping: [undercloud] => {"changed": false, "skip_reason": "Conditional result was False"}

The job is passing and ceph-ansible is the latest.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:3148