Description of problem: Overcloud install fails (1 ceph node): TASK [Enable tuned profile] **************************************************** Tuesday 06 October 2020 16:37:08 +0000 (0:00:00.103) 0:05:01.418 ******* skipping: [ceph-0] => {"changed": false, "skip_reason": "Conditional result was False"} PLAY [External deployment step 1] ********************************************** TASK [External deployment step 1] ********************************************** Tuesday 06 October 2020 16:37:08 +0000 (0:00:00.108) 0:05:01.526 ******* ok: [undercloud] => { "msg": "Use --start-at-task \"External deployment step 1\" to resume from this task" } TASK [include_tasks] *********************************************************** Tuesday 06 October 2020 16:37:09 +0000 (0:00:00.093) 0:05:01.620 ******* fatal: [undercloud]: FAILED! => {"reason": "couldn't resolve module/action 'warn'. This often indicates a misspelling, missing collection, or incorrect module path.\n\nThe error appears to be in '/usr/share/ansible/roles/ceph/tasks/ceph-dependencies-installed.yaml': line 6, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n\n- name: Warn about missing dependencies\n ^ here\n"} Overcloud configuration failed. PLAY RECAP ********************************************************************* ceph-0 : ok=111 changed=56 unreachable=0 failed=0 skipped=263 rescued=0 ignored=0 compute-0 : ok=139 changed=80 unreachable=0 failed=0 skipped=232 rescued=0 ignored=0 controller-0 : ok=154 changed=93 unreachable=0 failed=0 skipped=224 rescued=0 ignored=0 undercloud : ok=11 changed=5 unreachable=0 failed=1 skipped=0 rescued=0 ignored=0 Version-Release number of selected component (if applicable): Several related package version changes: -openstack-tripleo-validations.noarch 11.3.2-1.20200914170823.4db92ba.el8ost @rhelosp-16.1 +openstack-tripleo-validations.noarch 11.3.2-1.20200914170825.el8ost @rhelosp-16.1 -python3-tripleoclient.noarch 12.3.2-1.20200914164926.72161e0.el8ost @rhelosp-16.1 -python3-tripleoclient-heat-installer.noarch 12.3.2-1.20200914164926.72161e0.el8ost @rhelosp-16.1 +python3-tripleoclient.noarch 12.3.2-1.20200914164927.72161e0.el8ost @rhelosp-16.1 +python3-tripleoclient-heat-installer.noarch 12.3.2-1.20200914164927.72161e0.el8ost @rhelosp-16.1 tag: RHOS-16.1-RHEL-8-20201006.n.1 Additional info: similar error message with different path: https://bugs.launchpad.net/tripleo/+bug/1895507
The "warn" is supposed to be shipped in validations-common. Checking if we're not missing a commit or something...
weird. the "warn.py" is provided by validations-common-1.1.2-1.20200914180304.7f3c565.el8ost.noarch.rpm - which happens to be the installed version, apparently: validations-common.noarch 1.1.2-1.20200914180304.7f3c565.el8ost @rhelosp-16.1 The rpm DOES have that "warn.py" and pushes it in /usr/share/ansible/library as expected. There's something fishy. A live env was requested and should be available in a couple of hours in order to investigate a bit further.
After some more digging with the Team, we're missing 3 patches in tripleoclient, compared to stable/train: - https://code.engineering.redhat.com/gerrit/214372 - https://code.engineering.redhat.com/gerrit/214373 (the one from the LP, indeed needed) - https://code.engineering.redhat.com/gerrit/214374 Once we get them in, we should, at last, be good.
The issue is due because Ceph is called via the external deploy tasks in THT, which is triggered and run under Mistral in Train. Ceph playbook is using 'warn' module, which is a validation module. So we need to adjust the path in mistral Ansible action in order to get the validation Ansible modules. I have pushed a fix is here: https://review.opendev.org/756475 It should be correct then. I'm waiting for the CI to push downstream the patch.
This was a failure in Phase 1 which is now passing.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenStack Platform 16.1 bug fix and enhancement advisory), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2020:4284