Bug 1885828

Summary: couldn't resolve module/action 'warn'
Product: Red Hat OpenStack
Reporter: Attila Fazekas <afazekas>
Component: openstack-tripleo-common
Assignee: Gaël Chamoulaud <gchamoul>
Status: CLOSED ERRATA
QA Contact: David Rosenfeld <drosenfe>
Severity: urgent
Priority: urgent
Version: 16.1 (Train)
CC: bdobreli, cjeanner, gchamoul, hbrock, jslagle, mbultel, mburns, slinaber
Target Milestone: z2
Keywords: Triaged
Target Release: 16.1 (Train on RHEL 8.2)
Flags: gchamoul: needinfo-
Hardware: Unspecified
OS: Unspecified
Fixed In Version: openstack-tripleo-common-11.4.1-1.20200914165651.el8ost
Doc Type: If docs needed, set a value
Last Closed: 2020-10-28 15:39:38 UTC
Type: Bug

Description Attila Fazekas 2020-10-07 05:26:59 UTC
Description of problem:
Overcloud install fails (1 ceph node):
TASK [Enable tuned profile] ****************************************************
Tuesday 06 October 2020  16:37:08 +0000 (0:00:00.103)       0:05:01.418 ******* 
skipping: [ceph-0] => {"changed": false, "skip_reason": "Conditional result was False"}

PLAY [External deployment step 1] **********************************************

TASK [External deployment step 1] **********************************************
Tuesday 06 October 2020  16:37:08 +0000 (0:00:00.108)       0:05:01.526 ******* 
ok: [undercloud] => {
    "msg": "Use --start-at-task \"External deployment step 1\" to resume from this task"
}

TASK [include_tasks] ***********************************************************
Tuesday 06 October 2020  16:37:09 +0000 (0:00:00.093)       0:05:01.620 ******* 
fatal: [undercloud]: FAILED! => {"reason": "couldn't resolve module/action 'warn'. This often indicates a misspelling, missing collection, or incorrect module path.\n\nThe error appears to be in '/usr/share/ansible/roles/ceph/tasks/ceph-dependencies-installed.yaml': line 6, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n\n- name: Warn about missing dependencies\n  ^ here\n"}
Overcloud configuration failed.


PLAY RECAP *********************************************************************
ceph-0                     : ok=111  changed=56   unreachable=0    failed=0    skipped=263  rescued=0    ignored=0   
compute-0                  : ok=139  changed=80   unreachable=0    failed=0    skipped=232  rescued=0    ignored=0   
controller-0               : ok=154  changed=93   unreachable=0    failed=0    skipped=224  rescued=0    ignored=0   
undercloud                 : ok=11   changed=5    unreachable=0    failed=1    skipped=0    rescued=0    ignored=0   

Version-Release number of selected component (if applicable):
Several related package version changes:

-openstack-tripleo-validations.noarch          11.3.2-1.20200914170823.4db92ba.el8ost          @rhelosp-16.1            
+openstack-tripleo-validations.noarch          11.3.2-1.20200914170825.el8ost                  @rhelosp-16.1 

-python3-tripleoclient.noarch                  12.3.2-1.20200914164926.72161e0.el8ost          @rhelosp-16.1            
-python3-tripleoclient-heat-installer.noarch   12.3.2-1.20200914164926.72161e0.el8ost          @rhelosp-16.1            
+python3-tripleoclient.noarch                  12.3.2-1.20200914164927.72161e0.el8ost          @rhelosp-16.1            
+python3-tripleoclient-heat-installer.noarch   12.3.2-1.20200914164927.72161e0.el8ost          @rhelosp-16.1  

tag: RHOS-16.1-RHEL-8-20201006.n.1


Additional info:
similar error message with different path:
https://bugs.launchpad.net/tripleo/+bug/1895507

Comment 2 Cédric Jeanneret 2020-10-07 06:03:09 UTC
The "warn" module is supposed to be shipped in validations-common. Checking whether we're missing a commit or something...

Comment 3 Cédric Jeanneret 2020-10-07 06:20:55 UTC
Weird. The "warn.py" file is provided by validations-common-1.1.2-1.20200914180304.7f3c565.el8ost.noarch.rpm, which apparently is the installed version:
validations-common.noarch                     1.1.2-1.20200914180304.7f3c565.el8ost           @rhelosp-16.1

The rpm DOES have that "warn.py" and pushes it in /usr/share/ansible/library as expected.
There's something fishy. A live env was requested and should be available in a couple of hours in order to investigate a bit further.
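
A quick way to confirm whether a module file is present in a given set of directories is sketched below. This is only an illustration: the helper name find_module is hypothetical, and the only directory taken from this report is /usr/share/ansible/library. It checks the filesystem, not what Ansible itself resolves at run time.

```python
import os


def find_module(module_name, search_dirs):
    """Return every <dir>/<module_name>.py that exists in search_dirs."""
    hits = []
    for directory in search_dirs:
        candidate = os.path.join(directory, module_name + ".py")
        if os.path.isfile(candidate):
            hits.append(candidate)
    return hits


if __name__ == "__main__":
    # /usr/share/ansible/library is the directory named in this comment.
    print(find_module("warn", ["/usr/share/ansible/library"]))
```

An empty result for the directories a process actually searches, while the file exists elsewhere, would point at a search path problem rather than a packaging problem.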

Comment 4 Cédric Jeanneret 2020-10-07 07:50:30 UTC
After some more digging with the team, we found that we're missing 3 patches in tripleoclient compared to stable/train:
- https://code.engineering.redhat.com/gerrit/214372
- https://code.engineering.redhat.com/gerrit/214373 (the one from the LP, indeed needed)
- https://code.engineering.redhat.com/gerrit/214374

Once we get them in, we should, at last, be good.

Comment 5 mathieu bultel 2020-10-07 10:45:53 UTC
The issue occurs because Ceph is called via the external deploy tasks in THT, which are triggered and run under Mistral in Train.
The Ceph playbook uses the 'warn' module, which is a validation module.
So we need to adjust the module path in the Mistral Ansible action so that it picks up the validation Ansible modules.

I have pushed a fix here:
https://review.opendev.org/756475

That should fix it.
I'm waiting for the CI before pushing the patch downstream.
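
The general idea of extending the module search path for a spawned Ansible run can be sketched as follows. This is not the code from the review above: the helper names build_library_path and ansible_env are hypothetical, and the sketch assumes the path is passed through the ANSIBLE_LIBRARY environment variable, which Ansible reads as a path-separator-delimited list of module directories.

```python
import os


def build_library_path(current, extra_dirs):
    """Append extra_dirs to a pathsep-separated string, skipping duplicates."""
    parts = [p for p in current.split(os.pathsep) if p]
    for directory in extra_dirs:
        if directory not in parts:
            parts.append(directory)
    return os.pathsep.join(parts)


def ansible_env(base_env, extra_dirs):
    """Return a copy of base_env with ANSIBLE_LIBRARY extended by extra_dirs."""
    env = dict(base_env)
    env["ANSIBLE_LIBRARY"] = build_library_path(
        env.get("ANSIBLE_LIBRARY", ""), extra_dirs
    )
    return env


if __name__ == "__main__":
    # Assumption: the validation modules live in the directory named in
    # comment 3; the real fix may use different directories.
    env = ansible_env(os.environ, ["/usr/share/ansible/library"])
    print(env["ANSIBLE_LIBRARY"])
```

The resulting dictionary could be passed as the env argument when launching ansible-playbook from a wrapper process, leaving the caller's own environment unchanged.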

Comment 8 David Rosenfeld 2020-10-09 13:34:48 UTC
This was a failure in Phase 1 which is now passing.

Comment 15 errata-xmlrpc 2020-10-28 15:39:38 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 16.1 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2020:4284