Description of problem: The validator undercloud-service-status.yaml is supposed to alert when openstack-heat-engine is not running, but instead it hangs and the user doesn't get any clear error message. Version-Release number of selected component (if applicable): openstack-tripleo-validations-7.4.2-1.el7ost.noarch How reproducible: 100% Steps to Reproduce: 1. sudo systemctl stop openstack-heat-engine 2. ANSIBLE_RETRY_FILES_ENABLED=false ansible-playbook -i /usr/bin/tripleo-ansible-inventory /usr/share/openstack-tripleo-validations/validations/undercloud-service-status.yaml Actual results: Validator hangs. Expected results: A quick and clear error message, that openstack-heat-engine is down.
After some tests and investigation, the issue is coming from the fact that the tripleo-ansible-inventory script is getting all the inventory informations from Heat. So if the heat-engine is stopped or not working, no validations are not working at all! Don't have any solution for the moment.
Tested and verified with openstack-tripleo-validations-9.3.2-0.20190420045627.361061f.el7ost.noarch (undercloud) [stack@undercloud ~]$ sudo docker stop heat_engine heat_engine (undercloud) [stack@undercloud ~]$ ANSIBLE_RETRY_FILES_ENABLED=false ansible-playbook -i /usr/bin/tripleo-ansible-inventory /usr/share/openstack-tripleo-validations/validations/undercloud-service-status.yaml -vvvv ansible-playbook 2.6.19 config file = /etc/ansible/ansible.cfg configured module search path = [u'/home/stack/.ansible/plugins/modules', u'/usr/share/ansible/plugins/modules'] ansible python module location = /usr/lib/python2.7/site-packages/ansible executable location = /usr/bin/ansible-playbook python version = 2.7.5 (default, Aug 7 2019, 00:51:29) [GCC 4.8.5 20150623 (Red Hat 4.8.5-39)] Using /etc/ansible/ansible.cfg as config file setting up inventory plugins [WARNING]: * Failed to parse /usr/bin/tripleo-ansible-inventory with script plugin: Inventory script (/usr/bin/tripleo-ansible-inventory) had an execution error: ERROR: Error creating inventory: Request to https://192.168.24.2:13004/v1/a3b547b7b3ac400495abacba4f0c89c9/stacks/overcloud timed out The inventory script is timing out after 30 sec if heat-engine is not up and working!
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2019:3745