Description of problem: An overcloud update fails on the first controller at the task "Wait for containers to start for step 2 using paunch". Times out after 650 retries (rougly 30 minutes). When the overcloud update is retried without any change done on the controller, it completes without the error. Afterwards, the overcloud is fully available. Version-Release number of selected component (if applicable): 16.1.4 to 16.1.6 How reproducible: Customer ran into this issue on the first 2 clusters they updated. I have not been able to reproduce yet. Steps to Reproduce: 1. 2. 3. Actual results: Overcloud update fails on the first controller that is updated. Expected results: Overcloud update completes on the first try without error. Additional info: Attaching case with sosreport from the controller and output of ansible job.
Now that BZ#2015325 is resolved, I'll ask my customer if it's possible to test the update from 16.1.4 to 16.1.7, and if it works well we can assume this bug is a duplicate of the other and close it.
Customer confirmed on the attached case that the workaround for this issue is the same detailed on BZ#2015325 (removing mariadb-server in advance), so this BZ can be closed as duplicate.
*** This bug has been marked as a duplicate of bug 2015325 ***