Bug 1805986

Summary: Deployment fails with Websocket timeout
Product: Red Hat OpenStack Reporter: Eliad Cohen <elicohen>
Component: python-tripleoclientAssignee: Emilien Macchi <emacchi>
Status: CLOSED DUPLICATE QA Contact: Sasha Smolyak <ssmolyak>
Severity: high Docs Contact:
Priority: medium    
Version: 16.0 (Train)CC: aschultz, bdobreli, elicohen, emacchi, hbrock, jjoyce, jschluet, jslagle, mburns, pkesavar, slinaber, tvignaud
Target Milestone: ---Keywords: Triaged, ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-02-24 13:58:46 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 2 Alex Schultz 2020-02-21 21:46:04 UTC
The sosreport is empty. Please provide one with logs

Comment 6 Alex Schultz 2020-02-21 22:43:47 UTC
This undercloud is not in a good state. The mistral executor service is not running which is why the overcloud deployment fails. No mistral workflows currently work.  It looks like the undercloud install hung at step3 but reported sucessful (per the install-undercloud.log)

------------------------------------------------------------------------------- 9.81s
2020-02-21 11:55:27.227 712548 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin] Creating container startup configs for step_3 -------------------------------------------------------------------------------------------------------------------------------------------------------------- 9.54s
2020-02-21 11:55:28.248 712548 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin] Install artifact is located at /home/stack/undercloud-install-20200221165527.tar.bzip2
2020-02-21 11:55:28.248 712548 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin] 
########################################################

Deployment successful!

########################################################

2020-02-21 11:55:28.248 712548 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin] Writing the stack virtual update mark file /var/lib/tripleo-heat-installer/update_mark_undercloud
2020-02-21 11:55:28.398 711841 WARNING tripleoclient.v1.undercloud.InstallUndercloud [  ] 
##########################################################

The Undercloud has been successfully installed.

Useful files:

Password file is at ~/undercloud-passwords.conf
The stackrc file is at ~/stackrc

Use these files to interact with OpenStack services, and
ensure they are secured.

##########################################################

Comment 10 Alex Schultz 2020-02-21 23:06:55 UTC
So the mistral executor container is in a bad state and could not be removed. However this should have been a failure.

As part of the deployment we see,

2020-02-21 18:00:26.599 341233 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin]         "Error removing container: mistral_executor",
2020-02-21 18:00:26.599 341233 WARNING tripleoclient.v1.tripleo_deploy.Deploy [  admin]         "Error: cannot remove container fa94a72147eb66497f62baf346279a5f9f827e96f9874afd2cf8c6b75fa8910b as it has active exec sessions: container state improper",


However this was not fatal.  I think there is a different bug around this error and will have to dig it up next week.

Comment 11 Bogdan Dobrelya 2020-02-24 10:01:52 UTC
We need another backport https://review.opendev.org/#/c/708881 (https://code.engineering.redhat.com/gerrit/#/c/192577/)

Comment 12 Bogdan Dobrelya 2020-02-24 13:58:46 UTC

*** This bug has been marked as a duplicate of bug 1802573 ***