Description Francesco Vollero
2016-03-02 08:06:50 UTC
Created attachment 1132164
heat-engine
Description of problem:
When I try to update an existing overcloud, I hit the most hated error in Heat: UPDATE_FAILED.
I am running these deployments on physical hardware, with a director that has 4 virtual cores (running as a KVM instance) and 32 GB of RAM, and with engine_num_workers set to 8 as suggested.
A deployment run from scratch succeeds without any issues, but an update always fails (in all 15 of my attempts).
I collected the logs from heat-engine, nova-conductor and neutron. The problem seems to be related to Neutron, but that is just speculation.
Version-Release number of selected component (if applicable):
7.3
How reproducible:
Run a deployment to update the number of compute nodes.
Steps to Reproduce:
The same as for creating a normal deployment.
Actual results:
UPDATE_FAILED
Stack failed with status: resources.Compute: ResourceInError: resources[2].resources.NovaCompute: Went to status ERROR due to "Message: Unknown, Code: Unknown"
ERROR: openstack Heat Stack update failed.
Expected results:
The deployment succeeds.
Additional info:
Comment 2 Francesco Vollero
2016-03-02 08:08:02 UTC
It looks to me like a source of this problem is in Neutron - sometimes a 404 can indicate not enough floating IPs. The next step would be to get the neutron logs to see what's causing the 404.
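To narrow down which logs to read, it can help to pull the nested resource path out of the failure string shown under "Actual results". The following is only a minimal sketch: `failing_path` is a hypothetical helper, not part of Heat or any OpenStack client, and it assumes the status reason always uses the `resources.NAME` and `resources[INDEX]` segments seen in this report.

```python
import re

# Failure reason copied from the "Actual results" section of this report.
REASON = (
    'resources.Compute: ResourceInError: '
    'resources[2].resources.NovaCompute: '
    'Went to status ERROR due to "Message: Unknown, Code: Unknown"'
)

# Matches either "resources.NAME" or "resources[INDEX]".
SEGMENT = re.compile(r'resources(?:\.(\w+)|\[(\d+)\])')


def failing_path(reason):
    """Return the nested resource names and indexes found in a status reason.

    Hypothetical helper: it only pattern-matches the text and makes no
    calls to Heat.
    """
    return [name or index for name, index in SEGMENT.findall(reason)]


if __name__ == '__main__':
    print(' -> '.join(failing_path(REASON)))
```

For the string in this report, the path is Compute, then member 2, then NovaCompute. That suggests the Nova server behind that member of the Compute group is what went to ERROR, so its fault details and the nova-conductor log are reasonable places to look alongside the Neutron logs.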
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory, and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.
https://rhn.redhat.com/errata/RHEA-2016-2948.html
Comment 17 Red Hat Bugzilla
2023-09-14 03:18:47 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days.