RDO tickets are now tracked in Jira https://issues.redhat.com/projects/RDO/issues/
Bug 1225022 - When adding nodes to the cloud the update hangs and takes forever
Summary: When adding nodes to the cloud the update hangs and takes forever
Keywords:
Status: CLOSED EOL
Alias: None
Product: RDO
Classification: Community
Component: openstack-tripleo
Version: trunk
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: Kilo
Assignee: James Slagle
QA Contact: Shai Revivo
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2015-05-26 12:24 UTC by Udi Kalifon
Modified: 2016-05-19 15:37 UTC (History)
0 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-05-19 15:37:40 UTC
Embargoed:


Attachments (Terms of Use)
Log segment collected from the compute node (8.07 KB, text/plain)
2015-05-26 13:06 UTC, Udi Kalifon
no flags Details

Description Udi Kalifon 2015-05-26 12:24:06 UTC
Description of problem:
I am deploying on bare metals. The first deployment of 1 controller and 1 compute goes smoothly, but when trying to add more compute nodes or other nodes (ceph...) the stack is hung in UPDATE_IN_PROGRESS state.

I suceeded in updating the stack only once, but after that one success I fail 100% of the time.


Version-Release number of selected component (if applicable):
openstack-tripleo-0.0.6-dev1717.el7.centos.noarch
openstack-heat-engine-2015.1.1-dev8.el7.centos.noarch


How reproducible:
50%


Steps to Reproduce:
1. Deploy a default overcloud with "instack-deploy-overcloud --tuskar"
2. Add compute nodes by "export COMPUTESCALE=3" and running the deploy command again
3. Add Ceph nodes or more compute nodes and repeat the process until you get to a situation where the update never ends


I can't find additional info in the logs or by looking in the stack resources. There are no error messages - it's just stuck.

Comment 1 Udi Kalifon 2015-05-26 13:06:10 UTC
Created attachment 1029931 [details]
Log segment collected from the compute node

(03:53:29 PM) jprovazn: ukalifon: checking logs on compute node 2 - it seems that it's failing to get some heat metadata:
(03:53:39 PM) jprovazn: http://paste.openstack.org/show/237765/ (see attachment)
(03:57:08 PM) jprovazn: ukalifon: to be more specific it fails to send completion signal probably

Comment 4 Chandan Kumar 2016-05-19 15:37:40 UTC
This bug is against a Version which has reached End of Life.
If it's still present in supported release (http://releases.openstack.org), please update Version and reopen.


Note You need to log in before you can comment on or make changes to this bug.