rhel-osp-director: 7.3->8.0 Overcloud upgrade fails on running the deployment command with major-upgrade-pacemaker-init.yaml

Environment:
openstack-tripleo-heat-templates-0.8.12-1.el7ost.noarch
openstack-tripleo-heat-templates-kilo-0.8.12-1.el7ost.noarch
openstack-puppet-modules-7.0.15-1.el7ost.noarch
instack-undercloud-2.2.6-1.el7ost.noarch

Steps to reproduce:
1. Deploy overcloud 7.3
2. Upgrade undercloud 7.3->8.0
3. Attempt to upgrade the overcloud nodes:

openstack overcloud deploy --templates tripleo-heat-templates -e tripleo-heat-templates/overcloud-resource-registry-puppet.yaml -e tripleo-heat-templates/environments/puppet-pacemaker.yaml -e tripleo-heat-templates/environments/network-isolation.yaml -e tripleo-heat-templates/environments/net-single-nic-with-vlans.yaml -e network_env.yaml -e tripleo-heat-templates/environments/major-upgrade-pacemaker-init.yaml -e rhos-release-8.yaml

Result:
The command fails:

heat resource-list -n5 overcloud|grep -v COMPLETE
+---------------------------------+--------------------------------------+------------------------------------+-----------------+---------------------+------------------------------------------------------------------------------------+
| resource_name                   | physical_resource_id                 | resource_type                      | resource_status | updated_time        | stack_name                                                                         |
+---------------------------------+--------------------------------------+------------------------------------+-----------------+---------------------+------------------------------------------------------------------------------------+
| UpdateWorkflow                  | eab49f7f-7985-47bd-9699-cd59e72dcac3 | OS::TripleO::Tasks::UpdateWorkflow | CREATE_FAILED   | 2016-03-21T19:45:11 | overcloud                                                                          |
| UpgradeInitControllerDeployment | 668f203a-5857-40c6-b23c-c9041a8850b1 | OS::Heat::SoftwareDeploymentGroup  | CREATE_FAILED   | 2016-03-21T19:45:15 | overcloud-UpdateWorkflow-2ji4kbavyqs5                                              |
| 2                               |                                      | OS::Heat::SoftwareDeployment       | CREATE_FAILED   | 2016-03-21T19:45:58 | overcloud-UpdateWorkflow-2ji4kbavyqs5-UpgradeInitControllerDeployment-otsfhz7dpip4 |
+---------------------------------+--------------------------------------+------------------------------------+-----------------+---------------------+------------------------------------------------------------------------------------+
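The `grep -v COMPLETE` filter used above can also be done programmatically, which makes it easier to pull out individual columns such as the nested stack name. The following is only a sketch: it assumes the ASCII table layout shown in this report, the function names `parse_heat_table` and `incomplete_resources` are made up for illustration, and the sample is the table above with the padding trimmed.

```python
# Sketch: parse the ASCII table printed by `heat resource-list` and keep
# the rows that `grep -v COMPLETE` would keep. Function names are
# illustrative and not part of any OpenStack client library.

SAMPLE = """\
+----------------+----------------------+---------------+-----------------+--------------+------------+
| resource_name | physical_resource_id | resource_type | resource_status | updated_time | stack_name |
+----------------+----------------------+---------------+-----------------+--------------+------------+
| UpdateWorkflow | eab49f7f-7985-47bd-9699-cd59e72dcac3 | OS::TripleO::Tasks::UpdateWorkflow | CREATE_FAILED | 2016-03-21T19:45:11 | overcloud |
| UpgradeInitControllerDeployment | 668f203a-5857-40c6-b23c-c9041a8850b1 | OS::Heat::SoftwareDeploymentGroup | CREATE_FAILED | 2016-03-21T19:45:15 | overcloud-UpdateWorkflow-2ji4kbavyqs5 |
| 2 | | OS::Heat::SoftwareDeployment | CREATE_FAILED | 2016-03-21T19:45:58 | overcloud-UpdateWorkflow-2ji4kbavyqs5-UpgradeInitControllerDeployment-otsfhz7dpip4 |
+----------------+----------------------+---------------+-----------------+--------------+------------+
"""


def parse_heat_table(text):
    """Turn the table text into a list of dicts keyed by column header."""
    header = None
    rows = []
    for raw in text.splitlines():
        line = raw.strip()
        # Border lines (+----+) and blank lines carry no data.
        if not line.startswith("|"):
            continue
        # Drop the outer pipes, then split on the inner ones. Empty cells
        # (such as a missing physical_resource_id) become empty strings.
        inner = line[1:-1] if line.endswith("|") else line[1:]
        cells = [cell.strip() for cell in inner.split("|")]
        if header is None:
            header = cells
            continue
        rows.append(dict(zip(header, cells)))
    return rows


def incomplete_resources(rows):
    """Return rows whose status does not contain COMPLETE."""
    return [r for r in rows if "COMPLETE" not in r.get("resource_status", "")]


if __name__ == "__main__":
    for row in incomplete_resources(parse_heat_table(SAMPLE)):
        print(row["resource_name"], row["resource_status"], row["stack_name"])
```

The deepest failed row (resource `2` in the `UpgradeInitControllerDeployment` group) is usually the most useful starting point, since the parent resources fail only because the child did.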
Created attachment 1138790: heat-engine.log from the undercloud.
The issue seems to be intermittent.
Hi Sasha, I had a look at the box you gave me the details for. The first thing I noticed was that you forgot to include storage-environment.yaml in the update command (you have a Ceph node, and the original deploy had -e /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml). Can you try again and include all the environment files, please? Also note that the info about undercloud resources at https://bugzilla.redhat.com/show_bug.cgi?id=1319944#c3 applies here too, so you may want a slightly bigger undercloud, especially more RAM and some swap. Thanks, marios
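A missing environment file like this is easy to overlook in a long command line, so a quick comparison of the `-e` arguments between the original deploy and the upgrade command may help. The sketch below is an assumption-laden illustration, not part of the director tooling: the helper names are invented, files are matched by basename only (the original deploy used an absolute path, the upgrade a local copy of the templates), and `ORIGINAL_FRAGMENT` contains only the one flag quoted in this comment, since the full original deploy command is not shown in the report.

```python
import os
import shlex

# Only the part of the original deploy command quoted in this thread;
# the rest of that command is not shown in the report.
ORIGINAL_FRAGMENT = (
    "openstack overcloud deploy "
    "-e /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml"
)

# The failing upgrade command from the bug description.
UPGRADE = (
    "openstack overcloud deploy --templates tripleo-heat-templates "
    "-e tripleo-heat-templates/overcloud-resource-registry-puppet.yaml "
    "-e tripleo-heat-templates/environments/puppet-pacemaker.yaml "
    "-e tripleo-heat-templates/environments/network-isolation.yaml "
    "-e tripleo-heat-templates/environments/net-single-nic-with-vlans.yaml "
    "-e network_env.yaml "
    "-e tripleo-heat-templates/environments/major-upgrade-pacemaker-init.yaml "
    "-e rhos-release-8.yaml"
)


def env_files(command):
    """Return the values passed with -e / --environment-file, in order."""
    args = shlex.split(command)
    files = []
    i = 0
    while i < len(args):
        arg = args[i]
        if arg in ("-e", "--environment-file") and i + 1 < len(args):
            files.append(args[i + 1])
            i += 2
            continue
        if arg.startswith("--environment-file="):
            files.append(arg.split("=", 1)[1])
        i += 1
    return files


def missing_env_files(original_cmd, upgrade_cmd):
    """Files from original_cmd whose basename does not appear in upgrade_cmd.

    Matching by basename is a heuristic: it treats a local copy of a
    template as equivalent to the packaged one with the same file name.
    """
    present = {os.path.basename(p) for p in env_files(upgrade_cmd)}
    return [p for p in env_files(original_cmd) if os.path.basename(p) not in present]


if __name__ == "__main__":
    for path in missing_env_files(ORIGINAL_FRAGMENT, UPGRADE):
        print("missing from upgrade command:", path)
```

Basename matching will not catch a file that was renamed between runs (for example, a locally named network environment file), so the result is a hint for manual review rather than a definitive check.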
> Can you try again and include all the environment files please (also note
> the info about undercloud resources at
> https://bugzilla.redhat.com/show_bug.cgi?id=1319944#c3 applies here too so
> you may want a slightly bigger undercloud, esp, more ram and some swap).
>

FYI, I added some swap to this env and re-ran with:

openstack overcloud deploy --templates tripleo-heat-templates -e tripleo-heat-templates/overcloud-resource-registry-puppet.yaml -e tripleo-heat-templates/environments/puppet-pacemaker.yaml -e tripleo-heat-templates/environments/storage-environment.yaml -e tripleo-heat-templates/environments/network-isolation.yaml -e tripleo-heat-templates/environments/net-single-nic-with-vlans.yaml -e network-environment.yaml -e tripleo-heat-templates/environments/major-upgrade-pacemaker-init.yaml -e rhos-release-8.yaml

2016-03-22 08:02:28 [overcloud]: UPDATE_IN_PROGRESS Stack UPDATE started
....
2016-03-22 08:08:41 [overcloud-AllNodesExtraConfig-asirvl26u7sh]: UPDATE_COMPLETE Stack UPDATE completed successfully

and it completed OK in ~6 minutes. I confirmed the presence of /root/tripleo_upgrade_node.sh on the compute and ceph nodes.
Hi Sasha, Can you confirm that Marios' steps have resolved this? Angus
waiting with pm_ack on qe feedback
Confirmed. Let's close it.