Bug 1386115

Summary: OSP-9/10 upgrades times out during compute node upgrade.
Product: Red Hat OpenStack Reporter: Sofer Athlan-Guyot <sathlang>
Component: openstack-tripleo-heat-templatesAssignee: Michele Baldessari <michele>
Status: CLOSED CURRENTRELEASE QA Contact: Marius Cornea <mcornea>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 10.0 (Newton)CC: augol, jjoyce, jschluet, mandreou, mburns, ohochman, rhel-osp-director-maint, sathlang
Target Milestone: ---Keywords: TestOnly, Triaged, ZStream
Target Release: 10.0 (Newton)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openstack-tripleo-heat-templates-5.1.0-7.el7ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-05-02 20:56:14 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1337794    

Description Sofer Athlan-Guyot 2016-10-18 08:22:42 UTC
Description of problem: Upgrading OSP9 to OSP10, the upgrade compute nodes step fails: 
  
  upgrade-non-controller.sh --upgrade overcloud-compute-0

It hangs duing yum upgrade with the nova-compute service waiting for the nova-conductor.

2016-10-18 08:10:37.809 11758 WARNING nova.conductor.api [req-c09c5bb4-dba4-4b9d-8fef-f419e4794550 - - - - -] Timed out waiting for nova-conductor.  Is it running? Or d[140/386]
ervice start before nova-conductor?  Reattempting establishment of nova-conductor connection...
2016-10-18 08:11:37.812 11758 WARNING nova.conductor.api [req-c09c5bb4-dba4-4b9d-8fef-f419e4794550 - - - - -] Timed out waiting for nova-conductor.  Is it running? Or did this s
ervice start before nova-conductor?  Reattempting establishment of nova-conductor connection...

Version-Release number of selected component (if applicable):  openstack-tripleo-heat-templates.noarch 5.0.0-0.20161003064637.d636e3a.1.2.el7ost @rhelosp-10.0-puddle

Comment 1 Sofer Athlan-Guyot 2016-10-18 08:24:05 UTC
Adding needed upstream review.

Comment 2 Marios Andreou 2016-10-18 09:37:02 UTC
marking as blocking the upgrades RFE and moving to POST as it is in newton already

Comment 3 Marios Andreou 2016-10-18 09:47:35 UTC
assigning to Michele since he authored the gerrit review that fixed this.

Also note that this was a bug I introduced with https://review.openstack.org/#/c/382748/1/extraconfig/tasks/major_upgrade_controller_pacemaker_3.sh 

which is a recent merge into master/newton.... that is, this bug does not exist in any current product so there should be no need for doctext (@bandini just making sure you don't have to spend any more time on this by me assigning to you... if you do, feel free to give it back)

Comment 4 Sofer Athlan-Guyot 2016-10-18 11:26:53 UTC
As a side note you need the upstream patch in this https://bugzilla.redhat.com/show_bug.cgi?id=1381628 for this upstream review to apply cleanly.

Comment 9 Jon Schlueter 2017-01-12 21:21:33 UTC
The above mentioned patch was included in code shipped at GA

Comment 10 Jon Schlueter 2017-01-19 19:02:23 UTC
According to our records, this should be resolved by openstack-tripleo-heat-templates-5.1.0-7.el7ost.  This build is available now.

Comment 13 Amit Ugol 2017-04-05 10:13:35 UTC
z3 comes with a newer version. Upgrading seems to be working better.