Created attachment 1263617 [details]
Description of problem:
OSP10 -> OSP11 upgrade fails when Ironic services are enabled on a monolithic controller deployment. The major-upgrade-composable-steps fails after 4h which is the stack update timeout which indicates that it's getting stuck. The cause for this is that during the upgrade process the openstack-nova-compute.service is trying to start on the controller nodes while the rabbitmq servers are down(pacemaker cluster is down).
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Deploy OSP10 with monolithic controllers and Ironic overcloud services activated
2. Run OSP10->OSP11 upgrade
Deployment fails during major-upgrade-composable-steps, after 4h timeout.
Deployment doesn't get stuck.
Note: this could be the same issue as with BZ#1431988 but I reported it as a separate one because the manifestation is different (timeout vs failure) and the topology where it shows up is different.
Waiting to see if review in https://bugzilla.redhat.com/show_bug.cgi?id=1431988 fixes this one two.
*** Bug 1431988 has been marked as a duplicate of this bug. ***
Pointing to stable/ocata.
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory, and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.