Red Hat Bugzilla – Bug 1287689
[ramdisk] Overcloud fails to deploy
Last modified: 2016-04-18 03:11:43 EDT
Description of problem:
When deploying my overcloud with OSP7 (python-rdomanager-oscplugin-0.0.10-19.el7ost.noarch), the deployment simply stops here: http://i.imgur.com/8ecvf7N.png
Talking with trown, we looked in the ironic-conductor logs, and it seems the disk was written; however, the error on the console suggests otherwise.
Working with Lucas, I used the upstream ramdisk/kernel and was able to get the baremetal nodes to install.
These nodes deployed OSP7 all day long until the recent update.
Joe, on the call you mentioned this is an OSP8 deployment, but this bugzilla lists an OSP7 version. Can you please clarify which deployment you are hitting this issue against? Thanks, Jarda
Jarda - On the call I mentioned this started once we moved to the RHEL72-based image, which is what both OSP7 and OSP8 use going forward.
I hit this in a virt environment as well when trying to deploy an overcloud a second time (delete and redeploy). I worked around it by recreating the overcloud nodes' image files:
for image in $(ls /var/lib/libvirt/images/ | grep baremetalbrbm); do qemu-img create -f qcow2 "/var/lib/libvirt/images/$image" 41G; done
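For anyone repeating the workaround above, here is a slightly more defensive sketch of the same loop. It assumes bash and the same directory, name pattern, and size as in the one-liner; the function name recreate_node_images and the DRY_RUN switch are hypothetical names added only for illustration. It uses a glob instead of parsing ls output and can print the commands before anything is overwritten.

```shell
#!/usr/bin/env bash
# Sketch only: recreate empty qcow2 disks for the virtual overcloud nodes.
# Defaults (directory, pattern, size) mirror the one-liner in this comment;
# recreate_node_images and DRY_RUN are hypothetical names, not part of any tool.

recreate_node_images() {
    local image_dir="${1:-/var/lib/libvirt/images}"
    local pattern="${2:-baremetalbrbm}"
    local size="${3:-41G}"
    local image

    for image in "$image_dir"/*"$pattern"*; do
        # If nothing matches, the glob stays unexpanded; skip it.
        [ -e "$image" ] || continue
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Preview mode: print the command instead of running it.
            echo "qemu-img create -f qcow2 $image $size"
        else
            # Replaces the existing image with an empty one; its contents are lost.
            qemu-img create -f qcow2 "$image" "$size"
        fi
    done
}
```

Running it as `DRY_RUN=1 recreate_node_images` first lists the images that would be recreated; calling `recreate_node_images` without DRY_RUN then performs the same action as the one-liner.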
Observed a similar issue, except in my case the error is about /dev/sda1 being write-protected. This happens on random nodes, and which nodes are affected varies from one deployment to the next.
In my case, it doesn't cause the deployment to fail: eventually (~15 min later), heat reboots the stuck node(s) back into the deployment kernel, and it succeeds on the second attempt.
Karthik, this does not look similar, please report separately.
*** This bug has been marked as a duplicate of bug 1296330 ***