Bug 912284
| Summary: | with resume_guests_state_on_host_boot=True rebooting host leaves VM's in Error state |
|---|---|
| Product: | Red Hat OpenStack |
| Component: | openstack-nova |
| Status: | CLOSED ERRATA |
| Severity: | high |
| Priority: | unspecified |
| Version: | 2.0 (Folsom) |
| Target Milestone: | snapshot5 |
| Target Release: | 2.1 |
| Hardware: | Unspecified |
| OS: | Unspecified |
| Reporter: | Gary Kotton <gkotton> |
| Assignee: | Brent Eagles <beagles> |
| QA Contact: | Ofer Blaut <oblaut> |
| CC: | ajeain, beagles, breeler, dallan, jhenner, ndipanov, oblaut, pbrady, sgordon |
| Fixed In Version: | openstack-nova-2012.2.3-6.el6ost |
| Doc Type: | Release Note |
| Doc Text: | Setting the configuration option resume_guests_state_on_host_boot to True (it is False by default) is not recommended. Setting it to True causes problems with re-spawning instances when many services are being restarted simultaneously. This usually occurs when the services are running on the same host that gets restarted. |
| Last Closed: | 2013-04-04 20:21:16 UTC |
| Type: | Bug |
| Bug Blocks: | 920704 |
Comment 3
Ofer Blaut
2013-03-12 10:01:15 UTC
The resume_guests_state_on_host_boot config option we introduced appears to have issues when the whole host is restarted. We are investigating and hope to know more soon and propose an actual fix. As a workaround, we will disable this option by default. We have also opened a new bug, #920704, to track progress on the actual fix.

To test this, restart the node and make sure that nova does not attempt to bring instances online immediately.

Please note that setting resume_guests_state_on_host_boot to True is now something we want our customers to avoid until we get a full fix. Note that this option is off by default upstream and we had changed it for RHOS. This workaround changes it back to the upstream default.

There is actually a good argument for leaving it off by default anyway. Many deployments would likely prefer it that way. Having a node go down with running instances on it is a failure, and applications using the cloud would likely have moved on, treated the failed instances as gone, and spawned new ones. For those types of applications, automatically trying to restart their instances may not be what they want. So right now I'm thinking that once we turn this off, we should just leave it that way, but we should still get to the bottom of this and fix it, since some deployments may still want to turn it on.

The default is resume_guests_state_on_host_boot = false. After reboot, all VMs are in shutoff state. Tested on openstack-nova-compute-2012.2.3-7.el6ost.noarch.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHSA-2013-0709.html
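
For anyone auditing a compute node against the recommendation in this bug, here is a minimal sketch of a check for whether resume_guests_state_on_host_boot is turned on. It is not part of the fix or of nova itself. It assumes the option lives in the [DEFAULT] section of an INI-style nova.conf, and the function name `resume_guests_enabled` is made up for this illustration.

```python
import configparser

# Option name as given in this bug report.
OPTION = "resume_guests_state_on_host_boot"


def resume_guests_enabled(conf_text: str) -> bool:
    """Return True if the option is explicitly enabled in the given config text.

    Assumption: the option is set in the [DEFAULT] section of an INI-style
    nova.conf. When the option is absent we treat it as off, matching the
    default described in this bug.
    """
    # interpolation=None: config values may contain '%' (e.g. log formats).
    # strict=False: tolerate duplicate options instead of raising.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(conf_text)
    return parser.getboolean("DEFAULT", OPTION, fallback=False)
```

To use it, read the node's configuration file (commonly /etc/nova/nova.conf, though the path may differ per deployment), pass the contents to `resume_guests_enabled`, and flag the node if the result is True.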