1047863 – Openstack-Nova: Unpause instance after host reboot fails

Bug 1047863 - Openstack-Nova: Unpause instance after host reboot fails

Summary: Openstack-Nova: Unpause instance after host reboot fails

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	openstack-nova
Sub Component:
Version:	4.0
Hardware:	x86_64
OS:	Linux
Priority:	medium
Severity:	high
Target Milestone:	z2
Target Release:	4.0
Assignee:	Xavier Queralt
QA Contact:	Gabriel Szasz
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2014-01-02 12:10 UTC by Tzach Shefi
Modified:	2022-07-09 06:27 UTC (History)
CC List:	7 users (show)
Fixed In Version:	openstack-nova-2013.2.1-4.el6ost
Doc Type:	Bug Fix
Doc Text:	Previously, rebooting a host caused all its instances to stop and change to the SHUTDOWN power state. The unpause action was only allowed on instances with PAUSED power state. As a consequence, it was impossible to unpause/start/reboot an instance that had been paused before the reboot. As it makes no sense to unpause an instance once it has been stopped, a check has been added in a periodic task that will fix the state and stop instances that are in SHUTDOWN/CRASHED power state but are still in the PAUSED state. As a result users can now start those instances again, preventing them from being stuck in an inconsistent state.
Clone Of:
Environment:	RHEL RHOS Havana
Last Closed:	2014-03-04 19:04:46 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
compute.log try to unpuase after reboot (80.03 KB, application/x-gzip) 2014-01-02 12:10 UTC, Tzach Shefi	no flags	Details
View All

Links
System	ID	Priority	Status	Summary	Last Updated
Launchpad	1265494	None	None	None	Never
OpenStack gerrit	68690	None	MERGED	Correct the state for PAUSED instances on reboot	2020-12-03 15:05:28 UTC
Red Hat Issue Tracker	OSP-16397	None	None	None	2022-07-09 06:27:45 UTC
Red Hat Product Errata	RHSA-2014:0231	normal	SHIPPED_LIVE	Moderate: openstack-nova security and bug fix update	2014-03-05 00:00:29 UTC

Description Tzach Shefi 2014-01-02 12:10:32 UTC

Created attachment 844520 [details]
compute.log try to unpuase after reboot

Description of problem:
Unpauseing an instance fails if host has rebooted. 

Version-Release number of selected component (if applicable):
RHEL: release 6.5 (Santiago)
RHOS: Havana
openstack-nova-api-2013.2.1-1.el6ost.noarch
openstack-nova-compute-2013.2.1-1.el6ost.noarch
openstack-nova-scheduler-2013.2.1-1.el6ost.noarch
openstack-nova-common-2013.2.1-1.el6ost.noarch
openstack-nova-console-2013.2.1-1.el6ost.noarch
openstack-nova-conductor-2013.2.1-1.el6ost.noarch
openstack-nova-novncproxy-2013.2.1-1.el6ost.noarch
openstack-nova-cert-2013.2.1-1.el6ost.noarch

How reproducible:
Every time 

Steps to Reproduce:
1. Boot an instance 
2. Pause that instance
3. Reboot host
4. Unpause instance  

Actual results:
can't unpause instance stuck in status paused, power state - shutdown

Expected results:
Instance should unpause, return to running state

Additional info:

virsh list -all --managed-save 
ID is missing from paused instance "-" (pausecirros), state -> shut off.

[root@orange-vdse ~(keystone_admin)]# virsh list --all --managed-save
 Id    Name                           State
----------------------------------------------------
 1     instance-00000003              running
 2     instance-00000002              running
 -     instance-00000001              shut off

[root@orange-vdse ~(keystone_admin)]# nova list  (notice nova status paused)
+--------------------------------------+---------------+--------+------------+-------------+-----------------+
| ID                                   | Name          | Status | Task State | Power State | Networks        |
+--------------------------------------+---------------+--------+------------+-------------+-----------------+
| ebe310c2-d715-45e5-83b6-32717af1ac90 | cirros        | ACTIVE | None       | Running     | net=192.168.1.4 |
| 3ef89feb-414f-4524-b806-f14044efdb14 | pausecirros   | PAUSED | None       | Shutdown    | net=192.168.1.5 |
| 8bcae041-2f92-4ae2-a2c2-ee59b067ac76 | suspendcirros | ACTIVE | None       | Running     | net=192.168.1.2 |
+--------------------------------------+---------------+--------+------------+-------------+-----------------+


Testing without rebooting host, ID/state ("1"/paused) instance (cirros) are ok and it unpauses ok.

[root@orange-vdse ~(keystone_admin)]# virsh list --all --managed-save
 Id    Name                           State
----------------------------------------------------
 1     instance-00000003              paused
 2     instance-00000002              running
 -     instance-00000001              shut off
+--------------------------------------+---------------+--------+------------+-------------+-----------------+
| ID                                   | Name          | Status | Task State | Power State | Networks        |
+--------------------------------------+---------------+--------+------------+-------------+-----------------+
| ebe310c2-d715-45e5-83b6-32717af1ac90 | cirros        | PAUSED | None       | Paused      | net=192.168.1.4 |
| 3ef89feb-414f-4524-b806-f14044efdb14 | pausecirros   | PAUSED | None       | Shutdown    | net=192.168.1.5 |
| 8bcae041-2f92-4ae2-a2c2-ee59b067ac76 | suspendcirros | ACTIVE | None       | Running     | net=192.168.1.2 |
+--------------------------------------+---------------+--------+------------+-------------+-----------------+

Comment 5 errata-xmlrpc 2014-03-04 19:04:46 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHSA-2014-0231.html

Note You need to log in before you can comment on or make changes to this bug.