1544718 – [RFE] Improve high-availability of paused VMs due to IO error with a lease

Summary: [RFE] Improve high-availability of paused VMs due to IO error with a lease

Keywords:
Status:	CLOSED NOTABUG
Alias:	None
Product:	ovirt-engine
Classification:	oVirt
Component:	BLL.Virt
Sub Component:
Version:	4.2.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	high
Target Milestone:	---
Target Release:	---
Assignee:	Arik
QA Contact:	meital avital
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2018-02-13 11:37 UTC by Arik
Modified:	2021-08-23 11:51 UTC
CC List:	1 user
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2021-08-23 11:51:50 UTC
oVirt Team:	Virt
Embargoed:
Dependent Products:
Flags:	rbarry: ovirt-4.5? rule-engine: planning_ack? rule-engine: devel_ack? rule-engine: testing_ack?

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Bugzilla	1540548	0	high	CLOSED	[RFE] Automatically restart HA VMs paused due to I/O Error	2021-09-09 13:09:55 UTC

Internal Links: 1540548

Description Arik 2018-02-13 11:37:55 UTC

We can improve high-availability of VMs that are paused due to an IO error and are configured with a lease. We can try restarting them on other hosts (assuming the problem lies between the storage device and the particular host the VM runs on, the lease will protect us from a split-brain) and change the resume-behavior of those VMs, which is currently set to KILL, to AUTO-RESUME.

Comment 1 Michal Skrivanek 2018-02-14 15:27:00 UTC

Needs to be discussed in the context of bug 1540548.

Comment 2 Ryan Barry 2019-01-21 14:53:43 UTC

Re-targeting to 4.3.1 since it is missing a patch, an acked blocker flag, or both.

Comment 3 Michal Skrivanek 2021-08-20 08:27:40 UTC

This bug/RFE is more than 2 years old, has not received enough attention so far, and is now flagged as pending close.
Please review whether it is still relevant and provide additional details/justification/patches if you believe it should get more attention for the next oVirt release.

Comment 4 Arik 2021-08-23 11:51:50 UTC

Having the resume-behavior always set to KILL when a VM lease is defined renders this bz redundant: the VM will be destroyed automatically and then restarted by the engine elsewhere (this also applies when there is connectivity between ovirt-engine and the host).
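
For readers following the thread, the behavior described in this comment can be sketched roughly as below. This is only an illustrative Python model, not ovirt-engine code: the names `Vm`, `ResumeBehavior`, `effective_resume_behavior` and `handle_io_error_pause` are hypothetical, and picking the first other host as the restart target is an assumption made for the example (the real engine uses its scheduler).

```python
from dataclasses import dataclass
from enum import Enum


class ResumeBehavior(Enum):
    # Hypothetical model of the resume-behavior setting discussed in this bug.
    AUTO_RESUME = "auto_resume"
    LEAVE_PAUSED = "leave_paused"
    KILL = "kill"


@dataclass
class Vm:
    name: str
    has_lease: bool
    resume_behavior: ResumeBehavior
    host: str


def effective_resume_behavior(vm):
    # Per comment 4: when a VM lease is defined, the behavior is always KILL.
    return ResumeBehavior.KILL if vm.has_lease else vm.resume_behavior


def handle_io_error_pause(vm, available_hosts):
    """Return (action, target_host) for a VM paused due to an IO error."""
    behavior = effective_resume_behavior(vm)
    if behavior is ResumeBehavior.KILL:
        if vm.has_lease:
            # The lease guards against split-brain, so the VM can be
            # restarted on a different host after it is destroyed.
            others = [h for h in available_hosts if h != vm.host]
            return ("destroy_and_restart", others[0] if others else None)
        # What happens to a killed VM without a lease is outside this bug.
        return ("destroy", None)
    if behavior is ResumeBehavior.AUTO_RESUME:
        return ("resume_when_storage_returns", vm.host)
    return ("leave_paused", vm.host)
```

In this model a leased VM ends up on the destroy-and-restart path regardless of its configured setting, which is why switching leased VMs to AUTO-RESUME, as originally proposed, would not add anything.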
