Bug 1455016 - Host reboot does not trigger VM HA
Summary: Host reboot does not trigger VM HA
Keywords:
Status: CLOSED DUPLICATE of bug 1404623
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: vdsm
Version: 4.1.1
Hardware: x86_64
OS: Linux
unspecified
medium
Target Milestone: ---
: ---
Assignee: Dan Kenigsberg
QA Contact: Raz Tamir
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-05-24 03:39 UTC by Germano Veit Michel
Modified: 2021-08-30 12:48 UTC (History)
6 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-05-24 03:58:39 UTC
oVirt Team: SLA
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHV-43246 0 None None None 2021-08-30 12:30:28 UTC
Red Hat Knowledge Base (Solution) 3050321 0 None None None 2017-05-24 05:46:36 UTC

Description Germano Veit Michel 2017-05-24 03:39:13 UTC
Description of problem:

If a HA enabled VM is running in a Host and the host is gracefully rebooted, HA is not triggered because the engine sees it as "shutdown from within the Guest".

1) VM running
May 24 12:30:39 gsslab-24-32 systemd: Starting Virtual Machine qemu-2-DummyVM.

2) Reboot command
May 24 12:31:41 gsslab-24-32 systemd: Stopping Virtual Machine qemu-2-DummyVM.
May 24 12:31:41 gsslab-24-32 systemd: Stopping oVirt ImageIO Daemon...

3) VDSM:

2017-05-24 12:31:42,007+1000 INFO  (libvirt/events) [virt.vm] (vmId='f5fd1ce3-7b75-44e6-a670-a536b9dcd054') Changed state to Down: User shut down from within the guest (code=7) (vm:1207)

2017-05-24 12:31:43,017+1000 INFO  (jsonrpc/3) [jsonrpc.JsonRpcServer] RPC call VM.destroy succeeded in 0.01 seconds 
(__init__:515)

4) VDSM stopping

2017-05-24 12:31:44,985+1000 INFO  (MainThread) [dispatcher] Run and protect: prepareForShutdown(options=None) (logUtils:51)

5) Engine is notified

2017-05-24 12:31:43,010+10 DEBUG [org.ovirt.vdsm.jsonrpc.client.internal.ResponseWorker] (ResponseWorker) [] Event arrived from XXXXXX containing {"notify_time":4455448400,"f5fd1ce3-7b75-44e6-a670-a536b9dcd054":{"status":"Down","timeOffset":"0","exitReason":7,"exitMessage":"User shut down from within the guest","exitCode":0}}

2017-05-24 12:31:43,012+10 INFO  [org.ovirt.engine.core.vdsbroker.monitoring.VmAnalyzer] (ForkJoinPool-1-worker-2) [] VM 'f5fd1ce3-7b75-44e6-a670-a536b9dcd054' was reported as Down on VDS 'd406a6c4-8c8e-46c8-8bb6-0d4fd17cd470'(ovirt-H2)

Since it was "shut down from within the guest", HA is not triggered and the VM is not restarted elsewhere. 

This is not desirable as an unintentional reboot escapes the HA logic.

Version-Release number of selected component (if applicable):
vdsm-4.18.21-1.el7ev
rhevm-4.0.6.3-0.1.el7ev

Also reproduced on:
vdsm-4.19.10.1-1.el7
ovirt-engine-4.1.1.8-1.el7

How reproducible:
100%, as above

Actual results:
HA VM is down after host reboot

Expected results:
HA VM always up unless explicitly shut down by the administrator.

Comment 1 Germano Veit Michel 2017-05-24 03:51:06 UTC
I found some common elements with BZ1404623 but I'm not sure it's the same thing as that is already closed (fixed). But I can reproduce it in a vdsm version which already has https://gerrit.ovirt.org/#/c/64991/.

Comment 2 Germano Veit Michel 2017-05-24 03:58:39 UTC
My bad, had guest agent installed but not running.

Comment 3 Germano Veit Michel 2017-05-24 04:02:17 UTC

*** This bug has been marked as a duplicate of bug 1404623 ***


Note You need to log in before you can comment on or make changes to this bug.