Bug 1018876 - Power on VM fails after blocking HSM connectivity to SDs while LSM and powering off the VM
Summary: Power on VM fails after blocking HSM connectivity to SDs while LSM and poweri...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine
Version: 3.3.0
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
: 3.3.0
Assignee: Federico Simoncelli
QA Contact: Elad
URL:
Whiteboard: storage
Depends On: 1018867
Blocks: 3.3snap3
TreeView+ depends on / blocked
 
Reported: 2013-10-14 15:05 UTC by vvyazmin@redhat.com
Modified: 2016-02-10 18:19 UTC (History)
9 users (show)

Fixed In Version: is24
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:
oVirt Team: Storage
Target Upstream Version:
Embargoed:
abaron: Triaged+


Attachments (Terms of Use)
## Logs rhevm, vdsm, libvirt, thread dump, superVdsm (iSCSI) (4.85 MB, application/x-gzip)
2013-10-14 15:05 UTC, vvyazmin@redhat.com
no flags Details


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 20669 0 None None None Never
oVirt gerrit 21350 0 None None None Never
oVirt gerrit 21351 0 None None None Never

Description vvyazmin@redhat.com 2013-10-14 15:05:02 UTC
Created attachment 812070 [details]
## Logs rhevm, vdsm, libvirt, thread dump, superVdsm (iSCSI)

Description of problem:
Failed power-on VM after disconnections Storage Domain

Version-Release number of selected component (if applicable):
RHEVM 3.3 - IS18 environment:

Host OS: RHEL 6.5

RHEVM:  rhevm-3.3.0-0.25.beta1.el6ev.noarch
PythonSDK:  rhevm-sdk-python-3.3.0.15-1.el6ev.noarch
VDSM:  vdsm-4.13.0-0.2.beta1.el6ev.x86_64
LIBVIRT:  libvirt-0.10.2-27.el6.x86_64
QEMU & KVM:  qemu-kvm-rhev-0.12.1.2-2.412.el6.x86_64
SANLOCK:  sanlock-2.8-1.el6.x86_64

How reproducible:
unknow

Steps to Reproduce:
1. Create iSCSI Data Center with two hosts connected to multiple Storage Domain (SD)
2. Create and run a vm from template with OS installed on it, run on HSM.
3. LSM the vm disk and block connectivity (via iptables) to all domains from the HSM host
* HSM - non operational
* VM - in pause state
4. When the vm pauses remove the iptables block from the hsm host
* HSM - up
* VM - up and running. OS running, and no problem connect to it.
5. Power Off VM
6. Try power-on VM

Actual results:
Failed power-on VM

Expected results:
Secceed power-on VM

Impact on user:
Failed power-on VM

Workaround:
Restart ovirt-engine

Additional info:

/var/log/ovirt-engine/engine.log

2013-10-14 14:59:51,305 WARN  [org.ovirt.engine.core.bll.RunVmCommand] (ajp-/127.0.0.1:8702-2) [3a68d9d7] CanDoAction of action RunVm failed. Reasons:VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_DISK_IS_BEING_MIGRATED,$DiskName vm001_Disk1

/var/log/vdsm/vdsm.log

Comment 1 Sergey Gotliv 2013-10-16 14:25:21 UTC
This bug is related to BZ#1018867. 

VM is locked, because Engine assumes that VM's disks are being migrated, but migration is actually already failed.

It can be closed after BZ#1018867 will be resolved.

Comment 3 Ayal Baron 2013-11-14 10:54:22 UTC
(In reply to Sergey Gotliv from comment #1)
> This bug is related to BZ#1018867. 
> 
> VM is locked, because Engine assumes that VM's disks are being migrated, but
> migration is actually already failed.
> 
> It can be closed after BZ#1018867 will be resolved.

Bug 1018867 is targeted for rhev 3.4 while this bug is currently targeted for 3.3.
Also, this bug is referencing a patch that has been merged upstream already, so please explain whether there is a separate solution for this bug or it needs to be pushed out to 3.4.

Comment 4 Aharon Canan 2013-11-25 13:21:32 UTC
Sergey, 

please clarify so we will verify it.

Comment 5 Elad 2013-12-04 13:19:14 UTC
Followed the reproduction steps.
Power on VM for the second time after it was in paused state works fine.

Verified with is25
vdsm-4.13.0-0.10.beta1.el6ev.x86_64
rhevm-3.3.0-0.37.beta1.el6ev.noarch

Comment 6 Itamar Heim 2014-01-21 22:29:13 UTC
Closing - RHEV 3.3 Released

Comment 7 Itamar Heim 2014-01-21 22:29:17 UTC
Closing - RHEV 3.3 Released

Comment 8 Itamar Heim 2014-01-21 22:32:11 UTC
Closing - RHEV 3.3 Released


Note You need to log in before you can comment on or make changes to this bug.