Bug 1147235

Summary: [Block disks]After live storage migration, VM has been paused due to a storage I/O problem
Product: Red Hat Enterprise Virtualization Manager Reporter: lkuchlan <lkuchlan>
Component: ovirt-engineAssignee: Daniel Erez <derez>
Status: CLOSED DUPLICATE QA Contact: Aharon Canan <acanan>
Severity: high Docs Contact:
Priority: unspecified    
Version: 3.5.0CC: amureini, ecohen, gklein, iheim, lpeer, lsurette, nsoffer, rbalakri, Rhev-m-bugs, tnisan, yeylon
Target Milestone: ---   
Target Release: 3.5.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: storage
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-09-29 12:14:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
image and logs none

Description lkuchlan 2014-09-28 10:08:51 UTC
Created attachment 941975 [details]
image and logs

Description of problem:
A live storage migration operation of block disks on a VM which has in addition a bootable  disk with OS installed , causes the VM the loose it's I/O capabilities,hence,the VM moves to pause mode and never comes out of even when attempting ,engine restart,vdsm restart, the only solution I've found is to power off the VM.

Abstract from engine log:

2014-09-22 17:59:28,712 INFO  [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-89) VM vm-migrate b62fab8e-fcfc-4cc2-b398-d59aace86cd1 moved from Up --> Paused
2014-09-22 17:59:28,743 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-89) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: VM vm-migrate has paused due to storage I/O problem.
2014-09-22 17:59:32,820 INFO  [org.ovirt.engine.core.bll.tasks.AsyncTaskManager] (DefaultQuartzScheduler_Worker-97) Polling and updating Async Tasks: 3 tasks, 1 tasks to poll now
2014-09-22 17:59:32,850 INFO  [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler_Worker-97) SPMAsyncTask::PollTask: Polling task 6c5468ff-af3a-48f6-8e53-3df98f1cb7a6 (Parent

Version-Release number of selected component (if applicable):
3.5

How reproducible:
100%

Steps to Reproduce:
1.Create VM with 2 disks, one thin and one pre-allocated  
2.Install OS.
3.Live storage migrate one of them,or both. 

Actual results:
Vm has been paused.

Expected results:
Vm should be up.  

Additional info:

Comment 1 Nir Soffer 2014-09-29 12:14:56 UTC
From vdsm log we can see that lv was refreshed:

4256b099-f24d-48c1-b9f2-dc9fb7aca6ad::DEBUG::2014-09-22 18:06:32,648::lvm::288::Storage.Misc.excCmd::(cmd) /usr/bin/sudo -n /sbin/lvm lvchange --config ' devices { preferred_names = ["^/dev/mapper/"] ignore_suspended_devices=1 write_cache_state=0 disable_after_error_count=3 obtain_device_list_from_udev=0 filter = [ '\''a|/dev/mapper/360060160f4a030000648d2cd2bf6e311|/dev/mapper/360060160f4a0300060aa976bfcf6e311|/dev/mapper/360060160f4a0300090f3983c20f6e311|/dev/mapper/360060160f4a03000b6d3cba523f6e311|/dev/mapper/360060160f4a03000b8264379fbf6e311|'\'', '\''r|.*|'\'' ] }  global {  locking_type=1  prioritise_write_locks=1  wait_for_locks=1  use_lvmetad=0 }  backup {  retain_min = 50  retain_days = 0 } ' --refresh c04c1246-f546-466e-824c-9fe51166b218/8c98a3ef-0af4-4bcd-a1a8-0ab78c1179da (cwd None)

And seconds later vm is paused:

libvirtEventLoop::DEBUG::2014-09-22 18:06:39,068::vm::5435::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::event Resumed detail 0 opaque None
libvirtEventLoop::INFO::2014-09-22 18:06:39,069::vm::4754::vm.Vm::(_onIOError) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::abnormal vm stop device virtio-disk0 error eio
libvirtEventLoop::DEBUG::2014-09-22 18:06:39,069::vm::5435::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::event Suspended detail 2 opaque None
libvirtEventLoop::INFO::2014-09-22 18:06:39,108::vm::4754::vm.Vm::(_onIOError) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::abnormal vm stop device virtio-disk0 error eio

This is known issue on EL7 and Fedora 19,20.

A duplicate of bug 1127460.

*** This bug has been marked as a duplicate of bug 1127460 ***