Bug 1147235
| Summary: | [Block disks]After live storage migration, VM has been paused due to a storage I/O problem | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Virtualization Manager | Reporter: | lkuchlan <lkuchlan> | ||||
| Component: | ovirt-engine | Assignee: | Daniel Erez <derez> | ||||
| Status: | CLOSED DUPLICATE | QA Contact: | Aharon Canan <acanan> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 3.5.0 | CC: | amureini, ecohen, gklein, iheim, lpeer, lsurette, nsoffer, rbalakri, Rhev-m-bugs, tnisan, yeylon | ||||
| Target Milestone: | --- | ||||||
| Target Release: | 3.5.0 | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | storage | ||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2014-09-29 12:14:56 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | Storage | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
From vdsm log we can see that lv was refreshed:
4256b099-f24d-48c1-b9f2-dc9fb7aca6ad::DEBUG::2014-09-22 18:06:32,648::lvm::288::Storage.Misc.excCmd::(cmd) /usr/bin/sudo -n /sbin/lvm lvchange --config ' devices { preferred_names = ["^/dev/mapper/"] ignore_suspended_devices=1 write_cache_state=0 disable_after_error_count=3 obtain_device_list_from_udev=0 filter = [ '\''a|/dev/mapper/360060160f4a030000648d2cd2bf6e311|/dev/mapper/360060160f4a0300060aa976bfcf6e311|/dev/mapper/360060160f4a0300090f3983c20f6e311|/dev/mapper/360060160f4a03000b6d3cba523f6e311|/dev/mapper/360060160f4a03000b8264379fbf6e311|'\'', '\''r|.*|'\'' ] } global { locking_type=1 prioritise_write_locks=1 wait_for_locks=1 use_lvmetad=0 } backup { retain_min = 50 retain_days = 0 } ' --refresh c04c1246-f546-466e-824c-9fe51166b218/8c98a3ef-0af4-4bcd-a1a8-0ab78c1179da (cwd None)
And seconds later vm is paused:
libvirtEventLoop::DEBUG::2014-09-22 18:06:39,068::vm::5435::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::event Resumed detail 0 opaque None
libvirtEventLoop::INFO::2014-09-22 18:06:39,069::vm::4754::vm.Vm::(_onIOError) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::abnormal vm stop device virtio-disk0 error eio
libvirtEventLoop::DEBUG::2014-09-22 18:06:39,069::vm::5435::vm.Vm::(_onLibvirtLifecycleEvent) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::event Suspended detail 2 opaque None
libvirtEventLoop::INFO::2014-09-22 18:06:39,108::vm::4754::vm.Vm::(_onIOError) vmId=`b62fab8e-fcfc-4cc2-b398-d59aace86cd1`::abnormal vm stop device virtio-disk0 error eio
This is known issue on EL7 and Fedora 19,20.
A duplicate of bug 1127460.
*** This bug has been marked as a duplicate of bug 1127460 ***
|
Created attachment 941975 [details] image and logs Description of problem: A live storage migration operation of block disks on a VM which has in addition a bootable disk with OS installed , causes the VM the loose it's I/O capabilities,hence,the VM moves to pause mode and never comes out of even when attempting ,engine restart,vdsm restart, the only solution I've found is to power off the VM. Abstract from engine log: 2014-09-22 17:59:28,712 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-89) VM vm-migrate b62fab8e-fcfc-4cc2-b398-d59aace86cd1 moved from Up --> Paused 2014-09-22 17:59:28,743 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-89) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: VM vm-migrate has paused due to storage I/O problem. 2014-09-22 17:59:32,820 INFO [org.ovirt.engine.core.bll.tasks.AsyncTaskManager] (DefaultQuartzScheduler_Worker-97) Polling and updating Async Tasks: 3 tasks, 1 tasks to poll now 2014-09-22 17:59:32,850 INFO [org.ovirt.engine.core.bll.tasks.SPMAsyncTask] (DefaultQuartzScheduler_Worker-97) SPMAsyncTask::PollTask: Polling task 6c5468ff-af3a-48f6-8e53-3df98f1cb7a6 (Parent Version-Release number of selected component (if applicable): 3.5 How reproducible: 100% Steps to Reproduce: 1.Create VM with 2 disks, one thin and one pre-allocated 2.Install OS. 3.Live storage migrate one of them,or both. Actual results: Vm has been paused. Expected results: Vm should be up. Additional info: