Created attachment 1428065 [details] logs Description of problem: A live storage migration operation sometimes fails with 'Resource unavailable' on vdsm. Version-Release number of selected component (if applicable): vdsm-4.20.27-1.el7ev.x86_64 libvirt-3.9.0-14.el7_5.3.x86_64 qemu-img-rhev-2.10.0-21.el7_5.2.x86_64 glusterfs-3.8.4-54.6.el7.x86_64 sanlock-3.6.0-1.el7.x86_64 RHEL7.5 kernel 3.10.0-862.el7.x86_64 ovirt-engine-4.2.3.3-0.1.el7.noarch How reproducible: Steps to Reproduce: 1. Create a VM and attatch 4 disks to it reside on iSCSI 2. Start the VM 3. Migrate all disks to different type storage domains (NFS or Gluster) Actual results: 2018-04-27 21:44:16,971+0300 INFO (jsonrpc/1) [api.virt] START diskReplicateFinish(srcDisk={u'device': u'disk', u'poolID': u'b21d3b46-4a16-11e8-9dfd-00163e7be000', u'volumeID': u'728b3222-45f4-483a-adc2-e56cbc7 37750', u'domainID': u'3b804599-2145-411f-961c-8027c4b3bd82', u'imageID': u'47379013-8220-4fb6-a49d-3a00781ee41f'}, dstDisk={u'device': u'disk', u'poolID': u'b21d3b46-4a16-11e8-9dfd-00163e7be000', u'volumeID': u '728b3222-45f4-483a-adc2-e56cbc737750', u'domainID': u'd39cefc8-0138-4960-997d-f8ed132c618a', u'imageID': u'47379013-8220-4fb6-a49d-3a00781ee41f'}) from=::ffff:10.46.16.190,57308, flow_id=disks_syncAction_661ec4 31-2cfe-44f9, vmId=432dd185-08ef-4afc-a9e6-daecf8f370ee (api:46) 2018-04-27 21:44:16,973+0300 ERROR (jsonrpc/1) [virt.vm] (vmId='432dd185-08ef-4afc-a9e6-daecf8f370ee') Replication job unfinished (drive: 'vda', srcDisk: {u'device': u'disk', u'poolID': u'b21d3b46-4a16-11e8-9dfd-00163e7be000', u'volumeID': u'728b3222-45f4-483a-adc2-e56cbc737750', u'domainID': u'3b804599-2145-411f-961c-8027c4b3bd82', u'imageID': u'47379013-8220-4fb6-a49d-3a00781ee41f'}, job: {'end': 786432L, 'bandwidth': 0L, 'type': 2, 'cur': 655360L}) (vm:4685) 2018-04-27 21:44:16,973+0300 INFO (jsonrpc/1) [api.virt] FINISH diskReplicateFinish return={'status': {'message': 'Resource unavailable', 'code': 40}} from=::ffff:10.46.16.190,57308, flow_id=disks_syncAction_661ec431-2cfe-44f9, vmId=432dd185-08ef-4afc-a9e6-daecf8f370ee (api:52) engine.log: 2018-04-27 21:44:16,992+03 ERROR [org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand] (EE-ManagedThreadFactory-engineScheduled-Thread-56) [disks_syncAction_661ec431-2cfe-44f9] Failed VmReplicateDiskFinish (Disk '47379013-8220-4fb6-a49d-3a00781ee41f', VM '432dd185-08ef-4afc-a9e6-daecf8f370ee') 2018-04-27 21:44:16,992+03 ERROR [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] (EE-ManagedThreadFactory-engineScheduled-Thread-56) [disks_syncAction_661ec431-2cfe-44f9] Command 'LiveMigrateDisk' id: 'dfbea590-34f1-4b76-ab5d-e943725fa6b0' with children [46f74fa8-4c9f-4e6c-9acb-6e25d821c2ff, dad80a51-316a-4e32-86ff-156086962ade, 1b492bda-ef5a-4b46-84d1-1fec60e856c7] failed when attempting to perform the next operation, marking as 'ACTIVE' 2018-04-27 21:44:16,993+03 ERROR [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] (EE-ManagedThreadFactory-engineScheduled-Thread-56) [disks_syncAction_661ec431-2cfe-44f9] EngineException: Resource unavailable (Failed with error unavail and code 40): org.ovirt.engine.core.common.errors.EngineException: EngineException: Resource unavailable (Failed with error unavail and code 40) at org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand.replicateDiskFinish(LiveMigrateDiskCommand.java:406) [bll.jar:] at org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand.completeLiveMigration(LiveMigrateDiskCommand.java:378) [bll.jar:] at org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand.performNextOperation(LiveMigrateDiskCommand.java:242) [bll.jar:] at org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback.childCommandsExecutionEnded(SerialChildCommandsExecutionCallback.java:32) [bll.jar:] at org.ovirt.engine.core.bll.ChildCommandsCallbackBase.doPolling(ChildCommandsCallbackBase.java:68) [bll.jar:] at org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller.invokeCallbackMethodsImpl(CommandCallbacksPoller.java:145) [bll.jar:] at org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller.invokeCallbackMethods(CommandCallbacksPoller.java:106) [bll.jar:] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [rt.jar:1.8.0_171] at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [rt.jar:1.8.0_171] at org.glassfish.enterprise.concurrent.internal.ManagedScheduledThreadPoolExecutor$ManagedScheduledFutureTask.access$201(ManagedScheduledThreadPoolExecutor.java:383) [javax.enterprise.concurrent.jar:1.0.0.redhat-1] at org.glassfish.enterprise.concurrent.internal.ManagedScheduledThreadPoolExecutor$ManagedScheduledFutureTask.run(ManagedScheduledThreadPoolExecutor.java:534) [javax.enterprise.concurrent.jar:1.0.0.redhat-1] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [rt.jar:1.8.0_171] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [rt.jar:1.8.0_171] at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_171] at org.glassfish.enterprise.concurrent.ManagedThreadFactoryImpl$ManagedThread.run(ManagedThreadFactoryImpl.java:250) [javax.enterprise.concurrent.jar:1.0.0.redhat-1] at org.jboss.as.ee.concurrent.service.ElytronManagedThreadFactory$ElytronManagedThread.run(ElytronManagedThreadFactory.java:78) 2018-04-27 21:44:16,996+03 INFO [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] (EE-ManagedThreadFactory-engineScheduled-Thread-56) [disks_syncAction_661ec431-2cfe-44f9] Command 'LiveMigrateDisk' id: 'dfbea590-34f1-4b76-ab5d-e943725fa6b0' child commands '[46f74fa8-4c9f-4e6c-9acb-6e25d821c2ff, dad80a51-316a-4e32-86ff-156086962ade, 1b492bda-ef5a-4b46-84d1-1fec60e856c7]' executions were completed, status 'FAILED' Expected results: Live storage migration should succeed Additional info: libvirt.log is unavailable for this test execution.
Looks like: https://bugzilla.redhat.com/show_bug.cgi?id=1312909
This bug has not been marked as blocker for oVirt 4.3.0. Since we are releasing it tomorrow, January 29th, this bug has been re-targeted to 4.3.1.
*** This bug has been marked as a duplicate of bug 1312909 ***