Bug 1252099
Summary: | Migration failed libvirtError in migrateToURI3 | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 7 | Reporter: | Israel Pinto <ipinto> | ||||||||||||
Component: | qemu-kvm-rhev | Assignee: | Ademar Reis <areis> | ||||||||||||
Status: | CLOSED NEXTRELEASE | QA Contact: | Virtualization Bugs <virt-bugs> | ||||||||||||
Severity: | urgent | Docs Contact: | |||||||||||||
Priority: | unspecified | ||||||||||||||
Version: | 7.1 | CC: | areis, bazulay, dyuan, ecohen, fjin, fromani, gklein, huding, ipinto, jsuchane, juzhang, knoel, lpeer, lsurette, michal.skrivanek, rbalakri, stefanha, tjelinek, virt-bugs, virt-maint, xfu, ycui, yeylon, zhwang | ||||||||||||
Target Milestone: | pre-dev-freeze | ||||||||||||||
Target Release: | --- | ||||||||||||||
Hardware: | Unspecified | ||||||||||||||
OS: | Unspecified | ||||||||||||||
Whiteboard: | virt | ||||||||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||||||||
Doc Text: | Story Points: | --- | |||||||||||||
Clone Of: | Environment: | ||||||||||||||
Last Closed: | 2015-09-22 14:24:08 UTC | Type: | Bug | ||||||||||||
Regression: | --- | Mount Type: | --- | ||||||||||||
Documentation: | --- | CRM: | |||||||||||||
Verified Versions: | Category: | --- | |||||||||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||||||||
Embargoed: | |||||||||||||||
Attachments: |
|
Description
Israel Pinto
2015-08-10 17:26:22 UTC
Created attachment 1061179 [details]
vdsm and engine logs
Adding logs, source host log: Thread-66209::ERROR::2015-08-11 10:05:49,785::vm::1416::virt.vm::(_getRunningVmStats) vmId=`b88b433c-7eeb-4236-83f9-1997c9f9d0eb`::Error fetching vm stats Traceback (most recent call last): File "/usr/share/vdsm/virt/vm.py", line 1413, in _getRunningVmStats vm_sample.interval) File "/usr/share/vdsm/virt/vmstats.py", line 43, in produce disks(vm, stats, first_sample, last_sample, interval) File "/usr/share/vdsm/virt/vmstats.py", line 262, in disks interval)) File "/usr/share/vdsm/virt/vmstats.py", line 291, in _disk_rate (last_sample['block.%d.rd.bytes' % last_index] - KeyError: 'block.0.rd.bytes' Thread-66209::INFO::2015-08-11 10:05:49,810::xmlrpc::92::vds.XMLRPCServer::(_process_requests) Request handler for 127.0.0.1:54663 stopped destination host log: JsonRpc (StompReactor)::DEBUG::2015-08-11 10:01:08,968::stompreactor::236::Broker.StompAdapter::(handle_frame) Handling message <StompFrame command=u'SEND'> Thread-31299::ERROR::2015-08-11 10:01:08,972::vm::780::virt.vm::(_startUnderlyingVm) vmId=`b88b433c-7eeb-4236-83f9-1997c9f9d0eb`::Failed to start a migration destination vm Traceback (most recent call last): File "/usr/share/vdsm/virt/vm.py", line 750, in _startUnderlyingVm self._completeIncomingMigration() File "/usr/share/vdsm/virt/vm.py", line 2732, in _completeIncomingMigration self._incomingMigrationFinished.isSet(), usedTimeout) File "/usr/share/vdsm/virt/vm.py", line 2785, in _attachLibvirtDomainAfterMigration raise MigrationError(e.get_error_message()) MigrationError: Domain not found: no domain with matching uuid 'b88b433c-7eeb-4236-83f9-1997c9f9d0eb' Thread-31299::INFO::2015-08-11 10:01:08,975::vm::1305::virt.vm::(setDownStatus) vmId=`b88b433c-7eeb-4236-83f9-1997c9f9d0eb`::Changed state to Down: VM failed to migrate (code=8) looking at the logs....there are multiple failures, seem unrelated to each other. Which one you are referring to? The one incoming migration to .159 host ends with "libvirtError: internal error: process exited while connecting to monitor:", which points to qemu crash. We would need libvirt and qemu logs Also note this is a recent feature on qemu/libvirt side, it may behave differently on EL7.2 also note the logs do not cover the period you highlighted in comment #2 also it seems a lot like the issue with the starting of the VM which had 12345 IO Threads configured. Is this happening when you have this amount of IO threads configured? If yes, there is a patch going which will limit the num of threads per vm. Created attachment 1061346 [details]
update_vdsm_engine_logs
about comment 6: We can't vm with 12345 threads see bug: https://bugzilla.redhat.com/show_bug.cgi?id=1251899 I see the failure with 5 threads. I attached update logs with migration failures. If we disable the IO Threads the migration working OK. (test 5 times consecutively) thanks. but again, "libvirtError: internal error: process exited while connecting to monitor:" indicate it failed in qemu, so we need qemu and libvirt logs as well Created attachment 1061928 [details]
New_vdsm_ &&_engine_logs
Created attachment 1061929 [details]
libvirt_logs_host1
Created attachment 1061931 [details]
libvirt_logs_host_2
Added new logs for libvirt and vdsm. The migration failed also when IO-Thread are disabled I can see QEMU crashed/exited right after create on destination side with "2015-08-12T11:38:15.551530Z qemu-kvm: load of migration failed: Input/output error" Moving to libvirt team for investigation. Note it's 7.1 qemu Nisim, I would suggest to retest on RHEL 7.2 host Moving one layer down to qemu-kvm-rhev for further investigation. engine version: 3.6.0-0.12.master.el6 hosts rhel 7.2: vdsm version: 3.6.0-0.12.master.el6 Did not reproduce. (In reply to Israel Pinto from comment #21) > engine version: 3.6.0-0.12.master.el6 > hosts rhel 7.2: > vdsm version: 3.6.0-0.12.master.el6 > > Did not reproduce. Then we're good I guess All we need is 7.2 which is aligned to RHEV 3.6 |