Bug 1330394
Summary: | sometimes vm migration failed and generates the core of the qemu process in RHEV. | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 7 | Reporter: | Sachin Raje <sraje> | ||||
Component: | spice | Assignee: | Default Assignee for SPICE Bugs <rh-spice-bugs> | ||||
Status: | CLOSED DUPLICATE | QA Contact: | SPICE QE bug list <spice-qe-bugs> | ||||
Severity: | unspecified | Docs Contact: | |||||
Priority: | unspecified | ||||||
Version: | 7.2 | CC: | amit.shah, cfergeau, chayang, dgilbert, huding, juzhang, knoel, qizhu, quintela, sraje, tpelka, victortoso, virt-maint, xfu | ||||
Target Milestone: | rc | ||||||
Target Release: | --- | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2016-05-10 07:00:10 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Sachin Raje
2016-04-26 06:55:05 UTC
(In reply to Sachin Raje from comment #0) > libvirtError: internal error: early end of file from monitor: possible > problem: > 2016-04-07T18:45:08.064325Z qemu-kvm: load of migration failed: Input/output > error This error message isn't too descriptive, but it usually happens when there's a device mismatch (after hotplug operations). Were any hotplug/unplug operations performed on the VM prior to migration? What were the qemu command lines on the src and dest machines? The core dump doesn't seem to belong to the crashed VM (or, the qemu version that produced the dump is different from the one that's mentioned). Running gdb on the crash, I don't get a proper backtrace; and in fact there are some call sites that are shown to be in TCG (i.e. non-KVM) code, so something is definitely amiss here. Can you check the qemu version that produced this crash? Also, this crash was on the src host, right? So the VM was lost during migration? Any messages that QEMU output when it crashed? Logs from src qemu and libvirt could provide clues. So one thing I see from the provided qemu versions is that the src is on 7_2.10 version, and dest is on 7.2_4. Since the 7.2_10 binary doesn't produce a valid gdb backtrace, I gave 7_2.4 a try, and it does work. So it looks like the src host was in fact running 7_2.4 when the crash happened. Backtrace is: (gdb) bt #0 timer_del (ts=0x2020202020202020) at qemu-timer.c:401 #1 0x00007f435e0ece41 in spice_server_vm_stop (s=<optimized out>) at reds.c:4615 #2 0x00007f4364e5c234 in qemu_spice_display_stop () at ui/spice-core.c:930 #3 vm_change_state_handler (opaque=<optimized out>, running=<optimized out>, state=<optimized out>) at ui/spice-core.c:639 #4 0x00007f4364d72082 in vm_state_notify (running=running@entry=0, state=state@entry=RUN_STATE_FINISH_MIGRATE) at vl.c:1517 #5 0x00007f4364cac8b2 in do_vm_stop (state=RUN_STATE_FINISH_MIGRATE) at /usr/src/debug/qemu-2.3.0/cpus.c:603 #6 vm_stop (state=RUN_STATE_FINISH_MIGRATE) at /usr/src/debug/qemu-2.3.0/cpus.c:1297 #7 0x00007f4364cac916 in vm_stop_force_state (state=state@entry=RUN_STATE_FINISH_MIGRATE) at /usr/src/debug/qemu-2.3.0/cpus.c:1305 #8 0x00007f4364e33832 in migration_thread (opaque=0x7f4365330fa0 <current_migration.34315>) at migration/migration.c:806 #9 0x00007f43637b6dc5 in start_thread (arg=0x7f414a3fe700) at pthread_create.c:308 #10 0x00007f435d19721d in lseek64 () at ../sysdeps/unix/syscall-template.S:81 #11 0x0000000000000000 in ?? () This looks like it's a use-after-free in spice-server. Re-assigning the bug to Marc-Andre for further investigation. Can you let us know what the spice-server version is on the src? Attaching the full backtrace, as the core file is too huge to be downloaded in reasonable time. Created attachment 1152147 [details]
full backtrace
Closing this one as it seems to be addressed in bug #1281455 with fixes at spice-0.12.4-17.el7 *** This bug has been marked as a duplicate of bug 1281455 *** |