Bug 618940
Summary: | Source host's qemu-kvm process hits core dump when do migration under copying image from host to guest | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 6 | Reporter: | juzhang <juzhang> | ||||||
Component: | qemu-kvm | Assignee: | Kevin Wolf <kwolf> | ||||||
Status: | CLOSED WORKSFORME | QA Contact: | Virtualization Bugs <virt-bugs> | ||||||
Severity: | medium | Docs Contact: | |||||||
Priority: | medium | ||||||||
Version: | 6.0 | CC: | amit.shah, ddumas, kcao, michen, mjenner, mkenneth, quintela, snagar, tburke, virt-maint | ||||||
Target Milestone: | rc | Keywords: | TestBlocker | ||||||
Target Release: | --- | ||||||||
Hardware: | All | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2010-08-11 03:52:40 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
juzhang
2010-07-28 06:01:34 UTC
Created attachment 434926 [details]
The detailed info about image after migration failed
This issue has been proposed when we are only considering blocker issues in the current Red Hat Enterprise Linux release. ** If you would still like this issue considered for the current release, ask your support representative to file as a blocker on your behalf. Otherwise ask that it be considered for the next Red Hat Enterprise Linux release. ** (In reply to comment #4) > It might be the same issue as bz 618601 Unlikely; there's no src host qemu crash in that case. Might still be the same cause. It feels so completely wrong to have the image opened in two qemu instances at the same time. If we can't delay opening the images on the destination until after the migration has completed, maybe we can open it read-only until then at least? Just kindly reminder,maybe useful. 1.if no "scp RHEL-Server-6.0.qcow2 from HostA to guest." operation,migration is ok. 2. HostA and HostB both shared iscsi storage. Please retest with qemu-kvm-0.12.1.2-2.106.el6 that contains the close-to-open consistency fix. Retested with qemu-kvm-0.12.1.2-2.106.el6. Using the steps as same as comment0.migration can be completed.Source host's qemu-kvm process didn't hit core dump.however,still exist two problem. 1.copying image from host to guest can't complete,with error"from UNKNOWN: 2: Packet corrupt,lost connection" #scp RHEL-Server-6.0-64-virtio.qcow2 10.66.91.95:/root root.91.95's password: RHEL-Server-6.0-64-virtio.qcow2 75% 1481MB 7.6MB/s 01:02 ETAReceived disconnect from UNKNOWN: 2: Packet corrupt lost connection 2. check image infos. 2.1 before migration #qemu-img check RHEL-Server-6.0-64-virtio.qcow2 No errors were found on the image. 2.2 After migration completed,check guest img.found lots of errors(19437),compared to comment0 errors number(77649 errors were found on the image).much less,the details infos,please have a look attachment.I just pasted error summary. #qemu-img check RHEL-Server-6.0-64-virtio.qcow2 >qemu2.106imgerrorinfo.txt 19437 errors were found on the image. Data may be corrupted, or further writes to the image may corrupt it. 24455 leaked clusters were found on the image. This means waste of disk space, but no harm to data. 139 internal errors have occurred during the check. An error has occurred during the check: Success The check is not complete and may have missed error. Additional info: if no "scp RHEL-Server-6.0.qcow2 from HostA to guest." operation,migration can be completed.after migration,check image.no error was found. #qemu-img check RHEL-Server-6.0-64-virtio.qcow2 No errors were found on the image. Created attachment 436427 [details]
The detailed info about image after migration completed with qemu-kvm-0.12.1.2-2.106.el6.x86_64
(In reply to comment #7) > Just kindly reminder,maybe useful. > 1.if no "scp RHEL-Server-6.0.qcow2 from HostA to guest." operation,migration is > ok. > 2. HostA and HostB both shared iscsi storage. Does it only happen with iscsi? Tried to reproduce with local files and NFS, but haven't been successful. Second question is if you really need scp or if it's just about writing lots of data (for example, is dd from /dev/urandom or /dev/zero to a file enough?) > Does it only happen with iscsi? Tried to reproduce with local files and NFS, > but haven't been successful. > Source host's qemu-kvm process hits core dump only happen with iscsi on qemu-kvm-0.12.1.2-2.99.el6.tested on qemu-kvm-0.12.1.2-2.106.el6,just as comment9 description. For NFS: Tested on tested on qemu-kvm-0.12.1.2-2.106.el6.migration can be completed.but,still exist problem like iscsi problem. 1.copying image from host to guest can't complete,with error"from UNKNOWN: 2: Packet corrupt,lost connection" #scp RHEL-Server-6.0-64-virtio.qcow2 10.66.91.95:/root/ root.91.95's password: RHEL-Server-6.0-64-virtio.qcow2 43% 859MB 5.1MB/s 03:37 ETAReceived disconnect from UNKNOWN: 2: Packet corrupt lost connection 2.2. check image infos. 2.1 before migration #qemu-img check RHEL-Server-6.0-64-virtio.qcow2 No errors were found on the image. 2.2 After migration completed,check guest img.found some cluster leaked.but no errors. 1711 leaked clusters were found on the image. This means waste of disk space, but no harm to data. For local files: migration can be completed,however,copying image from host to guest can't completed. #scp RHEL-Server-6.0-64-virtio.qcow2 10.66.91.95:/root/ root.91.95's password: RHEL-Server-6.0-64-virtio.qcow2 28% 567MB 19.7KB/s - stalled -Write failed: Broken pipe lost connection > Second question is if you really need scp or if it's just about writing lots of > data (for example, is dd from /dev/urandom or /dev/zero to a file enough?) dd from /dev/urandom or /dev/zero just can generate ios in host internal or in guest internal.can't make io from host to guest or from guest to host.I mainly want to make ios between host and guest,so using scp command. I also tested using dd if=/dev/vda of=/dev/zero bs=1M count=20480 in guest in the progress of migration.migration can be completed and no error was found in os image. So no more core dump? Is it the same as reported in 618601 - check the end of the bugzila there. (In reply to comment #13) > So no more core dump? > > Is it the same as reported in 618601 - check the end of the bugzila there. Yes,no more core dump I don't make sure my scenario is as same as bcao description.however,I can make sure guest lost connection after migration completed.just as I mentioned in comment12. I think we're actually facing two independent problems here: One related to block code that leads to cluster leakage (however no disk corruption any more), and another related to network code that lets the connection abort during migration. I haven't really had a look at the network one, but I tried and couldn't reproduce the cluster leakage yet. QE, please re-test with .62 kernel I think the problem is a duplicated of bug 619002. I'll close this one since the core does not exist no more (In reply to comment #16) > QE, please re-test with .62 kernel I think the problem is a duplicated of bug > 619002. I'll close this one since the core does not exist no more Retested on 2.6.32-62.el6.x86_64 with qemu-kvm-0.12.1.2-2.109.el6.x86_64. Still hit network issue."eceived disconnect from 10.66.91.95: 2: Packet corrupt lost connection". I will pasted my test results into bz619002. |