Bug 1663405
Summary: | migration failed with enable compress | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux Advanced Virtualization | Reporter: | Yiqian Wei <yiwei> |
Component: | qemu-kvm | Assignee: | Virtualization Maintenance <virt-maint> |
qemu-kvm sub component: | Live Migration | QA Contact: | Li Xiaohui <xiaohli> |
Status: | CLOSED WONTFIX | Docs Contact: | |
Severity: | high | ||
Priority: | high | CC: | aadam, ailan, chayang, dgilbert, fjin, jinzhao, juzhang, peterx, quintela, rbalakri, thuth, virt-maint, xianwang, xiaohli, yuhuang |
Version: | 8.0 | Keywords: | Triaged |
Target Milestone: | rc | ||
Target Release: | 8.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-03-15 07:32:49 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1758964, 1771318 |
Description
Yiqian Wei
2019-01-04 08:27:31 UTC
Hi Juan, I reproduce this bz on rhel8.0.1 and rhel8.1.0 host sometimes, not always reproduce. Need I clone this bz for rhel8.0.1 or rhel8.1.0 since this bz is reported in qemu-kvm, adn the versions about qemu-kvm on rhel8.0, rhel8.0.1 and rhel8.1.0 are different Best regards, Li Xiaohui hi multifd + compress don't work. I posted upstream a new way to do compression on top of multifd. Will improve the error message. Hi, I missread the previous commit. This has nothing to do with multifd, investigating what happens with compression. Hi all, sometimes reproduce this bz on rhel8.1-av(kernel-4.18.0-129.el8.x86_64 & qemu-img-4.1.0-1.module+el8.1.0+3966+4a23dca1.x86_64), guest is kernel-4.18.0-130.el8.x86_64, thanks. Hi compression is really difficult to support, and as far as we know we don't use it (the current implementation is only useful if you are migrating over a really, really slow link), otherwise the amount of traffic that is saved is small, and the ammount of CPU that is needed to make it work don't help here. Notice that the compression that RHV uses is "XBZRLE". (capability xbzrle on info migrate). compress capability is a completelly different beast, based on zlib that we don't support. There are two compression methods in qemu: - xbzrle - zlib (this is older, so it got the "compression" name) In xbzrle (the one that we support on RHV), we got a big cache of memory, and we save a copy of (some) of the transferred pages. If the page is dirty again, we just send the difference with the previous page sent, so, we transmit less bits. With the zlib compression (that we don't support), we copy the memory to other place, start a thread to do the compression (that is a slow operation in itself), and copy back to the main thread. This is really very slow. It was introduced because at some point there was going to be intel processors that were able to do this compression fast, but they haven't appeared. So we don't support it, we know that it is very slow in its current incarnation and that is why we don't support it. There are partches posted on qemu list that will be integrated on upstream qemu that use zlib (and zstd) on top of multifd and that they are faster, and we can support. but that is for future versions. One line summary: We don't support zlib compression because we know that it is not reliable. Hi Juan& Hai, I can reproduce this bz on the latest rhel8.1.1-av test. From Juan's above comments, if don't support zlib compression, could disable it at all? Then QE won't test and trace related problems. Thanks. Hi We will do it upstream. But not for 8.1.1. Later, Juan. (In reply to Juan Quintela from comment #9) > Hi > > We will do it upstream. But not for 8.1.1. > Will for rhel8.2.0? > Later, Juan. Yes. QEMU has been recently split into sub-components and as a one-time operation to avoid breakage of tools, we are setting the QEMU sub-component of this BZ to "General". Please review and change the sub-component if necessary the next time you review this BZ. Thanks (In reply to Li Xiaohui from comment #10) > (In reply to Juan Quintela from comment #9) > > Hi > > > > We will do it upstream. But not for 8.1.1. > > > Will for rhel8.2.0? Has this upstream work ever been included in 8.2 or 8.3? ... if yes, could we move this bug forward now? No product uses compression on RHEL. No solution upstream, as said, we have a compression solution on top of multifd that is easier to maintain and much faster. So postpone it. Will try in the beginning of January 2021 since recently busy with other things and will be PTO in next week Hi Amnon, I have tested this issue on RHEL-8.4.0-AV(kernel-4.18.0-262.el8.dt3.x86_64&qemu-img-5.2.0-2.module+el8.4.0+9186+ec44380f.x86_64), still can reproduce(not always). If we plan to close it as won'tfix, could you or Juan give QE a confirmation that QE needn't test multi-thread-compression anymore and needn't track related bzs? Thank you. Sorry, thank you Ariel. After evaluating this issue, there are no plans to address it further or fix it in an upcoming release. Therefore, it is being closed. If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened. Close this bz as WONTFIX since deprecate multi-thread-compression from migration test plan. |