Bug 1491909
Summary: | IP network can not recover after several vhost-user reconnect
---|---
Product: | Red Hat Enterprise Linux 7
Component: | qemu-kvm-rhev
Status: | CLOSED ERRATA
Severity: | high
Priority: | high
Version: | 7.5
Reporter: | Pei Zhang <pezhang>
Assignee: | Maxime Coquelin <maxime.coquelin>
QA Contact: | Pei Zhang <pezhang>
CC: | ailan, atragler, chayang, echaudro, jfreiman, juzhang, lmiksik, michen, pezhang, qding, rkhan, tredaelli, virt-maint
Target Milestone: | rc
Keywords: | Extras, Regression
Hardware: | Unspecified
OS: | Unspecified
Fixed In Version: | qemu-kvm-rhev-2.10.0-12.el7
: | 1579716
Last Closed: | 2018-04-11 00:33:01 UTC
Type: | Bug
Bug Blocks: | 1579716
Description
Pei Zhang
2017-09-15 02:39:17 UTC
Created attachment 1326264
VM XML file.
3. Additional info

After restarting ovs, "net eth1: Unexpected TXQ (0) queue failure: -5" is shown repeatedly in dmesg:

# dmesg
...
[ 92.339221] virtio_net virtio1: output.0:id 0 is not a head!
[ 92.339652] net eth1: Unexpected TXQ (0) queue failure: -5
[ 93.048195] net eth1: Unexpected TXQ (0) queue failure: -5
[ 93.339195] net eth1: Unexpected TXQ (0) queue failure: -5
[ 94.341178] net eth1: Unexpected TXQ (0) queue failure: -5
[ 95.343173] net eth1: Unexpected TXQ (0) queue failure: -5
[ 97.049156] net eth1: Unexpected TXQ (0) queue failure: -5
[ 98.051158] net eth1: Unexpected TXQ (0) queue failure: -5
[ 99.062141] net eth1: Unexpected TXQ (0) queue failure: -5
...

I tried to replicate the issue, but I do not see it on my netdev servers. Ping continues (with a few missing) during the run of your script.

These are my versions:

$ rpm -q openvswitch kernel qemu-kvm-rhev libvirt
openvswitch-2.8.0-1.el7fdb.x86_64
kernel-3.10.0-693.el7.x86_64
qemu-kvm-rhev-2.9.0-16.el7_4.8.x86_64
libvirt-3.2.0-14.el7_4.3.x86_64

I do see you have a newer version of libvirt, not sure where you got it, but it should not be a problem.

I also tried virtual machine to virtual machine, and that works fine too. As a VM host OS I use CentOS. What do you use?

I can make my machines available for you to see if you can get it replicated, or if you have a failing setup I can use that to troubleshoot?

(In reply to Eelco Chaudron from comment #5)
> I tried to replicate the issue, but I do not see it on my netdev servers.
> Ping continues (with a few missing) during the run of your script.
> These are my versions:
>
> $ rpm -q openvswitch kernel qemu-kvm-rhev libvirt
> openvswitch-2.8.0-1.el7fdb.x86_64
> kernel-3.10.0-693.el7.x86_64
> qemu-kvm-rhev-2.9.0-16.el7_4.8.x86_64
> libvirt-3.2.0-14.el7_4.3.x86_64
>
> I do see you have a newer version of libvirt, not sure where you got it, but
> it should not be a problem.
>
> I also tried virtual machine to virtual machine, and that works fine too.
> As a VM host OS I use CentOS. What do you use?
>
> I can make my machines available for you to see if you can get it
> replicated, or if you have a failing setup I can use that to troubleshoot?

Hi Eelco,

Sorry for the late reply; I was out of the office for the last 2 weeks and only got back to work today.

I can still reproduce this issue with openvswitch-2.8.0-3.el7fdb.x86_64.

Note: This issue is triggered after restarting ovs several (about 10) times.

I have kept my testing environment, so please log in. I'll add the host details in the next comment.

Best Regards,
Pei

After some discussion with Maxime, he can also replicate this with testpmd and qemu. He will take a look at this BZ, so I am re-assigning it to him and changing the component to DPDK for now.

Series merged upstream & posted downstream.

New brew build:
https://brewweb.engineering.redhat.com/brew/taskinfo?taskID=14683959

Fix included in qemu-kvm-rhev-2.10.0-12.el7

Verification:

Versions:
3.10.0-814.el7.x86_64
qemu-kvm-rhev-2.10.0-12.el7.x86_64
libvirt-3.9.0-5.el7.x86_64
openvswitch-2.8.0-4.el7fdb.x86_64
dpdk-17.11-1.el7fdb.x86_64

Steps:
Same as in the Description. Reconnected ovs 100 times with PASS results. The guest network always recovers after each reconnect, and there are no errors in the guest.

So this bug has been fixed.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2018:1104
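
For anyone re-running the verification, the symptom from the description (repeated "Unexpected TXQ" lines in dmesg after a reconnect) can be checked mechanically. The following is only a minimal sketch and was not part of the original test: the function name `count_txq_failures` and the `sample` text are illustrative, and the pattern is assumed from the log lines quoted in the description.

```python
import re

# Pattern assumed from the dmesg lines quoted in the description, e.g.
# "net eth1: Unexpected TXQ (0) queue failure: -5"
TXQ_FAILURE = re.compile(
    r"net (?P<iface>\S+): Unexpected TXQ \((?P<queue>\d+)\) queue failure: (?P<err>-?\d+)"
)


def count_txq_failures(dmesg_text):
    """Return {(iface, queue): count} for TXQ failure lines in dmesg output.

    Hypothetical helper for illustration; not taken from the bug report.
    """
    counts = {}
    for line in dmesg_text.splitlines():
        match = TXQ_FAILURE.search(line)
        if match:
            key = (match.group("iface"), int(match.group("queue")))
            counts[key] = counts.get(key, 0) + 1
    return counts


# Sample input copied from the first lines of the log in the description.
sample = """\
[ 92.339221] virtio_net virtio1: output.0:id 0 is not a head!
[ 92.339652] net eth1: Unexpected TXQ (0) queue failure: -5
[ 93.048195] net eth1: Unexpected TXQ (0) queue failure: -5
"""

print(count_txq_failures(sample))
```

An empty result after each reconnect would correspond to the "no errors in the guest" condition in the verification comment. In a real run the text would come from the guest's `dmesg` output rather than a literal string.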