Bug 1575016
| Summary: | ovs-vswitchd mempool free race condition | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 7 | Reporter: | Kevin Traynor <ktraynor> |
| Component: | openvswitch | Assignee: | Kevin Traynor <ktraynor> |
| Status: | CLOSED ERRATA | QA Contact: | qding |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 7.6 | CC: | atragler, ctrautma, fbaudin, jhsiao, ktraynor, pvauter, tredaelli |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | openvswitch-2.9.0-28.el7fdn | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2018-06-21 13:36:35 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Kevin Traynor
2018-05-04 14:30:19 UTC
Hi Kevin, I'm verifying the bug. Have you got steps to reproduce the issue? Or any idea on how to verify it? Thanks, QJ Hi QC,
I couldn't reproduce the customer seg fault, so I think the best way we can validate is to check the logs and see that the new mempool freeing scheme is working as expected.
1. Setup 2 phy ports in OVS-DPDK on the same NUMA node, with the same MTU, and run traffic between them
2. Turn on debug logging for netdev_dpdk
# ovs-appctl vlog/set netdev_dpdk:console:dbg
# ovs-appctl vlog/set netdev_dpdk:file:dbg
3.Check the name of mempool that is currently being used
# ovs-appctl netdev-dpdk/get-mempool-info dpdk0 | grep "mempool"
mempool <ovs_mp_2030_0_262144>@0x7f9c500f6b40
# ovs-appctl netdev-dpdk/get-mempool-info dpdk1 | grep "mempool"
mempool <ovs_mp_2030_0_262144>@0x7f9c500f6b40
4. Change MTU size of the ports to 5000
# ovs-vsctl -- set Interface dpdk0 mtu_request=5000
|netdev_dpdk|DBG|Allocated "ovs_mp_6126_0_262144"
# ovs-vsctl -- set Interface dpdk1 mtu_request=5000
|netdev_dpdk|DBG|Reusing mempool "ovs_mp_6126_0_262144"
7. At this point the ovs_mp_2030_0_262144 mempool is not associated with any of the ports but has not been freed, in order to allow more time for buffers to be returned to it. If you are running in a script, better to wait for a couple of seconds here.
8. Change MTU size of the ports to 9000
# ovs-vsctl -- set Interface myport mtu_request=9000
|netdev_dpdk|DBG|Freeing mempool "ovs_mp_2030_0_262144"
^^^^^^^
|netdev_dpdk|DBG|Allocated "ovs_mp_9198_0_262144"
It is likely Freeing will happen here, but it is also possible due to the traffic pattern / driver etc. that there are still some in-use buffers and in that case it will take some further time and MTU changes before the mempool can be freed. That is perfectly fine too.
# ovs-vsctl -- set Interface urport mtu_request=9000
|netdev_dpdk|DBG|Reusing mempool "ovs_mp_9198_0_262144"
thanks,
Kevin
Failed to reproduce the issue in RH lab with either openvswitch-2.6.1-16.git20161206.el7ost.x86_64 or openvswitch-2.9.0-15.el7fdp.x86_64.rpm. Verified with openvswitch-2.9.0-36.el7fdp.x86_64.rpm and the steps in Comment$4. No segfault seen and there are logs: |netdev_dpdk|DBG|Reusing mempool "ovs_mp_9198_1_65536" |netdev_dpdk|DBG|Reusing mempool "ovs_mp_6126_1_32768" |netdev_dpdk|DBG|Freeing mempool "ovs_mp_9198_1_65536" Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:1962 (In reply to qding from comment #5) > Verified with openvswitch-2.9.0-36.el7fdp.x86_64.rpm and the steps in > Comment$4. Just to note, I think there is a typo and this should be the fdn package, not fdp. i.e. openvswitch-2.9.0-36.el7fdn.x86_64.rpm There is no fdp package of that number in brew. (In reply to Kevin Traynor from comment #8) > (In reply to qding from comment #5) > > > Verified with openvswitch-2.9.0-36.el7fdp.x86_64.rpm and the steps in > > Comment$4. > > Just to note, I think there is a typo and this should be the fdn package, > not fdp. i.e. openvswitch-2.9.0-36.el7fdn.x86_64.rpm > > There is no fdp package of that number in brew. I do use openvswitch-2.9.0-36.el7fdp.x86_64.rpm. You can find it in http://download-node-02.eng.bos.redhat.com/brewroot/packages/openvswitch/2.9.0/36.el7fdp/ |