Bug 2219405 - Disable LSP_OPTIONS_MCAST_FLOOD_REPORTS for LSPs that may negatively affect the length of action lists in mcast handling flows
Summary: Disable LSP_OPTIONS_MCAST_FLOOD_REPORTS for LSPs that may negatively affect t...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-networking-ovn
Version: 16.2 (Train)
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: ---
Assignee: Lucas Alvares Gomes
QA Contact: Eran Kuris
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-07-03 13:51 UTC by Ihar Hrachyshka
Modified: 2024-09-05 08:22 UTC (History)
6 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2024-09-05 08:22:13 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker OSP-26278 0 None None None 2023-07-03 13:53:36 UTC

Description Ihar Hrachyshka 2023-07-03 13:51:04 UTC
This bug was initially created as a copy of Bug #2218631

I am copying this bug because: 

We should follow up on the comment from the original bug: https://bugzilla.redhat.com/show_bug.cgi?id=2218631#c18

Specifically, we should disable LSP_OPTIONS_MCAST_FLOOD_REPORTS for ports created by ml2/ovn: https://github.com/openstack/neutron/blob/b92d133de66411b9444855cc97480891d7d7b7b2/neutron/plugins/ml2/drivers/ovn/mech_driver/ovsdb/ovn_client.py#L369

The hope is that this will alleviate the size of action lists in table=38 of OVN (for the flows that handle mcast flooding), making it a potential work around for the OVS action list size limitation (tracked in the original bug).

Description of problem:
After creating close to 1220 VMs on a single compute, networking breaks for all of them and we see the following errors in the logs:

Jun 29 17:20:57 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:20:57 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:20:59 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:20:59 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:00 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:07 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:07 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:07 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:07 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:08 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:08 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:08 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:13 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:13 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:14 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:14 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:15 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768
Jun 29 17:21:15 compute00 kernel: openvswitch: netlink: Flow action size exceeds max 32768


Version-Release number of selected component (if applicable):
RHOSP16.2.5

How reproducible:
Always

Steps to Reproduce:
1. Create close to 1220 VMs (at 1219 it should still work)
2.
3.

Actual results:
Global outage

Expected results:
Create 2000 vMs on a single compute

Additional info:


Note You need to log in before you can comment on or make changes to this bug.