The FDP team is no longer accepting new bugs in Bugzilla. Please report your issues under FDP project in Jira. Thanks.
Bug 2172036 - [IP] Random Tempest test fails due to missing flows for metadata port
Summary: [IP] Random Tempest test fails due to missing flows for metadata port
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux Fast Datapath
Classification: Red Hat
Component: ovn22.03
Version: FDP 22.D
Hardware: Unspecified
OS: Unspecified
medium
high
Target Milestone: ---
: ---
Assignee: OVN Team
QA Contact: Jianlin Shi
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-02-21 09:18 UTC by Yatin Karel
Modified: 2024-02-14 21:15 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2024-02-14 21:15:44 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
Flows and db dumps before and after the issue reproduces (5.52 MB, application/gzip)
2023-02-21 09:18 UTC, Yatin Karel
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker FD-2699 0 None None None 2023-02-21 09:19:11 UTC

Description Yatin Karel 2023-02-21 09:18:26 UTC
Created attachment 1945447 [details]
Flows and db dumps before and after the issue reproduces

Description of problem:
Detected in OpenStack Upstream CI[1][2]. Issue happens only on 22.03(22.03.0, 22.03.2, branch-22.03 etc). It's different than [3] as issue is seen with fix of it included. 

$ sudo ovs-vsctl list open .
_uuid               : 27746eeb-17bd-4d82-8efa-fc716594fda8
bridges             : [24bc6520-aa34-49e9-b4d0-ac6e09453672, 51a5686c-fcd7-4728-bb29-768e473d5826]
cur_cfg             : 634
datapath_types      : [netdev, system]
datapaths           : {system=3c3a1871-321f-450d-a3c6-8a8a86dd5d6c}
db_version          : []
dpdk_initialized    : false
dpdk_version        : none
external_ids        : {hostname=ykarel-jammytest, ovn-bridge=br-int, ovn-bridge-mappings="public:br-ex", ovn-cms-options=enable-chassis-as-gw, ovn-encap-ip="10.0.111.188", ovn-encap-type=geneve, ovn-remote="tcp:10.0.111.188:6642", system-id="c2a81ace-a1e3-491a-be11-97c8b6778374"}
iface_types         : [bareudp, erspan, geneve, gre, gtpu, internal, ip6erspan, ip6gre, lisp, patch, stt, system, tap, vxlan]
manager_options     : [b5aa86b7-b76b-4fc8-8716-2a1d86479d60]
next_cfg            : 634
other_config        : {vlan-limit="0"}
ovs_version         : []
ssl                 : []
statistics          : {}
system_type         : devstack
system_version      : []


Attached dbs, physical flows(with and without stats):-
- fresh(no tests runs)
- afterdummy resources create/cleanup
- beforerecompute flows miss detected
- after running recompute
- after cleanup of test resources

Version-Release number of selected component (if applicable):
ovn-22.03

How reproducible:
In CI[1][2] it's seen randomly. But with the reproducer below could reproduce consistently.

Steps to Reproduce:
1. Setup devstack with ovn22.03
2. Run dummy.sh shared below(reproduces only if logical_router "enabled=true" is set)
3. Run any of the tempest test relying on metadata flows, for ex:- tempest run --regex tempest.scenario.test_server_basic_ops.TestServerBasicOps.test_server_basic_ops
4. Check for flows of localport, those are missing
5. running recompute get backs the flows, ex:-

$ diff beforerecompute_flows_nostats.txt afterrecompute_flows_nostats.txt 
5a6
>  cookie=0x2233ffaa, priority=100,in_port=287 actions=load:0x10->NXM_NX_REG13[],load:0xa->NXM_NX_REG11[],load:0x9->NXM_NX_REG12[],load:0x5->OXM_OF_METADATA[],load:0x1->NXM_NX_REG14[],load:0x1->NXM_NX_REG10[10],resubmit(,8)
774a776
>  cookie=0x2233ffaa, table=37, priority=150,reg14=0x1,metadata=0x5 actions=resubmit(,38)
808d809
<  cookie=0x9cfac85f, table=38, priority=100,reg15=0x8000,metadata=0x5 actions=load:0xf->NXM_NX_REG13[],load:0x3->NXM_NX_REG15[],resubmit(,39),load:0x8000->NXM_NX_REG15[]
810c811,813
<  cookie=0xd295f58d, table=38, priority=100,reg15=0x8005,metadata=0x5 actions=load:0xf->NXM_NX_REG13[],load:0x3->NXM_NX_REG15[],resubmit(,39),load:0x8005->NXM_NX_REG15[]
---
>  cookie=0xd295f58d, table=38, priority=100,reg15=0x8005,metadata=0x5 actions=load:0x10->NXM_NX_REG13[],load:0xf->NXM_NX_REG13[],load:0x3->NXM_NX_REG15[],resubmit(,39),load:0x8005->NXM_NX_REG15[]
>  cookie=0x9cfac85f, table=38, priority=100,reg15=0x8000,metadata=0x5 actions=load:0x10->NXM_NX_REG13[],load:0xf->NXM_NX_REG13[],load:0x3->NXM_NX_REG15[],resubmit(,39),load:0x8000->NXM_NX_REG15[]
>  cookie=0x2233ffaa, table=38, priority=100,reg15=0x1,metadata=0x5 actions=load:0x10->NXM_NX_REG13[],load:0xa->NXM_NX_REG11[],load:0x9->NXM_NX_REG12[],resubmit(,39)
828a832
>  cookie=0x2233ffaa, table=39, priority=100,reg10=0/0x1,reg14=0x1,reg15=0x1,metadata=0x5 actions=drop
1082a1087
>  cookie=0x2233ffaa, table=64, priority=100,reg10=0x1/0x1,reg15=0x1,metadata=0x5 actions=push:NXM_OF_IN_PORT[],load:0xffff->NXM_OF_IN_PORT[],resubmit(,65),pop:NXM_OF_IN_PORT[]
1099a1105
>  cookie=0x2233ffaa, table=65, priority=100,reg15=0x1,metadata=0x5 actions=output:287
1103a1110
>  cookie=0xffc28b38, table=66, priority=100,reg0=0xac180401,reg15=0x1,metadata=0x6 actions=mod_dl_dst:22:65:bc:24:e9:49,load:0x1->NXM_NX_REG10[6]
1107a1115
>  cookie=0xffc28b38, table=67, priority=100,arp,reg0=0xac180401,reg14=0x1,metadata=0x6,dl_src=22:65:bc:24:e9:49 actions=load:0x1->NXM_NX_REG10[6]

6. rerunning the test again succeeds.

Actual results:
Flows missing for localport

Expected results:
Flows should exist for localport without running recompute, looks Incremental processing bug

Additional info:
=====dummy.sh=======
# cat repr.sh 
ovn-nbctl lr-add dummylr1 -- set logical_router dummylr1 enabled=true
ovn-nbctl ls-add dummyls1
ovn-nbctl lrp-add dummylr1 dummylr1-dummyls1 00:00:00:00:00:01 192.168.1.154/24
ovn-nbctl lsp-add dummyls1 dummyls1-dummylr1

# cleanup dummy resources
ovn-nbctl lsp-del dummyls1-dummylr1
ovn-nbctl lrp-del dummylr1-dummyls1
ovn-nbctl ls-del dummyls1
ovn-nbctl lr-del dummylr1


- ovn22.06 is not hitting this issue since [4], so some part of that commit fixes the issue.




[1] https://bugs.launchpad.net/neutron/+bug/2007166
[2] https://bugs.launchpad.net/neutron/+bug/2007565
[3] https://bugzilla.redhat.com/show_bug.cgi?id=2076604
[4] https://github.com/ovn-org/ovn/commit/eaf9832be248daca6d96202d504cc789d6dfec6d

Comment 1 OVN Bot 2024-02-14 21:15:43 UTC
This issue is being closed as an automatic process due to the issue's age. If you wish to re-open this issue, please do so in Jira (https://issues.redhat.com) in the 'FDP' project. Please be sure to set the component to the latest OVN version where this issue is known to occur. If this is a feature request or improvement, please set the component to 'OVN'.


Note You need to log in before you can comment on or make changes to this bug.