Bug 2074161

Summary: Neutron Networks silently removed
Product: Red Hat OpenStack Reporter: Maysa Macedo <mdemaced>
Component: python-networking-ovnAssignee: Lucas Alvares Gomes <lmartins>
Status: NEW --- QA Contact: Eran Kuris <ekuris>
Severity: high Docs Contact:
Priority: unspecified    
Version: 16.2 (Train)CC: apevec, lhh, lmartins, majopela, scohen, twilson
Target Milestone: ---Flags: mdemaced: needinfo? (lmartins)
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Maysa Macedo 2022-04-11 16:40:03 UTC
Description of problem:

While handling an OpenShift Namespace, Kuryr creates a Network and Subnet and connect to a Router. Even though the creation of a couple of Network has succeeded, the creation of Subnets for those Networks failed as apparently those Networks were internally deleted by Neutron. Since the Networks were successfully created, their uids were injected to Kuryr custom resources and were considered ready to use when in fact they were gone. Kuryr would then constantly try to create those Subnets and always fail due to Network not found.

Some of the Networks that disappeared:

c055b070-0162-4fc0-a8e4-ec0b592b4122
75bc4afb-61ea-4c34-8009-6af3f046ac8d
30706f65-6966-4ade-97dc-6ac169684fba
329c5a3b-d862-414b-8f1a-e9f372487bb1
a7384f18-306d-44e2-b5fd-8121ae0a7225
0f2ba723-2769-487f-b391-e0ba424a6a3c
9bfb14c2-ada8-4ab8-a355-0a433c333512
28560040-0509-4601-8791-d64e24b3e959

I see the following log for some of those Networks:

2022-04-08 13:40:56.271 18 ERROR neutron.plugins.ml2.plugin [req-2f7e6211-d8ae-4988-aae4-7da83fa8f9dc 2adeb31a0df24d20b90dc9ef5bedf78d b04c7dd4e9594ffa937ec2da47a75c5f - default default] mechanism_manager.create_network_postcommit failed, deleting network 'c055b070-0162-4fc0-a8e4-ec0b592b4122': neutron.plugins.ml2.common.exceptions.MechanismDriverError

Seems the API didn't report a failure, otherwise Kuryr would not have injected the Network uid to the custom resource:

$ oc get kuryrnetworks 5ffe3ad5-9759-4208-96a5-3407eabae629-168 -n 5ffe3ad5-9759-4208-96a5-3407eabae629-168 -o yaml |grep netId
  netId: c055b070-0162-4fc0-a8e4-ec0b592b4122


 
Version-Release number of selected component (if applicable):
16.2.2
neutron_api manually updated with https://review.opendev.org/c/openstack/neutron/+/834657

How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info: