Bug 2070035

Summary: 16.1 -> 16.2 upgrade caused significant network downtime
Product: Red Hat OpenStack Reporter: Alex Stupnikov <astupnik>
Component: python-networking-ovn Assignee: Rodolfo Alonso <ralonsoh>
Status: CLOSED NOTABUG QA Contact: Eran Kuris <ekuris>
Severity: high Docs Contact:
Priority: unspecified    
Version: 16.2 (Train) CC: apevec, egarciar, lhh, majopela, ralonsoh, scohen
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-07-21 08:23:43 UTC Type: Bug

Description Alex Stupnikov 2022-03-30 10:49:08 UTC
Description of problem:

One of our customers reported a loss of connectivity to instances during the RHOSP 16.1.6 --> RHOSP 16.2.1 upgrade. The customer is not 100% sure, but believes that all instances were affected.

From the provided logs, it looks like the deployment framework acted properly to address the problem described in https://bugzilla.redhat.com/show_bug.cgi?id=1895220#c6: the OVN controllers were restarted and updated before ovn-northd.

At the same time, instances were unavailable for a significant period of time, which could be related to a problem inside OVN. I can't find any related errors in the compute logs.

I would like to ask for engineering help with OVN troubleshooting and possible bug isolation.