Bug 2125824

Summary: Neutron server doesn't start
Product: Red Hat OpenStack
Reporter: Jakub Libosvar <jlibosva>
Component: python-networking-ovn
Assignee: Jakub Libosvar <jlibosva>
Status: CLOSED ERRATA
QA Contact: Maor <mblue>
Severity: urgent
Docs Contact:
Priority: urgent
Version: 16.1 (Train)
CC: apevec, astillma, astupnik, egarciar, jamsmith, jelynch, jmelvin, joflynn, ldenny, lhh, ljozsa, ltamagno, majopela, mariel, mblue, scohen
Target Milestone: z9
Keywords: Regression, Triaged
Target Release: 16.1 (Train on RHEL 8.2)
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: python-networking-ovn-7.3.1-1.20220926193640.4e24f4c.el8ost
Doc Type: Bug Fix
Doc Text:
This update in RHOSP 16.1.9 fixes a bug that causes the Networking service (neutron) to fail to start, and causes OVN database instability, after an update to RHOSP 16.1.8. Instead of updating to RHOSP 16.1.8, update directly to RHOSP 16.1.9.
Story Points: ---
Clone Of:
: 2127229 2127230 2132151
Environment:
Last Closed: 2022-12-07 20:27:09 UTC
Type: Bug
Bug Depends On: 2127229    
Bug Blocks: 2132151    

Description Jakub Libosvar 2022-09-10 21:25:09 UTC
Description of problem:
In larger environments, connecting to the OVN databases can take a long time. The Neutron OVN mechanism driver attempts to connect during the pre-fork phase in order to configure the OVN DB server, but the API workers are started before that. The API workers rely on a hash ring mechanism that considers a hash ring node dead if it has not been touched in the last 60 seconds. The touching is done by a periodic task that is executed from each maintenance worker; however, the maintenance worker is only started in the post-fork phase.

This means that if connecting to both OVN DBs takes more than 60 seconds, the API worker hash ring nodes are considered dead. The server eventually recovers, but that takes more than an hour.
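
To illustrate the timing problem described above, here is a minimal Python sketch. It is not the networking-ovn code: `HashRingNode`, `is_alive` and `live_nodes_after_startup` are hypothetical names, and the model simply assumes that the nodes are registered at t=0 and that nothing touches them until the pre-fork connection has finished. Only the 60-second window comes from this report.

```python
from dataclasses import dataclass

# Liveness window quoted in the report: a node untouched for 60 s is dead.
HASH_RING_NODE_TIMEOUT = 60  # seconds


@dataclass
class HashRingNode:
    """Hypothetical stand-in for one API worker's hash ring entry."""
    name: str
    updated_at: float  # time of the last touch, in seconds


def is_alive(node, now, timeout=HASH_RING_NODE_TIMEOUT):
    """A node is alive only if it was touched within the last `timeout` seconds."""
    return (now - node.updated_at) <= timeout


def live_nodes_after_startup(connect_seconds, workers=("api-0", "api-1")):
    """Return the API worker nodes still alive once the connection finishes.

    Assumption of this sketch: nodes are registered at t=0 and the first
    touch can only happen after the maintenance worker starts, which is
    after the connection completes, so `updated_at` is never refreshed.
    """
    nodes = [HashRingNode(name=w, updated_at=0.0) for w in workers]
    now = float(connect_seconds)
    return [n.name for n in nodes if is_alive(n, now)]


print(live_nodes_after_startup(20))  # fast connect: both nodes are still alive
print(live_nodes_after_startup(90))  # slow connect: no live nodes are left
```

With a 20-second connect both nodes are still inside the window; with a 90-second connect every node has aged out before anything could touch it, which matches the "considered dead" state described above.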

This is a regression caused by https://review.opendev.org/c/openstack/networking-ovn/+/812304

Neutron shouldn't be configuring other services.

Version-Release number of selected component (if applicable):
16.1.8

How reproducible:
Always

Steps to Reproduce:
1. Start neutron server

Actual results:


Expected results:


Additional info:

Comment 19 errata-xmlrpc 2022-12-07 20:27:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 16.1.9 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:8795