Bug 2134365 - [OVN] Hash Ring nodes considered dead because of delayed probing
Summary: [OVN] Hash Ring nodes considered dead because of delayed probing
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-networking-ovn
Version: 16.2 (Train)
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: z5
: 16.2 (Train on RHEL 8.4)
Assignee: Lucas Alvares Gomes
QA Contact: Vadim Khitrin
URL:
Whiteboard:
Depends On: 2134364
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-10-13 09:20 UTC by Lucas Alvares Gomes
Modified: 2023-10-06 10:32 UTC (History)
8 users (show)

Fixed In Version: python-networking-ovn-7.4.2-2.20220409154864.el8ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 2134364
Environment:
Last Closed: 2023-10-06 10:32:22 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker OSP-19365 0 None None None 2022-10-13 09:33:46 UTC

Description Lucas Alvares Gomes 2022-10-13 09:20:20 UTC
+++ This bug was initially created as a clone of Bug #2134364 +++

Right now, probing the hash ring nodes happens as part of the maintenance task thread but this thread is only started after we establish a connection with the OVSDB servers in the post_fork_initialize() method for ML2/OVN.

If this connection with OVSDB takes longer than expected, it's possible that the nodes in the hash ring (that have to be added prior to this connection) will time out because the maintenance task thread has not yet being started.

Ideally, we would need to separate the probing to it's own periodic thread that is started before the connections with the OVSDBs to avoid this problem.

This BZ is related to the initial work done at: https://bugzilla.redhat.com/show_bug.cgi?id=2125828

Comment 2 Lon Hohberger 2023-04-21 10:39:30 UTC
According to our records, this should be resolved by python-networking-ovn-7.4.2-2.20220409154865.el8ost.  This build is available now.

Comment 3 Vadim Khitrin 2023-10-05 10:55:04 UTC
Verified on compose `RHOS-16.2-RHEL-8-20230926.n.1`:
```
[root@controller-0 containers]# grep -iR 'Hash Ring probing thread has started' /var/log/containers/
/var/log/containers/neutron/server.log.1:2023-10-04 08:09:16.400 16 INFO networking_ovn.ml2.mech_driver [-] Hash Ring probing thread has started
/var/log/containers/neutron/server.log.1:2023-10-04 10:23:43.229 16 INFO networking_ovn.ml2.mech_driver [-] Hash Ring probing thread has started
[root@controller-0 containers]# grep -iR 'Maintenance task thread has started' /var/log/containers/
/var/log/containers/neutron/server.log.1:2023-10-04 08:09:16.972 22 INFO networking_ovn.ml2.mech_driver [-] Maintenance task thread has started
/var/log/containers/neutron/server.log.1:2023-10-04 10:23:43.748 21 INFO networking_ovn.ml2.mech_driver [-] Maintenance task thread has started
```


Note You need to log in before you can comment on or make changes to this bug.