Bug 2018276

Summary: Routes are inaccessible from Infra nodes which holds the ingress VIP
Product: OpenShift Container Platform Reporter: Ashish Sharma <ashsharm>
Component: NetworkingAssignee: Tim Rozet <trozet>
Networking sub component: ovn-kubernetes QA Contact: Anurag saxena <anusaxen>
Status: CLOSED DUPLICATE Docs Contact:
Severity: urgent    
Priority: urgent CC: akaiser, emahoney, openshift-bugs-escalate, pdiak, skanakal, trozet
Version: 4.8   
Target Milestone: ---   
Target Release: ---   
Hardware: other   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-11-01 14:14:19 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 10 Tim Rozet 2021-11-01 14:14:19 UTC
It looks like this is the same as https://bugzilla.redhat.com/show_bug.cgi?id=2009873

Basically what happens is ovnkube-node subscribes to netlink for address updates on the host. Once we see these updates, we update our annotation and routes. The node where the VIP lives requires this route to be able to reach the VIP. It looks like after some time the netlink subscription may fail, and ovnkube-node is not re-subscribing. Essentially ovnkube-node stops receiving events. By restarting ovnkube-node it restarts the netlink subscription, so things work until another error is received. The fix will be to check if there are any errors or if the netlink subscription goes down and then resubscribe.

*** This bug has been marked as a duplicate of bug 2009873 ***