Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

The FDP team is no longer accepting new bugs in Bugzilla. Please report your issues under FDP project in Jira. Thanks.

Bug 1871859

Summary:	[RFE] Smart Load Balancers with OVN-Controller
Product:	Red Hat Enterprise Linux Fast Datapath	Reporter:	Tim Rozet <trozet>
Component:	OVN	Assignee:	OVN Team <ovnteam>
Status:	CLOSED WONTFIX	QA Contact:	Jianlin Shi <jishi>
Severity:	unspecified	Docs Contact:
Priority:	medium
Version:	RHEL 8.0	CC:	ctrautma, djuran, mmichels
Target Milestone:	---
Target Release:	---
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2023-10-05 20:53:36 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Tim Rozet 2020-08-24 13:26:20 UTC

Description of problem:
Today in ovn-k8s we create a load balancer for all services across all switches. In OpenShift some services will create endpoints on every single node (like coreDNS). As we scale out a cluster to say several hundred nodes, it means every time a pod makes a DNS query it could potentially hit any pod endpoint on any node. This becomes quite inefficient and creates a lot of east<->west traffic when a DNS endpoint resides on every node.

When a load balancer is rendered by ovn-controller it creates a flow, with action to an openflow group. This group contains every possible endpoint, and one is chosen by via packet hash. OVN controller is also aware of the ports that are attached to the OVS it is managing. When ovn-controller goes to create the openflow group entries, it could check for what endpoints are local to its switch, and then give those endpoints higher weight. This will ensure that those endpoints are used more often for pods that access the load balancer local to the node.

By making it more probable for local load balancer traffic to resolve local to the node, we can greatly reduce the amount of service east<->west traffic.

Comment 1 Antonio Ojea 2021-02-09 15:20:57 UTC

I think that is better to try to make this change compatible with Kubernetes Services Topologies feature, where the loadbalancer can choose between different endpoints depending if they are local or in the same cloud zone.

Comment 2 Tim Rozet 2021-02-10 22:55:18 UTC

For kubernetes "local" traffic policy, we can just simply add a single local endpoint per GR load balancer (since GR load balancers are per node). But to satisfy local traffic policy requirement that traffic must not be SNAT'ed we need: https://bugzilla.redhat.com/show_bug.cgi?id=1927540

Comment 5 Mark Michelson 2023-10-05 20:53:36 UTC

This can be closed since this was worked around in ovn-kubernetes. Tim confirmed this with me.