Bug 2108697

Summary: OCP 4.11 AWS excessively high etcd resources with OVN-Kubernetes on large cluster-density tests
Product: OpenShift Container Platform Reporter: Andrew Collins <ancollin>
Component: NetworkingAssignee: Jaime CaamaƱo Ruiz <jcaamano>
Networking sub component: ovn-kubernetes QA Contact: Anurag saxena <anusaxen>
Status: CLOSED DUPLICATE Docs Contact:
Severity: unspecified    
Priority: unspecified CC: jhopper, msheth, rravaiol, rsevilla, surya, wking
Version: 4.11   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: perfscale-ovn
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-30 09:22:54 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Andrew Collins 2022-07-19 17:50:53 UTC
Description of problem: 
On an OpenShift 4.11.0-rc.1 configured with OVN-Kubernetes, we observed significant regression in the etcd CPU and Memory usage from 4.10 to 4.11.

When compared to Openshift SDN, etcd resources grow at a significantly higher rate than an OpenShift SDN cluster installed with the same version.

Version-Release number of selected component (if applicable):
4.11.0-rc.1

How reproducible:
100%


Steps to Reproduce:
1. Install a 4.11.0-rc.1 cluster with r5.4xlarge masters
2. Scale cluster to 252 nodes
3. Apply cluster-density e2e-benchmarking workload with 4000 iterations 

Actual results:
etcd CPU usage maxes out at 1.73 (of 16) cores, and 7.16 GiB (of 124GiB)


Expected results:
etcd resource consumption scales closer to OpenShiftSDN at this scale, which is 0.088 out of 16 cores and 1.69GiB of 124GiB.

Comment 3 Raul Sevilla 2022-08-30 09:22:54 UTC

*** This bug has been marked as a duplicate of bug 2108679 ***