Bug 2095485 - Excessive use of list operations in TALO
Summary: Excessive use of list operations in TALO
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Telco Edge
Version: 4.10
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 4.10.z
Assignee: jun
QA Contact: yliu1
URL:
Whiteboard:
Depends On: 2091113
Blocks:
Reported: 2022-06-09 18:43 UTC by OpenShift BugZilla Robot
Modified: 2022-07-11 15:28 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-07-11 15:28:29 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | openshift-kni cluster-group-upgrades-operator pull 215 | 0 | None | open | [release-4.10] Bug 2095485: Query child resources only when new ones are being created | 2022-06-09 18:46:29 UTC
Red Hat Product Errata | RHBA-2022:5514 | 0 | None | None | None | 2022-07-11 15:28:42 UTC

Description OpenShift BugZilla Robot 2022-06-09 18:43:24 UTC
+++ This bug was initially created as a clone of Bug #2091113 +++

Description of problem:
Currently, TALO performs three list operations on every reconcile call just to update the status fields that track the ACM resources it created. Although these are namespaced queries, they can be slow in large-scale ZTP scenarios where a large number of CGUs share the same namespace. More details on the impact can be found at https://issues.redhat.com/browse/CNF-5408

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:
TALO should perform these list operations only when it is creating new ACM objects.

Additional info:
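The difference between the current and the expected behaviour can be sketched roughly as follows. This is only an illustration, not TALO's actual code (TALO is a Go operator). `FakeClient`, `Status`, `CHILD_KINDS`, `reconcile_always` and `reconcile_on_create` are all hypothetical names, and the three kind names are placeholders because the report does not say which resources are listed.

```python
from dataclasses import dataclass, field

# Placeholder names for the three child resource types (assumption:
# the report only says that three list operations are performed).
CHILD_KINDS = ("ChildKindA", "ChildKindB", "ChildKindC")


@dataclass
class FakeClient:
    """Stand-in for an API client; counts namespaced list calls."""
    objects: dict = field(default_factory=dict)  # kind -> list of names
    list_calls: int = 0

    def list(self, kind):
        self.list_calls += 1
        return list(self.objects.get(kind, []))

    def create(self, kind, name):
        self.objects.setdefault(kind, []).append(name)


@dataclass
class Status:
    """Hypothetical status block tracking the created child resources."""
    children: dict = field(default_factory=dict)


def refresh_status(client, status):
    # One list call per child kind, i.e. three list operations in total.
    for kind in CHILD_KINDS:
        status.children[kind] = client.list(kind)


def reconcile_always(client, status, to_create):
    """Behaviour described in the report: refresh on every reconcile."""
    for kind, name in to_create:
        client.create(kind, name)
    refresh_status(client, status)


def reconcile_on_create(client, status, to_create):
    """Expected behaviour: refresh only when something new was created."""
    for kind, name in to_create:
        client.create(kind, name)
    if to_create:
        refresh_status(client, status)
```

In this toy model, one reconcile that creates an object followed by nine reconciles that create nothing costs 30 list calls with `reconcile_always` and 3 with `reconcile_on_create`, and the tracked status is the same in both cases.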

Comment 2 yliu1 2022-07-04 18:10:32 UTC
This change is transparent to the end user. Marking as verified based only on regression test results with the latest 4.10 TALM downstream build.

Comment 3 jun 2022-07-04 18:15:17 UTC
This has been verified in the scale lab. With the fix, we can sustain 480 clusters per hour (40 clusters every 5 minutes) for ZTP deployment with the DU profile.

Comment 6 errata-xmlrpc 2022-07-11 15:28:29 UTC
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.10.22 extras update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:5514

