Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2080314

Summary: Constant reconcile error from TALO when processing upgrades with max concurrency 10
Product: OpenShift Container Platform Reporter: jun
Component: Telco EdgeAssignee: jun
Telco Edge sub component: TALO QA Contact: yliu1
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: unspecified CC: akrzos, ijolliff, imiller, jun, keyoung
Version: 4.10   
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of:
: 2081570 (view as bug list) Environment:
Last Closed: 2022-08-18 04:08:08 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2081570    

Description jun 2022-04-29 12:58:49 UTC
Description of problem:
Current implementation continuously updates the same placement rule in a loop, once per cluster. With batch size higher than 5, this error almost always happens:


2022-04-29T03:29:06.781Z	INFO	controllers.ClusterGroupUpgrade	[getNextRemediationPoliciesForBatch]	{"isBatchComplete": false}
2022-04-29T03:29:06.781Z	INFO	controllers.ClusterGroupUpgrade	[getNextRemediationPoliciesForBatch]	{"plan": {"sno00009":0,"sno00017":2,"sno00021":2,"sno00023":2,"sno00025":1,"sno00041":2,"sno00080":2,"sno00087":2,"sno00095":2,"sno00099":2}}
2022-04-29T03:29:06.813Z	ERROR	controller-runtime.manager.controller.clustergroupupgrade	Reconciler error	{"reconciler group": "ran.openshift.io", "reconciler kind": "ClusterGroupUpgrade", "name": "group-upgrade", "namespace": "ztp-upgrade", "error": "Operation cannot be fulfilled on placementrules.apps.open-cluster-management.io \"group-upgrade-du-upgrade-platform-upgrade-status\": the object has been modified; please apply your changes to the latest version and try again"}


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Start an upgrade with more than 5 clusters in one batch
2.
3.

Actual results:
Reconcile error on placement rule update, no progress can be made

Expected results:
Should work with much higher max concurrency

Additional info:

Comment 2 jun 2022-05-04 20:59:40 UTC
Change to verified to unblock backport

Comment 6 errata-xmlrpc 2022-08-18 04:08:08 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.11 CNF vRAN extras update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2022:6110