Bug 2080314
| Summary: | Constant reconcile error from TALO when processing upgrades with max concurrency 10 | |||
|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | jun | |
| Component: | Telco Edge | Assignee: | jun | |
| Telco Edge sub component: | TALO | QA Contact: | yliu1 | |
| Status: | CLOSED ERRATA | Docs Contact: | ||
| Severity: | high | |||
| Priority: | unspecified | CC: | akrzos, ijolliff, imiller, jun, keyoung | |
| Version: | 4.10 | |||
| Target Milestone: | --- | |||
| Target Release: | 4.11.0 | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | No Doc Update | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 2081570 (view as bug list) | Environment: | ||
| Last Closed: | 2022-08-18 04:08:08 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 2081570 | |||
Change to verified to unblock backport Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.11 CNF vRAN extras update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2022:6110 |
Description of problem: Current implementation continuously updates the same placement rule in a loop, once per cluster. With batch size higher than 5, this error almost always happens: 2022-04-29T03:29:06.781Z INFO controllers.ClusterGroupUpgrade [getNextRemediationPoliciesForBatch] {"isBatchComplete": false} 2022-04-29T03:29:06.781Z INFO controllers.ClusterGroupUpgrade [getNextRemediationPoliciesForBatch] {"plan": {"sno00009":0,"sno00017":2,"sno00021":2,"sno00023":2,"sno00025":1,"sno00041":2,"sno00080":2,"sno00087":2,"sno00095":2,"sno00099":2}} 2022-04-29T03:29:06.813Z ERROR controller-runtime.manager.controller.clustergroupupgrade Reconciler error {"reconciler group": "ran.openshift.io", "reconciler kind": "ClusterGroupUpgrade", "name": "group-upgrade", "namespace": "ztp-upgrade", "error": "Operation cannot be fulfilled on placementrules.apps.open-cluster-management.io \"group-upgrade-du-upgrade-platform-upgrade-status\": the object has been modified; please apply your changes to the latest version and try again"} Version-Release number of selected component (if applicable): How reproducible: Steps to Reproduce: 1. Start an upgrade with more than 5 clusters in one batch 2. 3. Actual results: Reconcile error on placement rule update, no progress can be made Expected results: Should work with much higher max concurrency Additional info: