Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2091616

Summary: Unable to start upgrade with backup enabled with "Reconciler error"
Product: OpenShift Container Platform Reporter: yliu1
Component: Telco Edge Assignee: sahasan
Telco Edge sub component: TALO QA Contact: Joshua Clark <josclark>
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: unspecified CC: ijolliff, jun, keyoung, nparekh, sahasan
Version: 4.11   
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-18 04:08:08 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2092393    

Description yliu1 2022-05-30 13:48:17 UTC
Description of problem:
Starting an upgrade with backup enabled fails with a "Reconciler error", and the CGU does not start.

2022-05-27T19:19:18.770Z	ERROR	controller-runtime.manager.controller.clustergroupupgrade	Reconciler error	{"reconciler group": "ran.openshift.io", "reconciler kind": "ClusterGroupUpgrade", "name": "upgradetest1", "namespace": "default", "error": "ManagedClusterAction in version \"v1beta1\" cannot be handled as a ManagedClusterAction: v1.ObjectMeta.Labels: ReadString: expects \" or n, but found t, error found in #10 byte of ...|-backup\":true},\"name|..., bigger context ...|{\"labels\":{\"velero.io/exclude-from-backup\":true},\"name\":\"backup-ns-delete\",\"namespace\":\"helix2|..."}
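
The error above indicates that the label velero.io/exclude-from-backup was sent with the unquoted JSON boolean true, while Kubernetes label values must be strings. The following is a minimal Python sketch of a check that would flag such a manifest. It is not TALO code: the function name and the sample manifests are hypothetical, reconstructed only from the fields visible in the log (the truncated namespace is left out), and Python's json module stands in for the Go decoder that produced the message.

```python
import json


def non_string_labels(manifest_json: str) -> dict:
    """Return the metadata.labels entries whose values are not JSON strings."""
    metadata = json.loads(manifest_json).get("metadata", {})
    labels = metadata.get("labels") or {}
    return {key: value for key, value in labels.items() if not isinstance(value, str)}


# Hypothetical manifest fragment mirroring the log: the label value is a bare boolean.
bad_manifest = (
    '{"metadata": {"labels": {"velero.io/exclude-from-backup": true},'
    ' "name": "backup-ns-delete"}}'
)

# Same fragment with the value quoted, which is what a string-typed label map expects.
good_manifest = (
    '{"metadata": {"labels": {"velero.io/exclude-from-backup": "true"},'
    ' "name": "backup-ns-delete"}}'
)

if __name__ == "__main__":
    print("bad manifest offenders:", non_string_labels(bad_manifest))
    print("good manifest offenders:", non_string_labels(good_manifest))
```

Under this assumption, quoting the value ("true" instead of true) in whatever template generates the ManagedClusterAction would let the object decode.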


Version-Release number of selected component (if applicable):
4.11

How reproducible:
Always

Steps to Reproduce:
1. Deploy hub cluster with ZTP and TALO
2. Create cluster version policy via PGT
3. Create CGU using the cluster version policy with backup enabled

Actual results:
CGU does not start 
status:
  computedMaxConcurrency: 1
  status: {}

The following error appears in the upgrade pod logs:

2022-05-27T19:19:18.770Z	ERROR	controller-runtime.manager.controller.clustergroupupgrade	Reconciler error	{"reconciler group": "ran.openshift.io", "reconciler kind": "ClusterGroupUpgrade", "name": "upgradetest1", "namespace": "default", "error": "ManagedClusterAction in version \"v1beta1\" cannot be handled as a ManagedClusterAction: v1.ObjectMeta.Labels: ReadString: expects \" or n, but found t, error found in #10 byte of ...|-backup\":true},\"name|..., bigger context ...|{\"labels\":{\"velero.io/exclude-from-backup\":true},\"name\":\"backup-ns-delete\",\"namespace\":\"helix2|..."}

Expected results:
Backup started

Additional info:
CGU spec and status:

status:
  computedMaxConcurrency: 1
  status: {}


spec:
  actions:
    afterCompletion:
      deleteObjects: true
    beforeEnable: {}
  backup: true
  clusters:
  - helix21-1
  enable: true
  managedPolicies:
  - du-upgrade-cluster-version-policy1
  preCaching: false
  remediationStrategy:
    maxConcurrency: 10
    timeout: 240

Comment 1 Joshua Clark 2022-07-07 01:15:29 UTC
QA verified in 4.11 testing.

Comment 4 errata-xmlrpc 2022-08-18 04:08:08 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.11 CNF vRAN extras update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2022:6110