Bug 2062462

Summary: Upgrade to 2.5 hang due to irreconcilable errors of grc-sub and search-prod-sub in MCH
Product: Red Hat Advanced Cluster Management for Kubernetes Reporter: Thuy Nguyen <thnguyen>
Component: Server Foundation Assignee: Jian Qiu <jqiu>
Status: CLOSED ERRATA QA Contact: Song Lai <slai>
Severity: high Docs Contact: Christopher Dawson <cdawson>
Priority: unspecified    
Version: rhacm-2.5 CC: dhuynh, huichen, yliu1
Target Milestone: --- Flags: bot-tracker-sync: rhacm-2.5+
Target Release: rhacm-2.5   
Hardware: x86_64   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-06-09 02:09:24 UTC Type: Bug

Description Thuy Nguyen 2022-03-09 19:47:14 UTC
Description of the problem: Upgrade to 2.5 hangs due to Irreconcilable errors on grc-sub and search-prod-sub in MCH

Release version:

Operator snapshot version:
- ACM 2.5.0-DOWNSTREAM-2022-03-03-03-15-27 (Final S3)
- MCE 2.0.0-DOWNANDBACK-2022-03-07-12-37-51

OCP version: 4.8.31

Browser Info:

Steps to reproduce:
1. Install ACM 2.4.2
2. Upgrade to ACM 2.5

Actual results:
MCH is in 'Updating' state

Expected results:

Additional info:

oc get mch
NAME              STATUS     AGE
multiclusterhub   Updating   126m


oc get mch multiclusterhub -oyaml
apiVersion: operator.open-cluster-management.io/v1
kind: MultiClusterHub
metadata:
  annotations:
    installer.open-cluster-management.io/mce-subscription-spec: '{"channel": "stable-2.0","installPlanApproval":
      "Automatic","name": "multicluster-engine","source": "brew-mce-iib","sourceNamespace":
      "openshift-marketplace","startingCSV": "multicluster-engine.v2.0.0"}'
    kubectl.kubernetes.io/last-applied-configuration: |
      {"apiVersion":"operator.open-cluster-management.io/v1","kind":"MultiClusterHub","metadata":{"annotations":{"mch-imageRepository":"brew.registry.redhat.io/rh-osbs/rhacm2"},"name":"multiclusterhub","namespace":"ocm"},"spec":{"customCAConfigmap":"custom-ca","enableClusterBackup":true,"imagePullSecret":"multiclusterhub-operator-pull-secret"}}
    mch-imageRepository: brew.registry.redhat.io/rh-osbs/rhacm2
  creationTimestamp: "2022-03-09T17:24:06Z"
  finalizers:
  - finalizer.operator.open-cluster-management.io
  generation: 3
  name: multiclusterhub
  namespace: ocm
  resourceVersion: "22120299"
  uid: 25e1ab41-8169-422a-b86b-fe2ad36e3413
spec:
  availabilityConfig: High
  customCAConfigmap: custom-ca
  enableClusterBackup: true
  imagePullSecret: multiclusterhub-operator-pull-secret
  ingress:
    sslCiphers:
    - ECDHE-ECDSA-AES256-GCM-SHA384
    - ECDHE-RSA-AES256-GCM-SHA384
    - ECDHE-ECDSA-AES128-GCM-SHA256
    - ECDHE-RSA-AES128-GCM-SHA256
  separateCertificateManagement: false
status:
  components:
    application-chart-sub:
      lastTransitionTime: "2022-03-09T17:24:57Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    cluster-backup-chart-sub:
      lastTransitionTime: "2022-03-09T17:25:04Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    cluster-lifecycle-sub:
      lastTransitionTime: "2022-03-09T17:25:05Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    console-chart-sub:
      lastTransitionTime: "2022-03-09T17:24:59Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    grc-sub:
      lastTransitionTime: "2022-03-09T19:30:38Z"
      message: 'failed to get candidate release: rendered manifests contain a resource
        that already exists. Unable to continue with update: ClusterManagementAddOn
        "cert-policy-controller" in namespace "" exists and cannot be imported into
        the current release: invalid ownership metadata; label validation error: missing
        key "app.kubernetes.io/managed-by": must be set to "Helm"; annotation validation
        error: missing key "meta.helm.sh/release-name": must be set to "grc-6caf0";
        annotation validation error: missing key "meta.helm.sh/release-namespace":
        must be set to "ocm"'
      reason: ReconcileError
      status: "True"
      type: Irreconcilable
    local-cluster:
      lastTransitionTime: "2022-03-09T18:57:52Z"
      message: No conditions available
      reason: No conditions available
      status: Unknown
      type: Unknown
    management-ingress-sub:
      lastTransitionTime: "2022-03-09T17:25:00Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    multicluster-engine:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      reason: ComponentsAvailable
      status: "True"
      type: Available
    multicluster-engine-csv:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      message: install strategy completed with no errors
      reason: InstallSucceeded
      status: "True"
    multicluster-engine-sub:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      message: 'installPlanApproval: Automatic. installPlan: multicluster-engine/install-m96zv'
      reason: AtLatestKnown
      status: "True"
    multiclusterhub-repo:
      lastTransitionTime: "2022-03-09T18:47:27Z"
      reason: MinimumReplicasAvailable
      status: "True"
      type: Available
    policyreport-sub:
      lastTransitionTime: "2022-03-09T17:24:59Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    search-prod-sub:
      lastTransitionTime: "2022-03-09T19:30:38Z"
      message: 'rendered manifests contain a resource that already exists. Unable
        to continue with update: ClusterManagementAddOn "search-collector" in namespace
        "" exists and cannot be imported into the current release: invalid ownership
        metadata; label validation error: missing key "app.kubernetes.io/managed-by":
        must be set to "Helm"; annotation validation error: missing key "meta.helm.sh/release-name":
        must be set to "search-prod-970f9"; annotation validation error: missing key
        "meta.helm.sh/release-namespace": must be set to "ocm"'
      reason: UpgradeError
      status: "True"
      type: ReleaseFailed
  conditions:
  - lastTransitionTime: "2022-03-09T18:58:56Z"
    lastUpdateTime: "2022-03-09T18:58:56Z"
    message: Not all hub components ready.
    reason: ComponentsUnavailable
    status: "False"
    type: Complete
  - lastTransitionTime: "2022-03-09T17:24:07Z"
    lastUpdateTime: "2022-03-09T18:59:59Z"
    message: Created new resource
    reason: NewResourceCreated
    status: "True"
    type: Progressing
  currentVersion: 2.4.2
  desiredVersion: 2.5.0
  phase: Updating
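
The grc-sub and search-prod-sub messages above both name the same three missing ownership keys on an existing ClusterManagementAddOn. As a rough illustration only, the check they describe can be sketched as below. This is not Helm's code and not the fix for this bug; the function name and the sample metadata dict are hypothetical, and only the three key names and the expected values are taken from the messages.

```python
# Sketch only: mirrors the three ownership checks quoted in the grc-sub and
# search-prod-sub messages. Names in this block are hypothetical.

MANAGED_BY_LABEL = "app.kubernetes.io/managed-by"
RELEASE_NAME_ANNOTATION = "meta.helm.sh/release-name"
RELEASE_NAMESPACE_ANNOTATION = "meta.helm.sh/release-namespace"


def helm_ownership_problems(metadata, release_name, release_namespace):
    """Return a list of ownership problems for one resource's metadata dict."""
    labels = metadata.get("labels") or {}
    annotations = metadata.get("annotations") or {}
    expected = [
        ("label", labels, MANAGED_BY_LABEL, "Helm"),
        ("annotation", annotations, RELEASE_NAME_ANNOTATION, release_name),
        ("annotation", annotations, RELEASE_NAMESPACE_ANNOTATION, release_namespace),
    ]
    problems = []
    for kind, mapping, key, want in expected:
        if key not in mapping:
            problems.append(f'{kind} missing key "{key}": must be set to "{want}"')
        elif mapping[key] != want:
            problems.append(f'{kind} key "{key}" is "{mapping[key]}", expected "{want}"')
    return problems


# Example: a resource with no Helm labels or annotations, which is the
# state the error messages describe for "cert-policy-controller".
unowned = {"name": "cert-policy-controller"}
for problem in helm_ownership_problems(unowned, "grc-6caf0", "ocm"):
    print(problem)
```

The metadata dict could be taken from the "metadata" field of something like "oc get clustermanagementaddon cert-policy-controller -o json"; that step is an assumption and is not part of the sketch.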


oc get managedclusteraddon -n local-cluster
NAME                          AVAILABLE   DEGRADED   PROGRESSING
application-manager           True                   False
cert-policy-controller        False                  False
config-policy-controller      False
governance-policy-framework   Unknown
iam-policy-controller         False                  False
work-manager                  True                   False

Comment 2 bot-tracker-sync 2022-03-10 10:18:28 UTC
G2Bsync 1063803780 comment 
 zhiweiyin318 Thu, 10 Mar 2022 08:40:46 UTC 
 G2Bsync
This happens because the klusterlet add-ons were refactored in 2.5.
We have a story, https://github.com/stolostron/backlog/issues/20393, to handle the klusterlet-addon upgrade from 2.4 to 2.5 in sprint 4.

Comment 3 bot-tracker-sync 2022-03-16 08:47:42 UTC
G2Bsync 1068804094 comment 
 zhiweiyin318 Wed, 16 Mar 2022 07:07:32 UTC 
 G2Bsync
PR https://github.com/stolostron/klusterlet-addon-controller/pull/106 has been merged. Please verify using the latest snapshot.

Comment 4 bot-tracker-sync 2022-03-23 17:20:06 UTC
G2Bsync 1076521287 comment 
 thuyn-581 Wed, 23 Mar 2022 16:10:56 UTC 
 G2Bsync -
Validated on 2.5.0-DOWNSTREAM-2022-03-22-01-58-08.

Comment 7 errata-xmlrpc 2022-06-09 02:09:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Advanced Cluster Management 2.5 security updates, images, and bug fixes), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:4956
