Bug 2062462 - Upgrade to 2.5 hang due to irreconcilable errors of grc-sub and search-prod-sub in MCH
Summary: Upgrade to 2.5 hang due to irreconcilable errors of grc-sub and search-prod-s...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Advanced Cluster Management for Kubernetes
Classification: Red Hat
Component: Server Foundation
Version: rhacm-2.5
Hardware: x86_64
OS: Unspecified
unspecified
high
Target Milestone: ---
: rhacm-2.5
Assignee: Jian Qiu
QA Contact: Song Lai
Christopher Dawson
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-03-09 19:47 UTC by Thuy Nguyen
Modified: 2023-05-16 04:20 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-06-09 02:09:24 UTC
Target Upstream Version:
Embargoed:
bot-tracker-sync: rhacm-2.5+


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github stolostron backlog issues 20564 0 None None None 2022-03-09 22:47:10 UTC
Red Hat Product Errata RHSA-2022:4956 0 None None None 2022-06-09 02:09:33 UTC

Description Thuy Nguyen 2022-03-09 19:47:14 UTC
Description of the problem: Upgrade to 2.5 hang due to Irreconcilable errors of grc-sub and search-prod-sub in MCH

Release version:

Operator snapshot version:
- ACM 2.5.0-DOWNSTREAM-2022-03-03-03-15-27 (Final S3)
- MCE 2.0.0-DOWNANDBACK-2022-03-07-12-37-51

OCP version: 4.8.31

Browser Info:

Steps to reproduce:
1. Install ACM 2.4.2
2. Upgrade to ACM 2.5

Actual results:
MCH is in 'Updating' state

Expected results:

Additional info:

oc get mch
NAME              STATUS     AGE
multiclusterhub   Updating   126m


oc get mch multiclusterhub -oyaml
apiVersion: operator.open-cluster-management.io/v1
kind: MultiClusterHub
metadata:
  annotations:
    installer.open-cluster-management.io/mce-subscription-spec: '{“channel": "stable-2.0","installPlanApproval":
      "Automatic","name": "multicluster-engine","source": "brew-mce-iib","sourceNamespace":
      "openshift-marketplace","startingCSV": "multicluster-engine.v2.0.0"}'
    kubectl.kubernetes.io/last-applied-configuration: |
      {"apiVersion":"operator.open-cluster-management.io/v1","kind":"MultiClusterHub","metadata":{"annotations":{"mch-imageRepository":"brew.registry.redhat.io/rh-osbs/rhacm2"},"name":"multiclusterhub","namespace":"ocm"},"spec":{"customCAConfigmap":"custom-ca","enableClusterBackup":true,"imagePullSecret":"multiclusterhub-operator-pull-secret"}}
    mch-imageRepository: brew.registry.redhat.io/rh-osbs/rhacm2
  creationTimestamp: "2022-03-09T17:24:06Z"
  finalizers:
  - finalizer.operator.open-cluster-management.io
  generation: 3
  name: multiclusterhub
  namespace: ocm
  resourceVersion: "22120299"
  uid: 25e1ab41-8169-422a-b86b-fe2ad36e3413
spec:
  availabilityConfig: High
  customCAConfigmap: custom-ca
  enableClusterBackup: true
  imagePullSecret: multiclusterhub-operator-pull-secret
  ingress:
    sslCiphers:
    - ECDHE-ECDSA-AES256-GCM-SHA384
    - ECDHE-RSA-AES256-GCM-SHA384
    - ECDHE-ECDSA-AES128-GCM-SHA256
    - ECDHE-RSA-AES128-GCM-SHA256
  separateCertificateManagement: false
status:
  components:
    application-chart-sub:
      lastTransitionTime: "2022-03-09T17:24:57Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    cluster-backup-chart-sub:
      lastTransitionTime: "2022-03-09T17:25:04Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    cluster-lifecycle-sub:
      lastTransitionTime: "2022-03-09T17:25:05Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    console-chart-sub:
      lastTransitionTime: "2022-03-09T17:24:59Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    grc-sub:
      lastTransitionTime: "2022-03-09T19:30:38Z"
      message: 'failed to get candidate release: rendered manifests contain a resource
        that already exists. Unable to continue with update: ClusterManagementAddOn
        "cert-policy-controller" in namespace "" exists and cannot be imported into
        the current release: invalid ownership metadata; label validation error: missing
        key "app.kubernetes.io/managed-by": must be set to "Helm"; annotation validation
        error: missing key "meta.helm.sh/release-name": must be set to "grc-6caf0";
        annotation validation error: missing key "meta.helm.sh/release-namespace":
        must be set to "ocm"'
      reason: ReconcileError
      status: "True"
      type: Irreconcilable
    local-cluster:
      lastTransitionTime: "2022-03-09T18:57:52Z"
      message: No conditions available
      reason: No conditions available
      status: Unknown
      type: Unknown
    management-ingress-sub:
      lastTransitionTime: "2022-03-09T17:25:00Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    multicluster-engine:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      reason: ComponentsAvailable
      status: "True"
      type: Available
    multicluster-engine-csv:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      message: install strategy completed with no errors
      reason: InstallSucceeded
      status: "True"
    multicluster-engine-sub:
      lastTransitionTime: "2022-03-09T19:31:37Z"
      message: 'installPlanApproval: Automatic. installPlan: multicluster-engine/install-m96zv'
      reason: AtLatestKnown
      status: "True"
    multiclusterhub-repo:
      lastTransitionTime: "2022-03-09T18:47:27Z"
      reason: MinimumReplicasAvailable
      status: "True"
      type: Available
    policyreport-sub:
      lastTransitionTime: "2022-03-09T17:24:59Z"
      reason: UpgradeSuccessful
      status: "True"
      type: Deployed
    search-prod-sub:
      lastTransitionTime: "2022-03-09T19:30:38Z"
      message: 'rendered manifests contain a resource that already exists. Unable
        to continue with update: ClusterManagementAddOn "search-collector" in namespace
        "" exists and cannot be imported into the current release: invalid ownership
        metadata; label validation error: missing key "app.kubernetes.io/managed-by":
        must be set to "Helm"; annotation validation error: missing key "meta.helm.sh/release-name":
        must be set to "search-prod-970f9"; annotation validation error: missing key
        "meta.helm.sh/release-namespace": must be set to "ocm"'
      reason: UpgradeError
      status: "True"
      type: ReleaseFailed
  conditions:
  - lastTransitionTime: "2022-03-09T18:58:56Z"
    lastUpdateTime: "2022-03-09T18:58:56Z"
    message: Not all hub components ready.
    reason: ComponentsUnavailable
    status: "False"
    type: Complete
  - lastTransitionTime: "2022-03-09T17:24:07Z"
    lastUpdateTime: "2022-03-09T18:59:59Z"
    message: Created new resource
    reason: NewResourceCreated
    status: "True"
    type: Progressing
  currentVersion: 2.4.2
  desiredVersion: 2.5.0
  phase: Updating


oc get managedclusteraddon -n local-cluster
NAME                          AVAILABLE   DEGRADED   PROGRESSING
application-manager           True                   False
cert-policy-controller        False                  False
config-policy-controller      False
governance-policy-framework   Unknown
iam-policy-controller         False                  False
work-manager                  True                   False

Comment 2 bot-tracker-sync 2022-03-10 10:18:28 UTC
G2Bsync 1063803780 comment 
 zhiweiyin318 Thu, 10 Mar 2022 08:40:46 UTC 
 G2Bsync
because the klusterlet-addons are refactored in 2.5. 
we have a story https://github.com/stolostron/backlog/issues/20393 to handle klusterlet-addon upgrade from 2.4 to 2.5 in sprint 4.

Comment 3 bot-tracker-sync 2022-03-16 08:47:42 UTC
G2Bsync 1068804094 comment 
 zhiweiyin318 Wed, 16 Mar 2022 07:07:32 UTC 
 G2Bsync
PR https://github.com/stolostron/klusterlet-addon-controller/pull/106 has been merged. please verify use the latest snapshot.

Comment 4 bot-tracker-sync 2022-03-23 17:20:06 UTC
G2Bsync 1076521287 comment 
 thuyn-581 Wed, 23 Mar 2022 16:10:56 UTC 
 G2Bsync -
Validated on 2.5.0-DOWNSTREAM-2022-03-22-01-58-08.

Comment 7 errata-xmlrpc 2022-06-09 02:09:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Advanced Cluster Management 2.5 security updates, images, and bug fixes), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:4956

Comment 8 SteinS 2023-01-24 06:04:24 UTC Comment hidden (spam)

Note You need to log in before you can comment on or make changes to this bug.