Bug 2034805
| Summary: | upgrade not started for ODF 4.10 | | |
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Vijay Avuthu <vavuthu> |
| Component: | odf-operator | Assignee: | Nitin Goyal <nigoyal> |
| Status: | CLOSED ERRATA | QA Contact: | Vijay Avuthu <vavuthu> |
| Severity: | urgent | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.10 | CC: | amagrawa, branto, ebenahar, jarrpa, madam, mmuench, muagarwa, nberry, nigoyal, ocs-bugs, odf-bz-bot, pbalogh |
| Target Milestone: | --- | Keywords: | Automation, TestBlocker |
| Target Release: | ODF 4.10.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | 4.10.0-113 | Doc Type: | No Doc Update |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | | |
| : | 2041522 | Environment: | |
| Last Closed: | 2022-04-13 18:50:46 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 2041522 | | |

Description
Vijay Avuthu
2021-12-22 08:23:39 UTC
As of now, dependencies.yaml in the odf-operator lists only ocs-operator 4.10, which is causing this problem. When we try to upgrade the odf-operator from 4.9 to 4.10, OLM is not able to upgrade because odf-operator 4.10 cannot run with ocs-operator 4.9 due to its dependencies.yaml. To get out of this situation, we need to add the ocs-operator range 4.9 to 4.10 in the dependencies.yaml of the odf-operator. Moving it to the build team, as dependencies.yaml is handled by them.

@branto Can you please add 4.9 to the dependencies.yaml as well? I remember we faced some difficulties when we had it in the initial builds of 4.10, and we removed it to solve the issue. Let's do it again and run the tests in debug mode so that the setup won't get destroyed automatically.

The upgrade failed: the odf-operator CSV shows Succeeded, but the mcg-operator and ocs-operator CSVs are missing.

> csvs after upgrade ( missing mcg and ocs operator )

    NAME                   DISPLAY                     VERSION   REPLACES              PHASE
    odf-operator.v4.10.0   OpenShift Data Foundation   4.10.0    odf-operator.v4.9.1   Succeeded

> subscriptions

    NAME                                                             PACKAGE        SOURCE             CHANNEL
    mcg-operator-stable-4.9-redhat-operators-openshift-marketplace   mcg-operator   redhat-operators   stable-4.10
    ocs-operator-stable-4.9-redhat-operators-openshift-marketplace   ocs-operator   redhat-operators   stable-4.10
    odf-operator                                                     odf-operator   redhat-operators   stable-4.10

> install plans

    NAME            CSV                    APPROVAL    APPROVED
    install-7dbnx   odf-operator.v4.9.1    Automatic   true
    install-hv59z   odf-operator.v4.10.0   Automatic   true

> storagesystem yaml

    status:
      conditions:
      - lastHeartbeatTime: "2022-01-12T00:15:25Z"
        lastTransitionTime: "2022-01-11T23:36:53Z"
        message: Reconcile is in progress
        reason: Reconciling
        status: "False"
        type: Available
      - lastHeartbeatTime: "2022-01-12T00:15:25Z"
        lastTransitionTime: "2022-01-11T23:36:53Z"
        message: Reconcile is in progress
        reason: Reconciling
        status: "True"
        type: Progressing
      - lastHeartbeatTime: "2022-01-12T00:15:25Z"
        lastTransitionTime: "2022-01-11T23:02:30Z"
        message: StorageSystem CR is valid
        reason: Valid
        status: "False"
        type: StorageSystemInvalid
      - lastHeartbeatTime: "2022-01-12T00:15:25Z"
        lastTransitionTime: "2022-01-11T23:36:54Z"
        message: InstallPlan not found for CSV mcg-operator.v4.10.0; InstallPlan not found for CSV ocs-operator.v4.10.0
        reason: NotReady
        status: "False"
        type: VendorCsvReady
      - lastHeartbeatTime: "2022-01-11T23:02:30Z"
        lastTransitionTime: "2022-01-11T23:02:30Z"
        reason: Found
        status: "True"
        type: VendorSystemPresent

> odf operator log
2022-01-11T23:58:45.259671211Z 2022-01-11T23:58:45.259Z ERROR controller-runtime.manager.controller.storagesystem Reconciler error {"reconciler group": "odf.openshift.io", "reconciler kind": "StorageSystem", "name": "ocs-storagecluster-storagesystem", "namespace": "openshift-storage", "error": "InstallPlan not found for CSV mcg-operator.v4.10.0; InstallPlan not found for CSV ocs-operator.v4.10.0", "errorCauses": [{"error": "InstallPlan not found for CSV mcg-operator.v4.10.0"}, {"error": "InstallPlan not found for CSV ocs-operator.v4.10.0"}]}
2022-01-11T23:58:45.259671211Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
2022-01-11T23:58:45.259671211Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:253
2022-01-11T23:58:45.259671211Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
2022-01-11T23:58:45.259671211Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:214
2022-01-12T00:15:24.919494632Z 2022-01-12T00:15:24.919Z ERROR controller-runtime.manager.controller.subscription Reconciler error {"reconciler group": "operators.coreos.com", "reconciler kind": "Subscription", "name": "odf-operator", "namespace": "openshift-storage", "error": "InstallPlan not found for CSV mcg-operator.v4.10.0; InstallPlan not found for CSV ocs-operator.v4.10.0", "errorCauses": [{"error": "InstallPlan not found for CSV mcg-operator.v4.10.0"}, {"error": "InstallPlan not found for CSV ocs-operator.v4.10.0"}]}
2022-01-12T00:15:24.919494632Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
2022-01-12T00:15:24.919494632Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:253
2022-01-12T00:15:24.919494632Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
2022-01-12T00:15:24.919494632Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:214
2022-01-12T00:15:25.260432607Z 2022-01-12T00:15:25.260Z INFO controllers.StorageSystem storagesystem instance found {"instance": "openshift-storage/ocs-storagecluster-storagesystem"}
2022-01-12T00:15:25.265186695Z 2022-01-12T00:15:25.265Z INFO controllers.StorageSystem Updating quickstarts {"instance": "openshift-storage/ocs-storagecluster-storagesystem", "Name": "getting-started-odf", "Namespace": ""}
2022-01-12T00:15:25.269798107Z 2022-01-12T00:15:25.269Z INFO controllers.StorageSystem Updating quickstarts {"instance": "openshift-storage/ocs-storagecluster-storagesystem", "Name": "odf-configuration", "Namespace": ""}
2022-01-12T00:15:25.270183063Z 2022-01-12T00:15:25.270Z ERROR controllers.StorageSystem failed to validate CSV {"instance": "openshift-storage/ocs-storagecluster-storagesystem", "ClusterServiceVersion": "mcg-operator.v4.10.0", "error": "InstallPlan not found for CSV mcg-operator.v4.10.0"}
2022-01-12T00:15:25.270183063Z github.com/red-hat-data-services/odf-operator/controllers.(*StorageSystemReconciler).reconcile
2022-01-12T00:15:25.270183063Z /remote-source/app/controllers/storagesystem_controller.go:163
2022-01-12T00:15:25.270183063Z github.com/red-hat-data-services/odf-operator/controllers.(*StorageSystemReconciler).Reconcile
2022-01-12T00:15:25.270183063Z /remote-source/app/controllers/storagesystem_controller.go:87
2022-01-12T00:15:25.270183063Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
2022-01-12T00:15:25.270183063Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:298
2022-01-12T00:15:25.270183063Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
2022-01-12T00:15:25.270183063Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:253
2022-01-12T00:15:25.270183063Z sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
2022-01-12T00:15:25.270183063Z /remote-source/app/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:214
2022-01-12T00:15:25.270183063Z 2022-01-12T00:15:25.270Z ERROR controllers.StorageSystem failed to validate CSV {"instance": "openshift-storage/ocs-storagecluster-storagesystem", "ClusterServiceVersion": "ocs-operator.v4.10.0", "error": "InstallPlan not found for CSV ocs-operator.v4.10.0"}
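
For triage, the JSON payload at the end of each ERROR line in the log above can be parsed to list which CSVs have no InstallPlan. The following is only a rough sketch: the helper names `error_causes` and `missing_installplan_csvs` are made up for illustration, and it assumes each structured line ends with a single JSON object, as the lines pasted in this bug do.

```python
import json

# Assumption: the message text matches what the log lines in this bug show.
PREFIX = "InstallPlan not found for CSV "


def error_causes(log_line):
    """Return the individual error strings carried by one operator log line."""
    start = log_line.find("{")
    if start == -1:
        return []  # stack-trace lines carry no JSON payload
    try:
        payload = json.loads(log_line[start:])
    except json.JSONDecodeError:
        return []
    causes = payload.get("errorCauses")
    if causes:
        return [c["error"] for c in causes if "error" in c]
    err = payload.get("error")
    return [err] if err else []


def missing_installplan_csvs(lines):
    """Collect, in first-seen order, the CSV names reported without an InstallPlan."""
    found = []
    for line in lines:
        for cause in error_causes(line):
            if cause.startswith(PREFIX):
                name = cause[len(PREFIX):]
                if name not in found:
                    found.append(name)
    return found


# Lines taken from the log in this bug (the first one trimmed to the relevant keys).
sample = [
    '2022-01-11T23:58:45.259671211Z 2022-01-11T23:58:45.259Z ERROR '
    'controller-runtime.manager.controller.storagesystem Reconciler error '
    '{"name": "ocs-storagecluster-storagesystem", "namespace": "openshift-storage", '
    '"error": "InstallPlan not found for CSV mcg-operator.v4.10.0; '
    'InstallPlan not found for CSV ocs-operator.v4.10.0", '
    '"errorCauses": [{"error": "InstallPlan not found for CSV mcg-operator.v4.10.0"}, '
    '{"error": "InstallPlan not found for CSV ocs-operator.v4.10.0"}]}',
    '2022-01-11T23:58:45.259671211Z sigs.k8s.io/controller-runtime/pkg/internal/'
    'controller.(*Controller).processNextWorkItem',
    '2022-01-12T00:15:25.270183063Z 2022-01-12T00:15:25.270Z ERROR '
    'controllers.StorageSystem failed to validate CSV '
    '{"instance": "openshift-storage/ocs-storagecluster-storagesystem", '
    '"ClusterServiceVersion": "mcg-operator.v4.10.0", '
    '"error": "InstallPlan not found for CSV mcg-operator.v4.10.0"}',
]

print(missing_installplan_csvs(sample))
```

Run against the lines above, this reports mcg-operator.v4.10.0 and ocs-operator.v4.10.0, which matches the VendorCsvReady condition in the StorageSystem status.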
(In reply to Vijay Avuthu from comment #9)
> > odf operator log

must gather logs: http://magna002.ceph.redhat.com/ocsci-jenkins/openshift-clusters/j-138vu1cs33-ua/j-138vu1cs33-ua_20220111T223532/logs/failed_testcase_ocs_logs_1641943067/test_upgrade_ocs_logs/

job: https://ocs4-jenkins-csb-odf-qe.apps.ocp-c1.prod.psi.redhat.com/job/qe-deploy-ocs-cluster-prod/2874/consoleFull

root-cause: While upgrading odf-operator, the odf-operator CSV gets deleted and replaced with the new one. This causes ocs-operator and mcg-operator to be deleted, because the odf-operator CSV was their owner. To prevent ocs-operator and mcg-operator from being deleted, we need to remove the odf-operator CSV as an owner and add the odf-operator Subscription as an owner instead, so that the garbage collector no longer ties them to the lifetime of the CSV.
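
To illustrate the ownership change described in the root cause, here is a minimal sketch that works on plain dictionaries shaped like Kubernetes object metadata. It is not the code from the fix: `swap_owner`, the dependent object, and the UID strings are hypothetical placeholders, and only the `ownerReferences` field layout (apiVersion, kind, name, uid) is taken from the Kubernetes API.

```python
import copy

# API group/version used by OLM ClusterServiceVersion and Subscription objects.
OLM_API = "operators.coreos.com/v1alpha1"


def swap_owner(obj, old_kind, old_name, new_owner):
    """Return a copy of obj that drops the old owner and lists new_owner once."""
    result = copy.deepcopy(obj)
    meta = result.setdefault("metadata", {})
    # Drop every reference to the old owner (here: the odf-operator CSV).
    refs = [
        ref for ref in meta.get("ownerReferences", [])
        if not (ref.get("kind") == old_kind and ref.get("name") == old_name)
    ]
    # Add the new owner unless it is already present, so the call is idempotent.
    if not any(ref.get("uid") == new_owner["uid"] for ref in refs):
        refs.append(dict(new_owner))
    meta["ownerReferences"] = refs
    return result


# Hypothetical dependent object that is currently owned by the 4.9 CSV.
dependent = {
    "metadata": {
        "name": "example-dependent",  # placeholder name
        "namespace": "openshift-storage",
        "ownerReferences": [{
            "apiVersion": OLM_API,
            "kind": "ClusterServiceVersion",
            "name": "odf-operator.v4.9.1",
            "uid": "csv-uid-placeholder",
        }],
    }
}

# The odf-operator Subscription survives the upgrade, unlike the replaced CSV.
subscription_owner = {
    "apiVersion": OLM_API,
    "kind": "Subscription",
    "name": "odf-operator",
    "uid": "subscription-uid-placeholder",
}

patched = swap_owner(
    dependent, "ClusterServiceVersion", "odf-operator.v4.9.1", subscription_owner
)
print(patched["metadata"]["ownerReferences"])
```

The point of the swap is that the Subscription is not replaced during the upgrade, so an object owned by it is not collected when OLM deletes the old CSV.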
PR: https://github.com/red-hat-storage/odf-operator/pull/166

I see that Vijay tried to re-trigger and it passed here: https://ocs4-jenkins-csb-odf-qe.apps.ocp-c1.prod.psi.redhat.com/job/qe-deploy-ocs-cluster-prod/3035/testReport/tests.ecosystem.upgrade/

To be sure, I am trying once more here with the latest build: https://ocs4-jenkins-csb-odf-qe.apps.ocp-c1.prod.psi.redhat.com/job/qe-trigger-aws-ipi-3az-rhcos-3m-3w-upgrade-ocs-auto/130/

We are hitting 2 separate issues while (after) upgrading and have raised bugs for them:
https://bugzilla.redhat.com/show_bug.cgi?id=2043513
https://bugzilla.redhat.com/show_bug.cgi?id=2043510

Since all CSVs are upgraded, marking this bug as verified; we will track the other issues as separate bugs.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.10.0 enhancement, security & bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:1372