Bug 1916586 - Cluster went into abnormal status with a mess cluster version when downgrading from 4.7 to 4.6
Summary: Cluster went into abnormal status with a mess cluster version when downgradin...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Cluster Version Operator
Version: 4.7
Hardware: Unspecified
OS: Unspecified
high
medium
Target Milestone: ---
: ---
Assignee: Jack Ottofaro
QA Contact: Yang Yang
URL:
Whiteboard:
Depends On:
Blocks: 1907812 1914932
TreeView+ depends on / blocked
 
Reported: 2021-01-15 07:35 UTC by Qin Ping
Modified: 2021-02-04 13:58 UTC (History)
10 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-02-04 13:58:10 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift cluster-version-operator pull 404 0 None closed Bug 1907329: Add cluster profile support 2021-02-15 14:50:09 UTC

Description Qin Ping 2021-01-15 07:35:09 UTC
Description of problem:
The condition of cv shows it's on v4.7, but the clusterversion is v4.6. The abnormal status happened during downgrade test from v4.7 to v4.6. I found the update does not happen actually and cvo is still v4.7, but some of slo updated to v4.6.


Version-Release number of the following components:
upgrade path: 4.6.11 -> 4.7.0-0.nightly-2021-01-14-040200 -> 4.6.11

How reproducible:
Hit this issue in the OCP on vshpere cluster, it can be reproduced with the OCP on AWS cluster.

Steps to Reproduce:
1. Install an OCP 4.6.11 cluster
2. Upgrade to 4.7.0-0.nightly-2021-01-14-040200
3. Downgrade to 4.6.11

Actual results:
$ oc get clusterversion
NAME      VERSION   AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.6.11    True        False         102m    Cluster version is 4.6.11

But there are some COs are in 4.7.0-0.nightly-2021-01-14-040200
$ oc get co
NAME                                       VERSION                             AVAILABLE   PROGRESSING   DEGRADED   SINCE
authentication                             4.6.11                              True        False         False      4h12m
baremetal                                  4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h43m
cloud-credential                           4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
cluster-autoscaler                         4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
config-operator                            4.6.11                              True        False         False      21h
console                                    4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h16m
csi-snapshot-controller                    4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h21m
dns                                        4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
etcd                                       4.6.11                              True        False         False      21h
image-registry                             4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
ingress                                    4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
insights                                   4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
kube-apiserver                             4.6.11                              True        False         False      21h
kube-controller-manager                    4.6.11                              True        False         False      21h
kube-scheduler                             4.6.11                              True        False         False      21h
kube-storage-version-migrator              4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h21m
machine-api                                4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
machine-approver                           4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
machine-config                             4.7.0-0.nightly-2021-01-14-040200   True        False         False      133m
marketplace                                4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h21m
monitoring                                 4.7.0-0.nightly-2021-01-14-040200   True        False         False      133m
network                                    4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h33m
node-tuning                                4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h41m
openshift-apiserver                        4.6.11                              True        False         False      4h12m
openshift-controller-manager               4.6.11                              True        False         False      157m
openshift-samples                          4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h42m
operator-lifecycle-manager                 4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
operator-lifecycle-manager-catalog         4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
operator-lifecycle-manager-packageserver   4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h16m
service-ca                                 4.7.0-0.nightly-2021-01-14-040200   True        False         False      21h
storage                                    4.7.0-0.nightly-2021-01-14-040200   True        False         False      4h21m


Expected results:


Additional info:
Will add the the must gather log later.

Comment 2 Xingxing Xia 2021-01-18 02:55:12 UTC
Adding TestBlocker because it blocks CVO QE's downgrade case's test, also blocks the 4.7 issue MSTR-1055 epic test which requires upgrade-downgrade-upgrade test.

Comment 3 Pratik Mahajan 2021-01-26 17:32:04 UTC
We are waiting for other downgrade bugs to get sorted. Will come again next week for review.

Comment 4 Yang Yang 2021-01-27 05:42:04 UTC
This issue also happens on Azure and Baremetal.

Comment 5 Jack Ottofaro 2021-02-04 13:58:10 UTC
This was caused because 4.7 now supports Cluster Profiles (https://github.com/openshift/enhancements/pull/200) but it is not backwardly compatible with 4.6 breaking the ability to downgrade 4.7->4.6. Closing this bug against PR for adding cluster profile support (https://github.com/openshift/cluster-version-operator/pull/404) and we'll cherry pick it back to 4.6.


Note You need to log in before you can comment on or make changes to this bug.