Bug 1821716 - Unable to upgrade to 4.4 : the cluster operator openshift-apiserver is degraded
Summary: Unable to upgrade to 4.4 : the cluster operator openshift-apiserver is degraded
Keywords:
Status: CLOSED DUPLICATE of bug 1817455
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Machine Config Operator
Version: 4.4
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: ---
Assignee: Stefan Schimanski
QA Contact: Xingxing Xia
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-04-07 13:00 UTC by Jaspreet Kaur
Modified: 2023-09-14 05:55 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-04-08 08:46:15 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Jaspreet Kaur 2020-04-07 13:00:08 UTC
Description of problem: It was observed that the first master was marked unschedulable which resulted in apiserver pod not schedulable and resulting in failure.

1. Unable to apply 4.4.0-rc.6: the cluster operator openshift-apiserver is degraded. 

If we mark master schedulable it gets reversed and later below operator failes.
                                                                                                             2. Unable to apply 4.4.0-rc.6: the cluster operator machine-config has not yet successfully rolled out.                                                      Message:               Unable to apply 4.4.0-rc.6: timed out waiting for the condition during syncRequiredMachineConfigPools: pool master has not progressed to latest configuration: controller version mismatch for rendered-master-f90dab41073f1445d44ec27c32c353b1 expected a7b13759061f645a76f03c04d385d275bbbd0c02 has ab4d62a3bf3774b77b6f9b04a2028faec1568aca, retrying                                                                                                                 3.Master-0 marked unschedulabe which prevent etcd-quorum to schedule.


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results: Fails to upgrade to 4.4


Expected results: Should succeed


Additional info:

 3fc43ae7-1201-4c5c-ab50-51583a209081

Must gather times out.

Comment 1 Abu Kashem 2020-04-07 15:35:38 UTC
Hi jkaur,
Since there is no must-gather we can't find the root cause. We need the following information:
- what is the infrastructure - aws, gcp?
- did it happen more than once? 
- is the cluster usable at all after this happens? is it possible for you to give us access to the cluster once it happens?
- can you give us the release image URLs for upgrade from -> to. We want to provision a cluster and kick off an upgrade and try to reproduce the issue.

Comment 2 Abu Kashem 2020-04-07 16:07:03 UTC
Assiging it to "Machine Config Operator" as looks like it could be an issue with the machine config. If you find otherwise please feel free to assign it back to apiserver.

Comment 4 Kirsten Garrison 2020-04-07 17:28:13 UTC

@Jaspreet is this an error that you've encountered more than once?  What version were you upgrading from?

Comment 5 Antonio Murdaca 2020-04-08 08:46:15 UTC

*** This bug has been marked as a duplicate of bug 1817455 ***

Comment 6 Red Hat Bugzilla 2023-09-14 05:55:09 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days


Note You need to log in before you can comment on or make changes to this bug.