Bug 1871832

Summary: APIServerServiceUnavailableErrorjava error makes ImageChangesInProgress keeping true that blocked the upgrade processed
Product: OpenShift Container Platform Reporter: OpenShift BugZilla Robot <openshift-bugzilla-robot>
Component: SamplesAssignee: Gabe Montero <gmontero>
Status: CLOSED ERRATA QA Contact: XiuJuan Wang <xiuwang>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.5CC: aabhishe, adam.kaplan
Target Milestone: ---Keywords: UpcomingSprint
Target Release: 4.4.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: intermittent API server errors were reported on the wrong condition (ImageChangesInProgress instead of SamplesExists) of the cluster operator config object. Consequence: when API server communication returned and all the samples were installed, the samples operator would fail to switch Progressing to false because there was unexpected data in its ImageChangesInProgress condition, and upgrades would incorrectly be marked as incomplete. Fix: code change was made to update SamplesExists with error reports on APIServer communication Result: upgrades are no longer blocked if intermittent APIServer errors occur while samples operator is upgrading.
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-09-22 06:58:40 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1857201    
Bug Blocks:    

Comment 1 Gabe Montero 2020-08-26 14:19:13 UTC
*** Bug 1872371 has been marked as a duplicate of this bug. ***

Comment 3 Gabe Montero 2020-09-01 12:50:52 UTC
@Abhishek

The customer can run

oc delete configs.samples cluster

That should clear out the samples operator and have it retry.  Assuming the api server is fully accessible samples should come up OK.

Comment 4 Adam Kaplan 2020-09-10 14:34:45 UTC
PR is LGTM and is awaiting patch manager approval.

Comment 6 XiuJuan Wang 2020-09-14 03:39:26 UTC
Delete apiserver pods during samples operator removing or creating imagestream, the APISERVER error reports in samplesexist part.
 
   - lastTransitionTime: "2020-09-14T03:36:11Z"
      lastUpdateTime: "2020-09-14T03:36:11Z"
      message: The error the server is currently unable to handle the request (get
        imagestreams.image.openshift.io) during openshift namespace cleanup has left
        the samples in an unknown state
      reason: APIServerServiceUnavailableError
      status: Unknown
      type: SamplesExist

Verified on 4.4.0-0.nightly-2020-09-12-143345 cluster

Comment 9 errata-xmlrpc 2020-09-22 06:58:40 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.4.23 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3715