Bug 1899258 - marketplace operator stuck on install
Summary: marketplace operator stuck on install
Keywords:
Status: CLOSED DUPLICATE of bug 1881542
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: OLM
Version: 4.5
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: ---
Assignee: Over the Air Updates
QA Contact: Johnny Liu
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-11-18 18:38 UTC by Mangirdas Judeikis
Modified: 2022-05-06 12:29 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-11-19 16:29:20 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Mangirdas Judeikis 2020-11-18 18:38:41 UTC
Marketplace operator stuck during upgrade 

Description of problem:

During non-deterministic installs sometimes install process is not fully completed and CVO is not progressing to completion due to fact that marketplace operator hangs and is not updating ClusterOperator status so from CVO perspective 
it is "not rolled-out".

Where marketplace operator logs and performs as healthy component

How reproducible:

Steps to Reproduce:
Non-deterministic issue. Unknown

Actual results:
Marketplace operator looks healthy when it fact it is not. 

Expected results:
If the marketplace operator is in not healthy state, it should indicate so and be restarted by kuberentes layer.

Comment 5 W. Trevor King 2020-11-18 21:43:40 UTC
Cluster-version operator is successfully acting on the ClusterOperator that marketplace is feeding us.  Moving to the marketplace folks so they can look into fixing what they write to their ClusterOperator.  Also moving the reported version to 4.5.  From the must-gather in comment 1:

$ yaml2json <cluster-scoped-resources/config.openshift.io/clusterversions/version.yaml | jq -r '.status.history[] | .startedTime + " " + (.completionTime // "-") + " " + .state + " " + .version + " " + (.verified | tostring)'
2020-11-18T12:47:58Z - Partial 4.5.16 false
$ yaml2json <cluster-scoped-resources/config.openshift.io/clusterversions/version.yaml | jq -r '.status.conditions[] | .lastTransitionTime + " " + .type + "=" + .status + " " + (.reason // "-") + ": " + (.message // "-")' | sort
2020-11-18T12:47:58Z Available=False -: -
2020-11-18T12:47:58Z Progressing=True ClusterOperatorNotAvailable: Unable to apply 4.5.16: the cluster operator marketplace has not yet successfully rolled out
2020-11-18T12:47:59Z RetrievedUpdates=True -: -
2020-11-18T13:27:39Z Failing=True ClusterOperatorNotAvailable: Cluster operator marketplace is still updating
$ yaml2json <cluster-scoped-resources/config.openshift.io/clusteroperators/marketplace.yaml | jq -r '.status.conditions[] | .lastTransitionTime + " " + .type + "=" + .status + " " + (.reason // "-") + ": " + (.message // "-")' | sort
2020-11-18T12:57:57Z Degraded=False OperandTransitionsSucceeding: Current CR sync ratio (1) meets the expected success ratio (0.3)
2020-11-18T12:57:57Z Upgradeable=True OperatorAvailable: Marketplace is upgradeable
$ yaml2json <cluster-scoped-resources/config.openshift.io/clusteroperators/marketplace.yaml | jq -r '.status | keys | sort[]'
conditions
extension
relatedObjects

So marketplace has not yet claimed Available=True or set an operator version in status.versions.

Comment 6 Mangirdas Judeikis 2020-11-19 11:15:40 UTC
Thanks, Trevor! 

Yes, we noticed this today in our CI but sadly cluster for cleaned by a purger. But all indication in the logs says that this is the same issue.

Comment 7 Kevin Rizza 2020-11-19 16:29:20 UTC
This appears to be a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1881542, which should merge into 4.5 soon

*** This bug has been marked as a duplicate of bug 1881542 ***


Note You need to log in before you can comment on or make changes to this bug.