1712581 – During upgrade, failure mesage can be "Cluster did not complete upgrade: timed out waiting for the condition"

Bug 1712581 - During upgrade, failure mesage can be "Cluster did not complete upgrade: timed out waiting for the condition"

Summary: During upgrade, failure mesage can be "Cluster did not complete upgrade: time...

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Cluster Version Operator
Sub Component:
Version:	4.2.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Target Release:	4.2.0
Assignee:	W. Trevor King
QA Contact:	liujia
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2019-05-21 20:26 UTC by bpeterse
Modified:	2019-06-17 17:42 UTC (History)
CC List:	7 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2019-06-17 17:42:58 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	openshift origin pull 22884	0	None	closed	Bug 1712581: test/e2e/upgrade: Summarize ClusterVersion on upgrade failures	2020-06-10 12:27:24 UTC

Description bpeterse 2019-05-21 20:26:22 UTC

Example PR:
https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/openshift_console-operator/229/pull-ci-openshift-console-operator-master-e2e-aws-upgrade/100#openshift-tests-disruptive-cluster-upgrade-should-maintain-a-functioning-cluster-featureclusterupgrade-suiteopenshift-serial

The message:

```
Cluster did not complete upgrade: timed out waiting for the condition
```

In:

```
fail [github.com/openshift/origin/test/e2e/upgrade/upgrade.go:138]: during upgrade

Unexpected error:
    <*errors.errorString | 0xc002137310>: {
        s: "Cluster did not complete upgrade: timed out waiting for the condition",
    }
    Cluster did not complete upgrade: timed out waiting for the condition
occurred
```
Could likely be clearer to help diagnose what happened to fail the upgrade.

Comment 1 Brenton Leanhardt 2019-05-21 21:16:12 UTC

The team has some ideas to improve the error output here (like printing the ClusterVersion status).  The general process here should be to look at the stdout/stderr and look for ClusterVersion transitions.  The cvo logs would be another place to look.

Comment 2 W. Trevor King 2019-05-22 03:14:52 UTC

Probably affects 4.1 too, but so far has only been reported in origin/master (~4.2) builds.  We can move Version back to 4.1 if/when we see it there.

Comment 3 Brenton Leanhardt 2019-06-17 17:42:58 UTC

Since this is related to CI and the fix landed in master we're just going to close this.  No need for QE verification.

Note You need to log in before you can comment on or make changes to this bug.