Bug 1731481 - failed: waiting for Alertmanager Route to become ready
Summary: failed: waiting for Alertmanager Route to become ready
Keywords:
Status: CLOSED INSUFFICIENT_DATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: openshift-apiserver
Version: 4.2.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 4.2.0
Assignee: Stefan Schimanski
QA Contact: Xingxing Xia
URL:
Whiteboard: buildcop
Duplicates: 1721528
Depends On:
Blocks:
Reported: 2019-07-19 13:53 UTC by Thomas Wiest
Modified: 2019-08-01 10:42 UTC (History)
CC List: 10 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-08-01 10:42:45 UTC
Target Upstream Version:
Embargoed:



Description Thomas Wiest 2019-07-19 13:53:35 UTC
Run template e2e-aws-upgrade - e2e-aws-upgrade container setup

Error: running task Updating Alertmanager failed: waiting for Alertmanager Route to become ready

Full Log:
https://prow.svc.ci.openshift.org/view/gcs/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade/4230

Comment 2 Dan Mace 2019-07-19 14:45:49 UTC
I'm not seeing any evidence to support the claim that ingress is malfunctioning in this case.

The monitoring routes are admitted:

https://storage.googleapis.com/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade/4230/artifacts/e2e-aws-upgrade/must-gather/namespaces/openshift-monitoring/route.openshift.io/routes.yaml

The ingress operator is reporting available:

https://storage.googleapis.com/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade/4230/artifacts/e2e-aws-upgrade/must-gather/namespaces/openshift-ingress-operator/operator.openshift.io/ingresscontrollers/default.yaml

There are no errors in the ingress controller or router logs.

The following error is cited in the build log/this Bugzilla:

    level=fatal msg="failed to initialize the cluster: Cluster operator monitoring is reporting a failure: Failed to rollout the stack. Error: running task Updating Alertmanager failed: waiting for Alertmanager Route to become ready failed: waiting for RouteReady of alertmanager-main: the server is currently unable to handle the request (get routes.route.openshift.io alertmanager-main)"

It's not clear to me how that error can be interpreted as evidence of ingress failure. It looks to me like either the endpoint behind the route is returning an error (what is "the server"?), or the apiserver itself can't be reached (why is "get routes.route.openshift.io" present in the error?)

Do you agree this is unlikely to be a routing issue?
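
The two readings of the error above can be narrowed down from the message itself. A minimal sketch, assuming the trailing "(get routes.route.openshift.io alertmanager-main)" is the verb/resource/name suffix that Kubernetes API clients append to a failed request, and that "the server is currently unable to handle the request" is the generic 503 Service Unavailable text those clients use (to the best of my knowledge it is). The function name classify_error and the returned field names are hypothetical, made up for illustration:

```python
import re

# Matches a trailing "(<verb> <resource> [<name>])", optionally followed by a
# closing quote, as seen at the end of the log line quoted in this bug.
_REQUEST_SUFFIX = re.compile(
    r'\((?P<verb>[a-z]+) (?P<resource>[\w.-]+)(?: (?P<name>[\w.-]+))?\)\s*"?\s*$'
)

# Assumption: this is the generic text for an HTTP 503 from the API server.
UNAVAILABLE = "the server is currently unable to handle the request"


def classify_error(message):
    """Describe the failed API request named in the message.

    Returns None when the message carries no "(verb resource name)" suffix.
    """
    match = _REQUEST_SUFFIX.search(message)
    if match is None:
        return None
    return {
        "verb": match.group("verb"),
        "resource": match.group("resource"),
        "name": match.group("name"),
        "api_unavailable": UNAVAILABLE in message,
    }


if __name__ == "__main__":
    line = (
        'level=fatal msg="... waiting for RouteReady of alertmanager-main: '
        "the server is currently unable to handle the request "
        '(get routes.route.openshift.io alertmanager-main)"'
    )
    print(classify_error(line))
```

If the suffix is present and names an API resource, the failing call was most likely the read of the Route object through the API, not an HTTP request to the endpoint behind the route.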

Comment 3 Thomas Wiest 2019-07-19 14:52:01 UTC
Changing the needinfo to Sergiusz Urbaniak, as he is the one who reassigned it to the ingress team.

Comment 6 Michal Fojtik 2019-07-24 10:13:06 UTC
The "routes.route.openshift.io" resource is provided by the OpenShift API server. Can you check if the OpenShift API server is running?
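
One way to make that check concrete is to look at the Available condition on the aggregated APIService that serves the route group. A minimal sketch, assuming the object is named v1.route.openshift.io and its JSON was obtained with something like `oc get apiservice v1.route.openshift.io -o json`; the function name apiservice_available and the sample object are hypothetical and are not data from this bug:

```python
import json


def apiservice_available(apiservice):
    """Return (available, detail) for an APIService object given as a dict."""
    conditions = apiservice.get("status", {}).get("conditions", [])
    for cond in conditions:
        if cond.get("type") == "Available":
            ok = cond.get("status") == "True"
            reason = cond.get("reason", "")
            if ok:
                return True, reason
            return False, f'{reason}: {cond.get("message", "")}'
    # No Available condition at all is treated as "not available".
    return False, "no Available condition reported"


if __name__ == "__main__":
    # Hypothetical sample, shaped like the output of the oc command above.
    sample = json.loads("""
    {
      "metadata": {"name": "v1.route.openshift.io"},
      "status": {"conditions": [
        {"type": "Available", "status": "False",
         "reason": "ExampleReason", "message": "example message"}
      ]}
    }
    """)
    print(apiservice_available(sample))
```

A False result here would point at the OpenShift API server (or the path to it) and not at the ingress controller.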

Comment 7 Michal Fojtik 2019-07-24 10:15:06 UTC
*** Bug 1721528 has been marked as a duplicate of this bug. ***

Comment 8 Greg Blomquist 2019-07-31 14:42:38 UTC
Thomas, needinfo on you based on comment #6

Comment 9 Thomas Wiest 2019-07-31 15:49:07 UTC
Michal and Greg, I don't have any additional information. This bug was created while I was build cop, and the original bug description already contains the information about the build failure that led to it being filed.

