Bug 1671055 - cluster-monitoring-operator loops forever with no error or log when route in not accepted.
Summary: cluster-monitoring-operator loops forever with no error or log when route in ...
Keywords:
Status: CLOSED DEFERRED
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Monitoring
Version: 4.1.0
Hardware: Unspecified
OS: Unspecified
unspecified
medium
Target Milestone: ---
: 4.1.0
Assignee: Frederic Branczyk
QA Contact: Junqi Zhao
URL:
Whiteboard:
Depends On:
Blocks: 1664187
TreeView+ depends on / blocked
 
Reported: 2019-01-30 16:30 UTC by Ryan Howe
Modified: 2022-03-13 16:53 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-02-18 19:09:53 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Knowledge Base (Solution) 3813031 0 None None None 2019-01-30 17:56:44 UTC

Description Ryan Howe 2019-01-30 16:30:25 UTC
Description of problem:

When deploying metrics if the route is never accepted by a router, the cluster-monitoring-operator will loop forever waiting for route to become ready, with no log showing why its not progressing creating the other objects. 


Version-Release number of selected component (if applicable):
3.11 
4.0 

How reproducible:
100%

Steps to Reproduce:
1. Configure Router sharding all router deployment  
2. No way to set a default shard label with out configuring a customer default project template. 
3. Deploy cluster-monitoring
  
Actual results:
Loop for ever preforming a GET on the grafana route. 

 round_trippers.go:383] GET https://172.22.0.1:443/apis/route.openshift.io/v1/namespaces/openshift-monitoring/routes/grafana


Expected results:
At the very least log an error
 Then either continue with loop or move forward creating the rest of the metrics objects, while still checking for route to become ready logging more verbosely what its waiting on.  


Additional info:

https://github.com/openshift/cluster-monitoring-operator/blob/master/pkg/client/client.go#L497-L502

https://github.com/openshift/cluster-monitoring-operator/blob/release-3.11/pkg/client/client.go#L377-L381

Comment 1 Robert Bost 2019-01-30 22:00:41 UTC
A similar bug was filed here: bz1666936 (copied to https://jira.coreos.com/browse/RFE-13)

Comment 2 Robert Bost 2019-01-30 22:10:04 UTC
(In reply to Robert Bost from comment #1)
> A similar bug was filed here: bz1666936 (copied to
> https://jira.coreos.com/browse/RFE-13)

Since the bug mentioned above was created to request improvements on the errors, can this bug be used to track an improvement in the Route created by the operator? Or should some other method be used?

Comment 3 minden 2019-01-31 10:48:02 UTC
I think the two can be combined as one. I am linking this bug zilla ticket in the RFE-13 Jira ticket.

Comment 4 Frederic Branczyk 2019-02-04 15:59:17 UTC
@Ryan I'm unfamiliar with route sharding. Is it possible to always set the "default" route sharding? These routes are super low in traffic so I'd like to find a solution that involves no configuration by customers.

Comment 5 Eric Rich 2019-02-18 19:09:53 UTC
Closing this in favor of the RFE in Jira.


Note You need to log in before you can comment on or make changes to this bug.