Bug 1825215

Summary: [4.4] 504 error for Prometheus API in administrator console with kuryr network
Product: OpenShift Container Platform
Reporter: Junqi Zhao <juzhao>
Component: Networking
Assignee: Maysa Macedo <mdemaced>
Networking sub component: kuryr
QA Contact: Jon Uriarte <juriarte>
Status: CLOSED ERRATA
Docs Contact:
Severity: high
Priority: urgent
CC: gcheresh, juriarte, ltomasbo
Version: 4.4
Keywords: TestBlocker
Target Milestone: ---
Target Release: 4.4.z
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1825207
: 1833265
Environment:
Last Closed: 2020-05-18 13:35:02 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On: 1825207
Bug Blocks: 1833265
Attachments:
Description                                                    Flags
GET requests from console GUI to prometheus API                none
"Monitoring -> Metrics" menu from console GUI                  none
kube_pod_info_node_count metric retrieval from console GUI     none

Description Junqi Zhao 2020-04-17 12:07:08 UTC
+++ This bug was initially created as a clone of Bug #1825207 +++

Created attachment 1679633
504 error for the Prometheus APIs for "Home -> Overview" page

Description of problem:
This bug originates from Bug 1822861; use it to track the 504 errors returned by the Prometheus APIs in the administrator console.

Log in to the administrator console and open the "Home -> Overview", "Monitoring -> Alerting", or "Monitoring -> Metrics" page. On a kuryr OCP 4.5.0-0.nightly-2020-04-17-053057 cluster, the Prometheus API requests made by the console return 504 errors; see the attached picture.



Version-Release number of selected component (if applicable):
kuryr OCP 4.5.0-0.nightly-2020-04-17-053057

How reproducible:
always

Steps to Reproduce:
1. See the description

Actual results:
504 error for Prometheus API in administrator console 

Expected results:
no error

Additional info:

--- Additional comment from Junqi Zhao on 2020-04-17 12:03:22 UTC ---

Comment 5 Jon Uriarte 2020-05-11 13:25:32 UTC
Verified in 4.4.0-0.nightly-2020-05-08-224132 on top of OSP 16 compose RHOS_TRUNK-16.0-RHEL-8-20200506.n.2 (with OVS and amphora-driver).

OCP 4.4 installation with Kuryr completed successfully.

Prometheus metrics requests from the GUI "Home -> Overview" page no longer return 504 error responses.
The metrics are retrieved correctly in "Monitoring -> Metrics".

The metrics can be retrieved from the CLI as well:

$ token=`oc sa get-token prometheus-k8s -n openshift-monitoring`
$ oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- curl -k -H "Authorization: Bearer $token" 'https://10.196.2.46:9100/metrics'
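
The same check could be scripted so that it reports only the HTTP status code, which makes a 504 easy to spot. This is a minimal sketch, not part of the original verification: the helper names classify_status and probe_status are hypothetical, and probe_status assumes a logged-in oc session and the same pod, container, and service account used in the commands above.

```shell
# Hypothetical helper: map an HTTP status code to a short verdict.
classify_status() {
  case "$1" in
    2??) echo "ok" ;;
    504) echo "gateway timeout" ;;
    *)   echo "unexpected ($1)" ;;
  esac
}

# Hypothetical helper: request a URL from inside the prometheus container
# and print only the HTTP status code (curl prints 000 if no response
# was received). Assumes a logged-in oc session.
probe_status() {
  url="$1"
  token=$(oc sa get-token prometheus-k8s -n openshift-monitoring)
  oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- \
    curl -k -s -o /dev/null -w '%{http_code}' \
    -H "Authorization: Bearer $token" "$url"
}

# Usage (needs a live cluster; the address is the one used above):
#   classify_status "$(probe_status 'https://10.196.2.46:9100/metrics')"
```

Keeping the classification separate from the probe means the verdict logic can be exercised without cluster access.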

Comment 6 Jon Uriarte 2020-05-11 13:26:24 UTC
Created attachment 1687321
GET requests from console GUI to prometheus API

Comment 7 Jon Uriarte 2020-05-11 13:28:21 UTC
Created attachment 1687322
"Monitoring -> Metrics" menu from console GUI

Comment 8 Jon Uriarte 2020-05-11 13:29:00 UTC
Created attachment 1687323
kube_pod_info_node_count metric retrieval from console GUI

Comment 10 errata-xmlrpc 2020-05-18 13:35:02 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:2133