1498878 – Web UI is unreachable after performing adhoc metric in Openshift

Bug 1498878 - Web UI is unreachable after performing adhoc metric in Openshift

Summary: Web UI is unreachable after performing adhoc metric in Openshift

Keywords:
Status:	CLOSED DUPLICATE of bug 1478434
Alias:	None
Product:	Red Hat CloudForms Management Engine
Classification:	Red Hat
Component:	UI - OPS
Sub Component:
Version:	5.8.0
Hardware:	All
OS:	All
Priority:	high
Severity:	high
Target Milestone:	GA
Target Release:	5.8.3
Assignee:	Joe Rafaniello
QA Contact:	Einat Pacifici
Docs Contact:
URL:
Whiteboard:	container:ui
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2017-10-05 13:20 UTC by Neha Chugh
Modified:	2021-03-11 15:55 UTC (History)
CC List:	14 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2018-01-11 18:27:04 UTC
Category:	---
Cloudforms Team:	CFME Core
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
the error message I get when the metrics server is down (93.27 KB, image/png) 2017-10-15 11:25 UTC, Yaacov Zamir	no flags	Details
Proxy issue video (11.43 MB, application/octet-stream) 2017-12-01 11:46 UTC, Neha Chugh	no flags	Details
View All

Description Neha Chugh 2017-10-05 13:20:18 UTC

Description of problem:
Web UI is unreachable after performing adhoc metric in Openshift

Version-Release number of selected component (if applicable):
Cloudforms 4.5

How reproducible:
Always


Steps to Reproduce:
1. Navigate to compute -> container -> provider -> Monitoring -> Adhoc metric
2. It will give 502 proxy error and Web UI is unreachable
3. After restarting the evm service, the Web UI is reachable.

Actual results:
It is giving below exception:
The proxy server received an invalid response from an upstream server.
The proxy server could not handle the request GET /ops/explorer.

Reason: Error reading from remote server

after performing adhoc metric in openshift.

Expected results:
It should not throw any exception.


Additional info:
In order to get latest logs, please use below commands:

[yank] complete - access files in /cases/01930822
    [browse] the files here: http://collab-shell.usersys.redhat.com/01930822/
    [images] are available here: http://collab-shell.usersys.redhat.com/01930822/x-image

Comment 4 Federico Simoncelli 2017-10-05 14:11:15 UTC

Yaacov can you look into this?

Comment 5 Yaacov Zamir 2017-10-15 11:24:31 UTC

> Steps to Reproduce:
> 1. Navigate to compute -> container -> provider -> Monitoring -> Adhoc metric
> 2. It will give 502 proxy error and Web UI is unreachable

I can't reproduce :-( on my system I get the metrics page, if the metrics server is down I get a regular error message.

Do you have / can you prepare a system I can login and see this happening ?

Comment 6 Yaacov Zamir 2017-10-15 11:25:29 UTC

Created attachment 1338801 [details]
the error message I get when the metrics server is down

Comment 13 Yaacov Zamir 2017-10-18 11:02:53 UTC

Neha Chugh, hi, any news ?

Comment 14 Neha Chugh 2017-10-18 11:53:03 UTC

Hello Yaacov,

The issue is not reproducible in all the environments, give me a day or so time so to reproduce the issue and provide the environment details accordingly.

Regards,
Neha chugh

Comment 16 Neha Chugh 2017-11-14 07:50:11 UTC

Helllo Yaacov,

I am unable to reproduce the issue in any of the test environments, currently we are checking with customer if there is any network connectivity issue between hawkular and Cloudforms.

Currently we are waiting for customer response on this, will update the BZ once we get the required inputs from customer.

Regards,
Neha Chugh

Comment 19 Yaacov Zamir 2017-11-27 12:22:39 UTC

Created attachment 1359449 [details]
hawkular is not responsive (try to connect to hawkular fails)

Comment 22 Neha Chugh 2017-12-01 11:46:38 UTC

Created attachment 1361533 [details]
Proxy issue video

Comment 40 Joe Rafaniello 2017-12-13 22:17:36 UTC

Yes, if the UI worker is dying because it's exceeding memory thresholds, then I'd imagine you might see something similar to that.  

I haven't had a chance to look at the logs but is the evm_worker_memory_exceeded happening to the UI worker?

Comment 45 Joe Rafaniello 2017-12-15 21:35:28 UTC

bug 1478434 was a clone of the above "two miq servers" bug and was released in 5.8.2.0.  The logs indicate the customer logs come from 5.8.1.5.

Neha, I notice you recreated this on your system with version 5.8.1.5, can you recreate this issue in cfme 5.8.2.0+?

If you still hit this issue on 5.8.2.0, we'd have to investigate the timeouts and amazingly long requests highlighted in comment 43.

Comment 46 Neha Chugh 2017-12-18 04:59:48 UTC

Alright Joe, Let me check at cfme 5.8.2.0 and will come back with my findings.

Regards,
Neha Chugh

Note You need to log in before you can comment on or make changes to this bug.