Description of problem:
Web UI is unreachable after opening the adhoc metrics page for an OpenShift provider.

Version-Release number of selected component (if applicable):
Cloudforms 4.5

How reproducible:
Always

Steps to Reproduce:
1. Navigate to compute -> container -> provider -> Monitoring -> Adhoc metric
2. It will give 502 proxy error and Web UI is unreachable
3. After restarting the evm service, the Web UI is reachable.

Actual results:
The following error is displayed after opening the adhoc metrics page for OpenShift:
The proxy server received an invalid response from an upstream server. The proxy server could not handle the request GET /ops/explorer. Reason: Error reading from remote server

Expected results:
It should not throw any exception.

Additional info:
In order to get latest logs, please use below commands:
[yank] complete - access files in /cases/01930822
[browse] the files here: http://collab-shell.usersys.redhat.com/01930822/
[images] are available here: http://collab-shell.usersys.redhat.com/01930822/x-image
Yaacov, can you look into this?
> Steps to Reproduce:
> 1. Navigate to compute -> container -> provider -> Monitoring -> Adhoc metric
> 2. It will give 502 proxy error and Web UI is unreachable

I can't reproduce this :-( On my system I get the metrics page; if the metrics server is down, I get a regular error message. Do you have, or can you prepare, a system I can log in to and see this happening?
Created attachment 1338801 [details] the error message I get when the metrics server is down
Hi Neha Chugh, any news?
Hello Yaacov, The issue is not reproducible in every environment. Give me a day or so to reproduce it, and I will provide the environment details accordingly. Regards, Neha Chugh
Hello Yaacov, I am unable to reproduce the issue in any of the test environments. We are currently checking with the customer whether there is a network connectivity issue between Hawkular and Cloudforms. We are waiting for the customer's response and will update the BZ once we get the required inputs. Regards, Neha Chugh
Created attachment 1359449 [details] hawkular is not responsive (try to connect to hawkular fails)
Created attachment 1361533 [details] Proxy issue video
Yes, if the UI worker is dying because it's exceeding memory thresholds, then I'd imagine you might see something similar to that. I haven't had a chance to look at the logs but is the evm_worker_memory_exceeded happening to the UI worker?
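One way to answer the question above is to scan the log for the event name and see which worker it is attached to. The following is a minimal sketch, not a verified tool: it assumes that the string evm_worker_memory_exceeded appears literally in the log and that the worker class name (for example MiqUiWorker) appears on the same line. The helper name, the regular expression, and the log path in the usage comment are illustrative assumptions and were not checked against the customer logs.

```python
import re
from collections import Counter

# Assumption: worker class names are written as a word ending in "Worker",
# e.g. "MiqUiWorker". Lines without such a word are counted as "unknown".
WORKER_RE = re.compile(r"\b\w+Worker\b")


def count_memory_exceeded(lines, event="evm_worker_memory_exceeded"):
    """Count log lines that mention `event`, grouped by worker name."""
    counts = Counter()
    for line in lines:
        if event not in line:
            continue
        match = WORKER_RE.search(line)
        counts[match.group(0) if match else "unknown"] += 1
    return counts


# Usage (the path is an assumption; adjust to where evm.log actually lives):
#
#     with open("/var/www/miq/vmdb/log/evm.log", errors="replace") as log:
#         for worker, n in count_memory_exceeded(log).most_common():
#             print(worker, n)
```

If the UI worker shows up in the counts around the time of the 502 errors, that would support the memory threshold explanation; if it does not, the timeouts are more likely to come from somewhere else.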
bug 1478434 was a clone of the above "two miq servers" bug and was released in 5.8.2.0. The logs indicate the customer logs come from 5.8.1.5. Neha, I notice you recreated this on your system with version 5.8.1.5, can you recreate this issue in cfme 5.8.2.0+? If you still hit this issue on 5.8.2.0, we'd have to investigate the timeouts and amazingly long requests highlighted in comment 43.
Alright Joe, Let me check on cfme 5.8.2.0 and I will come back with my findings. Regards, Neha Chugh