Bug 1365525 - Difficult to find details on RC error through web console
Status: CLOSED ERRATA
Product: OpenShift Container Platform
Classification: Red Hat
Component: Management Console
Version: 3.2.1
Hardware: Unspecified Unspecified
Priority: medium, Severity: low
Assigned To: Jessica Forrester
QA Contact: Yadan Pei
Depends On:
Blocks:
Reported: 2016-08-09 08:48 EDT by Justin Pierce
Modified: 2017-08-16 15 EDT
Doc Type: Bug Fix
Doc Text:
It was difficult to find the underlying reason for a failed deployment from the project overview. The overview will now link to the Events page in these scenarios, which typically contains useful information about what went wrong.
Last Closed: 2017-08-10 01:15:47 EDT
Type: Bug
Attachments
- failed-deployment-help (49.21 KB, image/png), 2017-07-07 12:42 EDT, Jessica Forrester
- help debug link (32.32 KB, image/png), 2017-07-11 04:01 EDT, shahan
- useful info (61.79 KB, image/png), 2017-07-11 04:02 EDT, shahan


External Trackers
Tracker: Red Hat Product Errata
ID: RHEA-2017:1716
Priority: normal
Status: SHIPPED_LIVE
Summary: Red Hat OpenShift Container Platform 3.6 RPM Release Advisory
Last Updated: 2017-08-10 05:02:50 EDT

Description Justin Pierce 2016-08-09 08:48:41 EDT
Description of problem:
Several deployments failed for me during hack day. From the web console, it was difficult to find information about why. Although I could have poked around with the CLI, I wanted to stay within the GUI.
- Clicking the word "FAILED" in the deployment overview did not take me to information about the failure.
- The logs for the deployment# did not identify the underlying cause; they merely indicated a timeout.
- I ultimately found the reason under Browse/Events, but the events there were in no way connected to my deployment. The event:
     "hellow-2 | Replication Controller | Warning | Failed create | Error creating: pods "hellow-2-" is forbidden: service account jupierce1/jws-service-account was not found, retry after the service account is created | 8 times in the last 3 minutes"

Version-Release number of selected component (if applicable):


How reproducible:
100%.

Steps to Reproduce:
1. Attempt to use the template jws30-tomcat8-mysql-s2i. Do not create a JWS service account in advance.
2. Leave all template parameters at their defaults.

Actual results:
No apparent way to directly analyze DC's failure. 

Expected results:
Information about RC's failure somewhat correlated with DC in GUI.

Additional info:
I understand that this particular deployment failure is valid (https://bugzilla.redhat.com/show_bug.cgi?id=1313556). I'm just hoping that finding the underlying cause could have been more intuitive.
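
For illustration, the correlation asked for above can be sketched in a few lines. This is a minimal sketch and not the web console's implementation: it assumes events are available as dictionaries shaped like Kubernetes Event objects (with `type`, `reason`, `count`, `message`, and `involvedObject` fields), and the helper name `events_for_object` and the sample data are hypothetical. The raw reason string `FailedCreate` is assumed to be what the console displays as "Failed create".

```python
from typing import Any, Dict, List

Event = Dict[str, Any]


def events_for_object(events: List[Event], kind: str, name: str) -> List[Event]:
    """Return the Warning events whose involvedObject matches kind and name."""
    matches = []
    for event in events:
        involved = event.get("involvedObject", {})
        if (
            event.get("type") == "Warning"
            and involved.get("kind") == kind
            and involved.get("name") == name
        ):
            matches.append(event)
    return matches


# Hypothetical sample data; the first message is the one quoted in this report.
sample_events = [
    {
        "type": "Warning",
        "reason": "FailedCreate",  # assumed raw form of "Failed create"
        "count": 8,
        "involvedObject": {"kind": "ReplicationController", "name": "hellow-2"},
        "message": (
            'Error creating: pods "hellow-2-" is forbidden: service account '
            "jupierce1/jws-service-account was not found, retry after the "
            "service account is created"
        ),
    },
    {
        "type": "Normal",
        "reason": "Scheduled",
        "count": 1,
        "involvedObject": {"kind": "Pod", "name": "other-1-abcde"},
        "message": "Successfully assigned other-1-abcde to node-1",
    },
]

rc_warnings = events_for_object(sample_events, "ReplicationController", "hellow-2")
for event in rc_warnings:
    print(f'{event["reason"]} (x{event["count"]}): {event["message"]}')
```

Filtering on the involved object is what ties the otherwise unrelated entry under Browse/Events back to the specific replication controller.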
Comment 1 Samuel Padgett 2016-11-11 13:29:43 EST
Ideally the cause of the failure would be written back to deployment status so we could display it directly. I have opened a PR that adds a link to the events tab to encourage users to check there.

https://github.com/openshift/origin-web-console/pull/864
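
To decide when such a link is worth showing, a client needs to know which deployment config a replication controller belongs to and whether that deployment failed. The following is a rough sketch under stated assumptions, not the code from the PR: it assumes the RC carries the annotations `openshift.io/deployment-config.name` and `openshift.io/deployment.phase`, and the function names and sample object are hypothetical.

```python
from typing import Any, Dict, Optional

# Assumed annotation keys set on replication controllers created by a deployment config.
DC_NAME_ANNOTATION = "openshift.io/deployment-config.name"
PHASE_ANNOTATION = "openshift.io/deployment.phase"


def _annotations(rc: Dict[str, Any]) -> Dict[str, str]:
    """Return the RC's annotations, or an empty dict when none are set."""
    return rc.get("metadata", {}).get("annotations") or {}


def owning_deployment_config(rc: Dict[str, Any]) -> Optional[str]:
    """Return the name of the deployment config that created this RC, if recorded."""
    return _annotations(rc).get(DC_NAME_ANNOTATION)


def is_failed_deployment(rc: Dict[str, Any]) -> bool:
    """Return True when the RC's deployment phase annotation reads 'Failed'."""
    return _annotations(rc).get(PHASE_ANNOTATION) == "Failed"


# Hypothetical RC resembling the one in this report.
sample_rc = {
    "kind": "ReplicationController",
    "metadata": {
        "name": "hellow-2",
        "annotations": {
            DC_NAME_ANNOTATION: "hellow",
            PHASE_ANNOTATION: "Failed",
        },
    },
}

if is_failed_deployment(sample_rc):
    print(
        f'Deployment {sample_rc["metadata"]["name"]} of '
        f"{owning_deployment_config(sample_rc)} failed; check its events."
    )
```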
Comment 2 Jessica Forrester 2016-11-17 12:58:42 EST
Is this not something that is being handled with conditions? Or are conditions only showing up on the DC and not the RCs?
Comment 3 Jessica Forrester 2016-12-08 16:40:49 EST
We will have conditions on the RCs once the kube 1.5 rebase lands; then we can address this.
Comment 4 Jessica Forrester 2017-07-07 12:41:17 EDT
The conditions on RCs don't provide us any useful information about what might have gone wrong.  We now link you to both the log and the events for a failed deployment, which should help diagnose the problem.
Comment 5 Jessica Forrester 2017-07-07 12:42 EDT
Created attachment 1295345
failed-deployment-help
Comment 6 Jessica Forrester 2017-07-07 12:44:15 EDT
This is probably the best improvement we can make any time soon. There are too many reasons (events) that could be the underlying cause of a failure.
Comment 7 Jessica Forrester 2017-07-07 12:50:35 EDT
Just going to link to the overview redesign PR for this: https://github.com/openshift/origin-web-console/pull/1335
Comment 8 shahan 2017-07-11 03:59:27 EDT
Checked this issue in openshift v3.6.139. The web console now displays links to the logs and events to help users analyze the failure. See the attachments.
BTW, QE is checking many Modified bugs to see if they're verifiable. Because this one is fixed, I am moving it to Verified. If you have other concerns, please tell me. Thanks.
Comment 9 shahan 2017-07-11 04:01 EDT
Created attachment 1296107
help debug link
Comment 10 shahan 2017-07-11 04:02 EDT
Created attachment 1296108
useful info
Comment 12 errata-xmlrpc 2017-08-10 01:15:47 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2017:1716
