Bug 1830095 - Cluster overview control plane status reported as degraded when no components are degraded
Summary: Cluster overview control plane status reported as degraded when no components...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Management Console
Version: 4.5
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: 4.5.0
Assignee: Rastislav Wagner
QA Contact: Yadan Pei
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-04-30 20:16 UTC by Samuel Padgett
Modified: 2020-07-13 17:34 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Version: 4.5.0-0.nightly-2020-04-30-112808 Cluster ID: 13ace493-6d50-479a-aead-02258ba49019 Browser: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:75.0) Gecko/20100101 Firefox/75.0
Last Closed: 2020-07-13 17:34:07 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
Popover has no degraded components (154.41 KB, image/png)
2020-04-30 20:16 UTC, Samuel Padgett
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Github openshift console pull 5344 0 None closed Bug 1830095: Update Control Plane status logic to include no. of not healthy compo… 2021-02-09 09:02:38 UTC
Github openshift console pull 5418 0 None closed Bug 1830095: Update apiserver_request_count metric to apiserver_request_total 2021-02-09 09:02:39 UTC
Red Hat Product Errata RHBA-2020:2409 0 None None None 2020-07-13 17:34:22 UTC

Description Samuel Padgett 2020-04-30 20:16:05 UTC
Created attachment 1683448 [details]
Popover has no degraded components

See screenshot. "API Request Success Rate" is "Not available," which seems to be reported as degraded. Two issues here:

1. We should determine why "API Request Success Rate" metrics aren't available.
2. I don't think we should report "degraded" when we can't fetch metrics. This should be a different status.

Comment 1 Rastislav Wagner 2020-05-04 19:09:07 UTC
Adding Andy.

1. Instead of Not Available, we could show something like 'No data' ?
2. Any ideas what to show here ?

Comment 2 Andy Braren 2020-05-04 20:15:20 UTC
I believe we use "Not available" for similar states elsewhere, so that text string in the popover is probably fine. I may be missing some technical details though.

The substatus string below "Control Plane" should probably become "1 component not available" instead of "1 component degraded," and the icon should probably become the unknown icon, unless we think that's too harsh. That approach would align best with similar unknown/unavailable statuses in the Status card, like the Storage status seen in the screenshot with the unknown icon and a status of "Not available".

Does that approach sound good?

Comment 6 Samuel Padgett 2020-05-12 21:22:16 UTC
Moving back to assigned because we still need to understand why "API Request Success Rate" is unknown and fix or remove that query.

Comment 8 Yanping Zhang 2020-05-14 08:48:02 UTC
Checked on OCP 4.5 cluster with payload 4.5.0-0.nightly-2020-05-13-202437.
Go to Overview page, check "API Request Success Rate" under "Control Plane", it's show 100% now. The bug is fixed.

Comment 9 errata-xmlrpc 2020-07-13 17:34:07 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:2409


Note You need to log in before you can comment on or make changes to this bug.