Bug 1809253
Summary: | [4.3.z] 502 error for Prometheus API after the cluster running overnight | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Maru Newby <mnewby> | |
Component: | apiserver-auth | Assignee: | Maru Newby <mnewby> | |
Status: | CLOSED ERRATA | QA Contact: | Junqi Zhao <juzhao> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 4.3.0 | CC: | alegrand, anpicker, aos-bugs, erooth, juzhao, kakkoyun, lcosic, lmohanty, mfojtik, mloibl, pkrupa, slaznick, sttts, surbania | |
Target Milestone: | --- | Keywords: | Upgrades | |
Target Release: | 4.3.z | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | If docs needed, set a value | ||
Doc Text: | Story Points: | --- | ||
Clone Of: | 1801573 | |||
: | 1809258 (view as bug list) | Environment: | ||
Last Closed: | 2020-03-24 14:34:23 UTC | Type: | --- | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1801573 | |||
Bug Blocks: | 1809258 |
Comment 1
Scott Dodson
2020-03-18 15:22:14 UTC
Who is impacted? All customers that upgrade to 4.3.5. What is the impact? The service CA will be rotated on upgrade, which is intended to ensure against CA expiry. Without the fix for this bz, though, oauth-proxy will not automatically refresh to pick up the new key material. If not restarted before expiry of the pre-rotation CA, any attempt to communicate via oauth-proxy will result in tls validation errors which will break many of the monitoring components (see [1]). For 4.1 clusters upgraded to 4.3.5 this could occur as soon as May 14th 2020. How involved is remediation? Manual restart of the monitoring components that use oauth-proxy. Is this a regression? No. Without automated rotation, manual rotation (including pod restart) would be required anyway. 1: https://docs.google.com/document/d/1NB2wUf9e8XScfVM6jFBl8VuLYG6-3uV63eUpqmYE8Ts/edit Tested with 4.3.0-0.nightly-2020-03-19-052824 and followed the case OCP-27992, issue is not happen Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:0858 Removing UpgradeBlocker from this older bug, to remove it from the suspect queue described in [1]. If you feel like this bug still needs to be a suspect, please add keyword again. [1]: https://github.com/openshift/enhancements/pull/475 |