Bug 1779285

Summary: Kubelet metrics returned HTTP status 500 Internal Server Error
Product: OpenShift Container Platform Reporter: Ryan Phillips <rphillips>
Component: NodeAssignee: Ryan Phillips <rphillips>
Status: CLOSED CURRENTRELEASE QA Contact: Sunil Choudhary <schoudha>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.3.zCC: aos-bugs, cmarches, csekar, dcaldwel, florin-alexandru.peter, gblomqui, jokerman, juzhao, kechung, nagrawal, schoudha
Target Milestone: ---   
Target Release: 4.3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: When containers have a high (or > 1) restart count, the Kubelet could inject duplicate container metrics into the metrics stream. Consequence: This would cause the /metrics endpoint on the Kubelet to throw a 500 error. Fix: The fix is to only include metrics of most current (running or stopped) container. Result: The metrics endpoint will no longer throw a 500 and allow metrics to flow to Prometheus.
Story Points: ---
Clone Of: 1748073 Environment:
Last Closed: 2019-12-16 15:49:02 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1748073, 1781258, 1781281, 1781285    
Bug Blocks: