Bug 1881077
Summary: | [release 4.5] prometheus-operator: Fix bug in reflector not recovering from "Too large resource version" | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Simon Pasquier <spasquie> |
Component: | Monitoring | Assignee: | Lili Cosic <lcosic> |
Status: | CLOSED WONTFIX | QA Contact: | Junqi Zhao <juzhao> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.5 | CC: | alegrand, anpicker, erooth, juzhao, kakkoyun, lcosic, lszaszki, mloibl, oarribas, pkrupa, spasquie, surbania |
Target Milestone: | --- | ||
Target Release: | 4.5.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | |
Clone Of: | 1881072 | Environment: | |
Last Closed: | 2020-10-26 15:11:50 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1881072 | ||
Bug Blocks: | 1879901 |
Description
Simon Pasquier
2020-09-21 13:55:17 UTC
Tested with 4.5.0-0.nightly-2020-10-15-005105. Disconnected the node where the prometheus-operator pod is deployed from the network for a few minutes; after reconnection, there is no "Too large resource version" error in the prometheus-operator container.

Steps:
1. SSH to the node where prometheus-operator is deployed and run the script in the background. It disconnects the node from the network for 5 minutes and then reconnects it.
$ ./test.sh &
$ cat test.sh
sudo ifconfig ens3 down
sleep 300
sudo ifconfig ens3 up
2. Check the prometheus-operator logs after the node is reconnected; there should be no "Too large resource version" error.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.5.16 bug fix update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4268

*** Bug 1892590 has been marked as a duplicate of this bug. ***

Yes, we had to revert this fix in https://github.com/openshift/prometheus-operator/pull/99 specifically for prometheus-operator, as it was causing other issues. Hence marking this bug as CLOSED -> WONTFIX.
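
For anyone repeating the verification steps above, the log check in step 2 can be scripted. The following is only a minimal sketch, not part of the original test: `has_rv_error` is a hypothetical helper name, and the namespace `openshift-monitoring` and the target `deployment/prometheus-operator` in the usage comment are assumptions about a default cluster layout.

```shell
#!/bin/sh
# has_rv_error (hypothetical helper): reads log text on stdin and exits 0
# if the "Too large resource version" message appears, non-zero otherwise.
has_rv_error() {
  grep -qF "Too large resource version"
}

# Example usage (assumed namespace and deployment name, adjust as needed):
#   oc -n openshift-monitoring logs deployment/prometheus-operator \
#     -c prometheus-operator | has_rv_error \
#     && echo "error still present" || echo "no error found"
```

The helper only reads stdin, so it can also be run against a saved log file, for example `has_rv_error < operator.log`.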