Bug 1787581
Summary: | OCP 4.2.12: ingress and network operators degraded after upgrade to 4.3 | ||||||
---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Walid A. <wabouham> | ||||
Component: | Networking | Assignee: | Alexander Constantinescu <aconstan> | ||||
Networking sub component: | openshift-sdn | QA Contact: | huirwang | ||||
Status: | CLOSED ERRATA | Docs Contact: | |||||
Severity: | urgent | ||||||
Priority: | unspecified | CC: | aos-bugs, bbennett, bparees, ccoleman, huirwang, jokerman, lmohanty, mifiedle, scuppett, wking, zzhao | ||||
Version: | 4.2.z | ||||||
Target Milestone: | --- | ||||||
Target Release: | 4.4.0 | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | |||||||
: | 1787635 | Environment: | |||||
Last Closed: | 2020-05-04 11:22:00 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 1787635 | ||||||
Attachments: |
|
Description
Walid A.
2020-01-03 14:15:26 UTC
Looks like we are not cleaning up something in the IPAM code. I don't see enough pods in the logs to exhaust the range.
Weibin, can you try to reproduce this? There are insufficient logs to work out what the problem really is, so it would help to have a broken cluster we can dissect.
*** Bug 1788683 has been marked as a duplicate of this bug. ***
Setting priority appropriately; note that we had a CI failure in the dupe bug.
Going to set this to 4.4 and the clone to 4.3 (currently 4.3.z).
*** Bug 1789248 has been marked as a duplicate of this bug. ***
It looks like OpenShift can leak IP address allocations when a node reboots, since we don't get called by Kubelet for all of the pods that have gone away. The fix will be to remove the contents of /var/lib/cni/networks/openshift-sdn/ on a reboot.
Created attachment 1651387 [details]
cluster-loader.py config file
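For illustration only, the cleanup proposed above (emptying /var/lib/cni/networks/openshift-sdn/ after a reboot) could be sketched roughly as follows. This is not the code shipped in the erratum: the helper name clear_ipam_allocations and its return value are hypothetical, and the sketch assumes the directory holds only IPAM allocation state that is safe to discard before any pod has been started on the node.

```python
import shutil
from pathlib import Path

# Directory named in the comment above; everything else here is illustrative.
IPAM_DIR = Path("/var/lib/cni/networks/openshift-sdn")


def clear_ipam_allocations(ipam_dir: Path = IPAM_DIR) -> int:
    """Delete every entry under ipam_dir and return how many were removed.

    Hypothetical helper, not the actual openshift-sdn fix. The directory
    itself is kept so later allocations can be written into it.
    """
    if not ipam_dir.is_dir():
        return 0  # directory missing: nothing has been allocated yet
    removed = 0
    for entry in ipam_dir.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)  # remove a real subdirectory recursively
        else:
            entry.unlink()  # remove a regular file or a symlink
        removed += 1
    return removed
```

A cleanup like this presumably only makes sense very early after boot, before any pod sandbox is created; run later, it would also discard the allocations of pods that are still running.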
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2020:0581