Bug 1801890
Summary: | [OVN] Failed to create pod network sandbox due failed CNI requests (when scaling to 200 pods per node) | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Simon <skordas> |
Component: | Networking | Assignee: | Dan Williams <dcbw> |
Networking sub component: | ovn-kubernetes | QA Contact: | Simon <skordas> |
Status: | CLOSED DUPLICATE | Docs Contact: | |
Severity: | high | ||
Priority: | high | CC: | aconstan, anbhat, bbennett, dblack, mifiedle, mkarg, pportant, rkhan, wsun |
Version: | 4.4 | ||
Target Milestone: | --- | ||
Target Release: | 4.5.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | aos-scalability-44,SDN-CI-IMPACT | ||
Last Closed: | 2020-05-20 13:56:23 UTC | Type: | Bug |
Description
Simon
2020-02-11 20:26:02 UTC
@Ben How should we track scalability-related bugs such as this one? By tagging them?
/Alex

Hi Simon,
Could you re-test with the newer version of OVN? We've had a lot of performance improvements coming in recently and we suspect the issue might have been resolved.
Thanks in advance!
-Alex

Retest negative. The same issue.
oc get clusterversions
4.4.0-0.nightly-2020-03-02-201804
ovnkube version 0.3.0
ovn-controller (Open vSwitch) 2.12.0
OpenFlow versions 0x4:0x4

Marking TestBlocker for PerfScale pod density tests.

Dan, Aniket thinks you have some PRs in flight that help with this. When they land, can you get someone on our team to test this and then, if it is good, get Joe to kick off a new scale test (after a backport)?

Moved to 4.5, but any fix to this is a strong candidate for a 4.4 (or 4.3) backport.

Hi skordas,
Can I move the QE contact to you so that you can verify this bug once the issue is fixed? Thanks.

Will this be fixed in 4.4 before release? If yes, we should have a bug to track 4.4.

It's highly likely that both OVN and ovnkube scalability changes have fixed this issue (e.g., monitor-all and some ovnkube master things). Can we retest scaling to 200 nodes?