Bug 2089392
Summary: | 120 node baremetal upgrade from 4.9.29 --> 4.10.13 crashloops on machine-approver | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Dave Wilson <dwilson> |
Component: | Networking | Assignee: | Jaime CaamaƱo Ruiz <jcaamano> |
Networking sub component: | ovn-kubernetes | QA Contact: | Mike Fiedler <mifiedle> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | medium | ||
Priority: | medium | CC: | fbaudin, ffernand, mifiedle, rsevilla, surya, yprokule |
Version: | 4.10 | ||
Target Milestone: | --- | ||
Target Release: | 4.11.0 | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: When adding a network policy, all other policies logging configuration was being updated unnecessarily keeping ovn-kuberenetes busy and not attending other policy related requests.
Consequence: If there are a lot of network policies in the system, and a new one is added, there could be meaningful and noticeable latency on some concurrent or later network policy configuration changes being in effect.
Fix: All network policies are no longer unnecessarily updated when adding a new policy.
Result: Concurrent or later network policy configuration changes are applied in time.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2022-08-10 11:13:36 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 2099206 |
Description
Dave Wilson
2022-05-23 14:53:54 UTC
Just to add more context, as Dave mentioned, we loaded the cluster with the kubelet-density-cni-networkpolicy workload. Specifically 2500 iterations deploying: - Creates namespace kubelet-density-cni-networkpolicy with the following objects - 1 deny-all network policy in the - 2500 webserver applications (nginx) - 2500 services, each of them backing one of the previous applications - 2500 network policies - 2500 client applications. (cURL-ing the webserver service) PR that contains tentative fix (https://github.com/openshift/ovn-kubernetes/pull/1126) rectifies the issue and the upgrade complete in 1.5hrs, which is inline with expectations Marking verified pre-merge per comment 4. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:5069 |