Bug 2035344
| Summary: | kubemacpool-mac-controller-manager not ready | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | Container Native Virtualization (CNV) | Reporter: | Dan Kenigsberg <danken> | ||||||
| Component: | Networking | Assignee: | Ram Lavi <ralavi> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | awax | ||||||
| Severity: | high | Docs Contact: | |||||||
| Priority: | unspecified | ||||||||
| Version: | 4.9.1 | CC: | cnv-qe-bugs, ctomasko, ellorent, gveitmic, joedward, phoracek, ralavi | ||||||
| Target Milestone: | --- | Flags: | awax: needinfo- | ||||||
| Target Release: | 4.11.0 | ||||||||
| Hardware: | x86_64 | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | v4.11.0-156 | Doc Type: | Known Issue | ||||||
| Doc Text: |
On a large cluster, the OpenShift Virtualization MAC pool manager might take too much time to boot and OpenShift Virtualization might not become ready.
As a workaround, if you do not require MAC pooling functionality, then disable this sub-component by running the following command: `oc annotate --overwrite -n openshift-cnv hco kubevirt-hyperconverged 'networkaddonsconfigs.kubevirt.io/jsonpatch=[{"op": "replace","path": "/spec/kubeMacPool","value": null}]'`.
|
Story Points: | --- | ||||||
| Clone Of: | |||||||||
| : | 2056619 (view as bug list) | Environment: | |||||||
| Last Closed: | 2022-09-14 19:28:30 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Bug Depends On: | |||||||||
| Bug Blocks: | 2056619 | ||||||||
| Attachments: |
|
||||||||
|
Description
Dan Kenigsberg
2021-12-23 17:12:22 UTC
Created attachment 1847608 [details]
log from non-ready kubemacpool-mac-controller-manager
Created attachment 1847610 [details]
describe non-ready kubemacpool-mac-controller-manager
After some digging, it looks like InitMap takes a long time to finish (it does some API access per pod/VM), so the manager never reaches the webhook start and the readiness probe hits its timeout.
Possible solutions:
1. Increase the readiness probe timeout.
2. Remove all the per-pod/VM API access in InitMap:
   a. Use the controller-runtime client.
   b. Cache namespaces and the webhook configuration at the beginning.
3. Parallelize InitMap and use a sync.Map for the data structure.
I suggest we go for 2.a, so we get the cache for free and it is already well tested.
Workaround: completely avoid deploying KubeMacPool with:
kubectl annotate --overwrite -n openshift-cnv hco kubevirt-hyperconverged 'networkaddonsconfigs.kubevirt.io/jsonpatch=[{"op": "replace","path": "/spec/kubeMacPool","value": null}]'
(note the plural form of networkaddonsconfigs)
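For illustration only, here is a minimal Go sketch of how options 2.b and 3 above could fit together: a per-namespace cache so the expensive lookup runs once per namespace instead of once per pod/VM, plus a small worker pool writing into a sync.Map. This is not KubeMacPool's actual code. The Pod type, nsCache, initMapParallel, and the lookup callback are hypothetical names, and the callback stands in for whatever API access InitMap really performs.

```go
package main

import (
	"fmt"
	"sync"
)

// Pod is a hypothetical, trimmed-down stand-in for the pod/VM objects
// that InitMap walks; it is not a KubeMacPool or Kubernetes type.
type Pod struct {
	Namespace string
	Name      string
	MAC       string
}

// nsCache memoizes a per-namespace decision so that the expensive lookup
// (an API call in a real controller) runs at most once per namespace.
type nsCache struct {
	mu      sync.Mutex
	managed map[string]bool
	lookup  func(ns string) bool
}

func newNSCache(lookup func(string) bool) *nsCache {
	return &nsCache{managed: map[string]bool{}, lookup: lookup}
}

// isManaged holds the mutex while calling lookup. That serializes lookups,
// but guarantees each namespace is resolved exactly once.
func (c *nsCache) isManaged(ns string) bool {
	c.mu.Lock()
	defer c.mu.Unlock()
	if v, ok := c.managed[ns]; ok {
		return v
	}
	v := c.lookup(ns)
	c.managed[ns] = v
	return v
}

// initMapParallel stores MAC -> "namespace/name" for every pod that has a
// MAC and lives in a managed namespace, using a fixed pool of workers.
func initMapParallel(pods []Pod, workers int, cache *nsCache) *sync.Map {
	if workers < 1 {
		workers = 1 // avoid blocking forever on the unbuffered channel
	}
	var macs sync.Map
	jobs := make(chan Pod)
	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for p := range jobs {
				if p.MAC == "" || !cache.isManaged(p.Namespace) {
					continue
				}
				macs.Store(p.MAC, p.Namespace+"/"+p.Name)
			}
		}()
	}
	for _, p := range pods {
		jobs <- p
	}
	close(jobs)
	wg.Wait()
	return &macs
}

func main() {
	lookups := 0
	cache := newNSCache(func(ns string) bool {
		lookups++ // safe: only called while nsCache holds its mutex
		return ns != "kube-system"
	})
	pods := []Pod{
		{"ns1", "vm-a", "02:00:00:00:00:01"},
		{"ns1", "vm-b", "02:00:00:00:00:02"},
		{"kube-system", "pod-c", "02:00:00:00:00:03"},
	}
	macs := initMapParallel(pods, 4, cache)
	macs.Range(func(k, v interface{}) bool {
		fmt.Println(k, "->", v)
		return true
	})
	fmt.Println("namespace lookups:", lookups)
}
```

Option 2.a (the controller-runtime client) would get a similar effect with less custom code, since its cached client serves reads from a local informer cache rather than hitting the API server per object.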
Targeting 4.11. We want to take our time to properly design the solution. The workaround is described above.
@ralavi Please update the Doc Type and Doc Text fields.
Because this issue is now resolved, it is no longer a known issue. The documentation team will exclude the known issue from the 4.11 release notes.
@ctomc, is removing the release note from the BZ and setting the status to "If docs needed, set a value" good enough?
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Virtualization 4.11.0 Images security and bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:6526 |