Bug 2122119
Summary: | Virtual machine fails to start with error "Unable to use native AIO: failed to create linux AIO context: Resource temporarily unavailable" | ||
---|---|---|---|
Product: | Container Native Virtualization (CNV) | Reporter: | nijin ashok <nashok> |
Component: | Virtualization | Assignee: | Igor Bezukh <ibezukh> |
Status: | CLOSED ERRATA | QA Contact: | guy chen <guchen> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.10.4 | CC: | acardace, dholler, fdeutsch, germano, guchen, gveitmic, ibezukh, jhopper, ycui |
Target Milestone: | --- | Keywords: | Scale |
Target Release: | 4.13.0 | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2023-05-18 02:55:41 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
nijin ashok
2022-08-29 08:15:46 UTC
Hi, Unlike RHV, in OCP case the CNV doesn't manage the nodes directly, CNV is an operator installed as an add-on on the cluster. Because of that, the tuning of the RHCOS nodes needs to be done on the OCP level using MachineConfig or the Node Tune Operator. This cannot be done automatically as part of CNV deployment, since then it will cause issue with HyperShift support. Perhaps what we can do is produce some kind of event that the CNV worker nodes are not tuned. Also, aio-nr per VM also depends on VMs IO threads policy. It is 1024 * disks if the disks are having dedicated IO threads. The High-Performance VM templates have got "ioThreadsPolicy: shared" and will enable one io threads and for these VMs it will be 2048 per VM. We've found a way to make this tune be part of default OCP installation, details are in this ticket: https://issues.redhat.com/browse/PSAP-900 Tested the verified the following scenario on lab with OCP 4.12.5 and CNV is 4.12.0. Checked aio on the node before starting VMS : sudo sysctl -a | grep aio fs.aio-max-nr = 1048576 fs.aio-nr = 2048 1. Start a VM with disk aio=native. Use'd block local storage ODF. 2. fs.aio-nr is incremented by 1024 for every new VMs. 3. Started 81 VMS on a single node Checked aio on the node with the VMS : sudo sysctl -a | grep aio fs.aio-max-nr = 1048576 fs.aio-nr = 84992 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Virtualization 4.13.0 Images security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2023:3205 |