Bug 2168346
| Summary: | VM stuck at CrashLoopBackOff state after it hits OOM | | |
|---|---|---|---|
| Product: | Container Native Virtualization (CNV) | Reporter: | Boaz <bbenshab> |
| Component: | Virtualization | Assignee: | Igor Bezukh <ibezukh> |
| Status: | CLOSED MIGRATED | QA Contact: | Kedar Bidarkar <kbidarka> |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.11.3 | CC: | aasserzo, acardace, jhopper, jlejosne, qixuan.wang, yadu, ycui |
| Target Milestone: | --- | | |
| Target Release: | 4.15.1 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | | |
| : | 2173980 (view as bug list) | Environment: | |
| Last Closed: | 2023-12-14 16:12:47 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 2173980, 2225204 | | |
Description
Boaz
2023-02-08 18:51:37 UTC
It doesn't look like a storage component issue; moving to the Virt component.

There's an interesting error there:
internal error: UNIX socket path '/var/run/kubevirt-private/libvirt/qemu/channel/target/domain-214-default_rhel82-vm000/org.qemu.guest_agent.0' too long
This is indeed 108 characters long, 1 more than the 107 allowed by Linux. I think "214" here is the number of times the VM rebooted. This means VMs can only be rebooted 98 times. We need to address that. I don't see why, as far as libvirt is concerned, VMs couldn't just be called "vm" instead of "<namespace>_<VMI name>".

(In reply to Jed Lejosne from comment #2)
> [...]
> This means VMs can only be rebooted 98 times. We need to address that.
This is actually incorrect: VMs need to actually crash for that number to increase, so that's not such a big deal. However, @bbenshab, please give more information on how you managed to trigger the OOM killer. If that was solely by doing things from inside the guest, then that's a problem. No matter what guests do, that should not cause virt-launcher to run out of memory...

Deferring to 4.14 due to capacity.

Clone of libvirt fix for RHEL 9.2.0.z: https://bugzilla.redhat.com/show_bug.cgi?id=2233744

Hi,
The libvirt fix will be available in RHEL 9.2.0.z batch update 3, which will be released on 12-09-2023. The CNV blocker-only date is 05-09-2023. I would suggest deferring the bug to 4.15.

Deferring to 4.15.1 due to capacity.

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days
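
The path-length arithmetic from the comments above can be reproduced with a short Python sketch. It assumes the socket path always follows the layout seen in the single error message quoted in this bug; the helper names `agent_socket_path` and `fits` are hypothetical and are not part of libvirt or KubeVirt. The 107-character limit comes from the 108-byte `sun_path` field of `sockaddr_un` on Linux, which has to hold a trailing NUL as well.

```python
# On Linux, sockaddr_un.sun_path is 108 bytes including the trailing NUL,
# so a filesystem socket path can be at most 107 bytes long.
SUN_PATH_MAX = 107

# Assumption: path layout inferred from the one error message in this bug.
CHANNEL_DIR = "/var/run/kubevirt-private/libvirt/qemu/channel/target"
AGENT_SOCKET = "org.qemu.guest_agent.0"


def agent_socket_path(domain_id, namespace, vmi_name):
    """Build the guest agent socket path for a given libvirt domain ID."""
    domain_dir = f"domain-{domain_id}-{namespace}_{vmi_name}"
    return f"{CHANNEL_DIR}/{domain_dir}/{AGENT_SOCKET}"


def fits(path):
    """Return True if the path fits in sun_path with room for the NUL."""
    return len(path.encode()) <= SUN_PATH_MAX


if __name__ == "__main__":
    for domain_id in (99, 100, 214):
        path = agent_socket_path(domain_id, "default", "rhel82-vm000")
        status = "ok" if fits(path) else "too long"
        print(domain_id, len(path), status)
```

For this particular namespace and VMI name, a two-digit domain ID gives a 107-character path that still fits, while any three-digit ID gives 108 characters and is rejected. A longer namespace or VMI name would reach the limit at a lower ID.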