Bug 2000364
Summary: | Engine fails to start, unable to read cloud-init network config from stateless snapshot configuration. | ||
---|---|---|---|
Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Germano Veit Michel <gveitmic> |
Component: | ovirt-engine | Assignee: | Saif Abusaleh <sabusale> |
Status: | CLOSED ERRATA | QA Contact: | Qin Yuan <qiyuan> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.4.7 | CC: | ahadas, emarcus, jspanko, mavital, mburman, mn.albeschenko, mperina, qiyuan |
Target Milestone: | ovirt-4.4.9 | Keywords: | ZStream |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | ovirt-engine-4.4.9.2 | Doc Type: | Bug Fix |
Doc Text: |
Previously, on Manager startup, system threads may have been used to retrieve the virtual machine configuration from stateless snapshots, causing the Manager to fail to start.
In this release, the way of retrieving the virtual machine configuration from stateless snapshots on the Manager was changed to avoid using the system thread and only use application threads. AS a result, the Manager can start when stateless snapshots with cloud-init network properties are defined.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-11-16 14:46:57 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Germano Veit Michel
2021-09-02 00:24:32 UTC
I'm under the impression the issue reproduces if there is no guest-agent/OS and the VM is started first time when the stateless snapshot is created, as if its related to the guest agent reporting something back (vNICs?) Still a bit confusing though. The documentation text flag should only be set after 'doc text' field is provided. Please provide the documentation text and set the flag to '?' again. Verified with: ovirt-engine-4.4.9.3-0.3.el8ev.noarch Steps: 1. Create one stateless VM with cloud-init network(no OS, one NIC device, one disk), run the VM, restart ovirt-engine. 2. Compare the restart times of ovirt-engine 4.4.9.3 and 4.4.9.1 when there are many running stateless VMs: 1) create 1000 stateless VMs with the following configurations: - 1M mem(because of the resource limitation) - 1M disk(because of the resource limitation) - no console(because of the port number limitation) - no NIC device(because of the MAC resource limitation) - no OS - cloud-init enabled - cloud-init network configured(for 4.4.9.1 not configured, otherwise the engine can't be restarted) 2) run the 1000 VMs 3) restart ovirt-engine 10 times, record restart times, calculate the average value. 4) compare the average restart times of ovirt-engine 4.4.9.3 and 4.4.9.1 3. Run cloud-init automation tests. Results: 1. ovirt-engine can be restarted when there is one running stateless VM with cloud-init network configured. 2. ovirt-engine can be restarted when there are 1000 running stateless VMs with cloud-init network configured. 3. The average restart time of ovirt-engine 4.4.9.3 is 5 seconds longer than that of ovirt-engine 4.4.9.1 when there are 1000 running stateless VMs. Checked with Arik and Saif, the overhead looks reasonable. 4. No regression issue found in cloud-init automation tests. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: RHV Manager (ovirt-engine) security update [ovirt-4.4.9]), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:4626 |