Bug 1282581
Summary: | VDSM failure on RNG device conf on VM migration after upgrade to oVirt 3.6 | ||||||
---|---|---|---|---|---|---|---|
Product: | [oVirt] vdsm | Reporter: | Shmuel Melamud <smelamud> | ||||
Component: | Core | Assignee: | Shmuel Melamud <smelamud> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Nisim Simsolo <nsimsolo> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | unspecified | ||||||
Version: | 4.17.11 | CC: | amureini, bugs, danken, gklein, jas, mgoldboi, michal.skrivanek, nsimsolo, philippe, smelamud | ||||
Target Milestone: | ovirt-3.6.1 | Flags: | rule-engine:
ovirt-3.6.z+
mgoldboi: planning_ack+ rule-engine: devel_ack+ mavital: testing_ack+ |
||||
Target Release: | 4.17.12 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2016-02-23 09:19:01 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 1285700 | ||||||
Attachments: |
|
Description
Shmuel Melamud
2015-11-16 19:26:49 UTC
Why isn't this a dup of bug 1233825 ? (In reply to Yaniv Kaul from comment #1) > Why isn't this a dup of bug 1233825 ? This is completely different problem. Bug 1233825 is about incompatibility between the Engine and VDSM: Engine didn't add 'device' field that VDSM was relying on. But this problem was fixed before 3.6 release: now the Engine always puts the 'device' field and VDSM uses 'type' to distinguish RNG devices. But in this bug the VM with an RNG device was started on old VDSM that's completely unaware of RNG devices. The old VDSM created unknown device conf instead of RNG device conf. When some of the hosts in the cluster were upgraded to the recent VDSM, the VM was migrated to an upgraded host and the recent VDSM treats this unknown device conf (that was preserved during migration) as RNG device conf (because it contains 'type'==RNG) and fails while trying to read 'source' spec param that's absent. (In reply to Shmuel Melamud from comment #2) > (In reply to Yaniv Kaul from comment #1) > > Why isn't this a dup of bug 1233825 ? > > This is completely different problem. Bug 1233825 is about incompatibility > between the Engine and VDSM: Engine didn't add 'device' field that VDSM was > relying on. But this problem was fixed before 3.6 release: now the Engine > always puts the 'device' field and VDSM uses 'type' to distinguish RNG > devices. > > But in this bug the VM with an RNG device was started on old VDSM that's > completely unaware of RNG devices. The old VDSM created unknown device conf > instead of RNG device conf. When some of the hosts in the cluster were > upgraded to the recent VDSM, the VM was migrated to an upgraded host and the > recent VDSM treats this unknown device conf (that was preserved during > migration) as RNG device conf (because it contains 'type'==RNG) and fails > while trying to read 'source' spec param that's absent. Ok, I still think it's two sides of the same issue. But if this is the exact issue, please fix the title of the bug to reflect it. The title is not mentioning that the problem is with the RNG device, for example. Still need to update flags (I don't have permissions for this) and create clone for 3.6. Fixed bug tickets must have version flags set prior to fixing them. Please set the correct version flags and move the bugs back to the previous status after this is corrected. please use this same bug for 3.6 backport actually made it in 3.6.1 Bug tickets that are moved to testing must have target release set to make sure tester knows what to test. Please set the correct target release before moving to ON_QA. Verified. - Build versions used before upgrading to RHEVM 3.6: rhevm-3.5.6.2-0.1.el6ev host: qemu-kvm-rhev-0.12.1.2-2.479.el6_7.3.x86_64 libvirt-client-0.10.2-54.el6_7.2.x86_64 vdsm-4.16.34-2.el6ev.x86_64 sanlock-2.8-2.el6_5.x86_64 - Build version used for upgrading engine: rhevm-3.6.3.2-0.1.el6 - build versions of host added after the migration: qemu-kvm-rhev-2.3.0-31.el7_2.7.x86_64 libvirt-client-1.2.17-13.el7_2.2.x86_64 vdsm-4.17.21-0.el7ev.noarch sanlock-3.2.4-1.el7.x86_64 Verification scenario: 1. Use rhevm-3.5, enable RNG (/dev/random source) on the cluster and create RHEL7 VM with RNG enabled (also /dev/random source). 2. Run VM and verify host /dev/random is attached to VM properly: On host run the next commands: #lsof /dev/random (fetch qemu-kvm PID) #ps -aux | grep [qemu-kvm PID] Verify QEMU process is running with the next parameters: -object rng-random,id=rng0,filename=/dev/random -device virtio-rng-pci,rng=rng0 3. Verify host and VM RNG functionality using the next commands: VM: #dd count=100 bs=100000 if=/dev/random of=/dev/stdout | hexdump -c on host verify entropy starvation does not occur: #watch -d -n 1 cat /proc/sys/kernel/random/entropy_avail 4. Upgrade engine to 3.6 using the next procedure: #yum install http://bob.eng.lab.tlv.redhat.com/builds/latest_3.6/rhev-release-3.6.3-3-001.noarch.rpm #yum update rhevm-setup #engine-setup 5. After engine is up, validate VM and host RNG functionallity (step 3 procedure). Add another host to cluster with latest VDSM (vdsm-4.17.21-0.el7ev.noarch) 6. Migrate VM to the new host, verify VM is not going down and RNG functionality is not affected by the migration. 7. Migrate VM back to the first host and verify VM is not going down and RNG functionality is not affected by the migration. 8. Repeat steps 5-6 few times. |