Bug 1526133
Summary: | [SR-IOV] hot-plug of vNIC on running VM fails with VDSErrorException | ||||||
---|---|---|---|---|---|---|---|
Product: | [oVirt] vdsm | Reporter: | Mor <mkalfon> | ||||
Component: | Core | Assignee: | Milan Zamazal <mzamazal> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Mor <mkalfon> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 4.20.9.1 | CC: | bugs, danken, edwardh, mburman, mkalfon, mzamazal | ||||
Target Milestone: | ovirt-4.2.1 | Keywords: | Automation, Regression | ||||
Target Release: | --- | Flags: | rule-engine:
ovirt-4.2+
rule-engine: blocker+ |
||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: |
Hot plug of a SR-IOV network device didn't work properly, the hot plugged device wasn't displayed in the UI and couldn't be hot unplugged. It has been fixed and SR-IOV network device hot plug should work properly now.
|
Story Points: | --- | ||||
Clone Of: | Environment: | ||||||
Last Closed: | 2018-02-12 11:47:27 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Mor
2017-12-14 20:06:40 UTC
Created attachment 1368141 [details]
logs
Milan, could you help me understand this repeating traceback? I suspect you did not test your recent changes to hotplug with sr-iov. 2017-12-14 22:01:04,117+0200 ERROR (jsonrpc/7) [virt.vm] (vmId='0fb617b0-ca53-4220-b558-2fe940ec0ab2') Error fetching vm stats (vm:1766) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 1762, in _getRunningVmStats vm_sample.interval) File "/usr/lib/python2.7/site-packages/vdsm/virt/vmstats.py", line 45, in produce networks(vm, stats, first_sample, last_sample, interval) File "/usr/lib/python2.7/site-packages/vdsm/virt/vmstats.py", line 322, in networks if nic.name.startswith('hostdev'): AttributeError: name This bug report has Keywords: Regression or TestBlocker. Since no regressions or test blockers are allowed between releases, it is also being identified as a blocker for this release. Please resolve ASAP. This is what happens: - SR-IOV NIC hotplug is called. - Such a hotplug is called without `network' parameter (network=null). - The NIC is successfully hotplugged in libvirt. - Metadata update follows in Vdsm, which doesn't expect that `network' is missing, resulting in AttributeError. - NIC device instance initialization in Vdsm remains incomplete, resulting in the vmstats errors. - Following hotplug attempts of the same NIC fail, apparently since the device is already present, but Engine doesn't know about it. So the problem is not in my recent patches (which are not present in that version anyway), but in missing `network' parameter in metadata update. We don't know whether the parameter should be present with SR-IOV hotplug or not, depending on that the bug is either in Engine or in Vdsm metadata update. Mor, can you find in which Vdsm/Engine versions were last where sr-iov hotplug has been passing? Is it correct that `network' parameter is null on SR-IOV hotplug? If yes, we should fix metadata update in Vdsm, if no then it should be fixed in Engine and perhaps Vdsm should reject hotplug in such a case. (In reply to Milan Zamazal from comment #6) > Is it correct that `network' parameter is null on SR-IOV hotplug? Yes, I think so. AFAIR when `hostdev` is passed, `network` does not. Too bad that this (as the `hostdev` element itself) is not documented in lib/vdsm/api/vdsm-api.yml. > If yes, we > should fix metadata update in Vdsm would you, Milan? Thank you Dan for info, I'll try to make a fix. Dan, I tried to look for success on running the affected test case in Jenkins, but I couldn't found one on the downstream RHV 4.2 build. (In reply to Mor from comment #9) > Dan, I tried to look for success on running the affected test case in > Jenkins, but I couldn't found one on the downstream RHV 4.2 build. Mor, I'm not sure I understand your English. Would you please install Milan's vdsm from http://jenkins.ovirt.org/job/vdsm_master_build-artifacts-on-demand-el7-x86_64/703/ and test sriov hotplug on it? (In reply to Dan Kenigsberg from comment #10) > (In reply to Mor from comment #9) > > Dan, I tried to look for success on running the affected test case in > > Jenkins, but I couldn't found one on the downstream RHV 4.2 build. > > Mor, I'm not sure I understand your English. Would you please install > Milan's vdsm from > http://jenkins.ovirt.org/job/vdsm_master_build-artifacts-on-demand-el7- > x86_64/703/ and test sriov hotplug on it? Which one should I check? There are two builds: fc26 and fcraw. (In reply to Mor from comment #11) > > Which one should I check? There are two builds: fc26 and fcraw. Neither. My link leads you to an el7 build. Anyway, we have verified the patch by applying manually. hotplug sriov is an advanced feature. ykaul believes it should not block 4.2.0. Verified on RHV 4.2.1-0.2.el7 This bugzilla is included in oVirt 4.2.1 release, published on Feb 12th 2018. Since the problem described in this bug report should be resolved in oVirt 4.2.1 release, it has been closed with a resolution of CURRENT RELEASE. If the solution does not work for you, please open a new bug report. |