Bug 1539040
Summary: | host-deploy stops libvirt-guests triggering a shutdown of all the running VMs (including HE one) | ||
---|---|---|---|
Product: | [oVirt] ovirt-host-deploy | Reporter: | Simone Tiraboschi <stirabos> |
Component: | Plugins.VDSM | Assignee: | Simone Tiraboschi <stirabos> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Nikolai Sednev <nsednev> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 1.7.0 | CC: | bugs, cshao, danken, dguo, huzhao, lsvaty, phbailey, pkotas, qiyuan, rbarry, sabose, sbonazzo, weiwang, yaniwang, ycui, yisong, yzhao |
Target Milestone: | ovirt-4.2.1 | Keywords: | Triaged |
Target Release: | --- | Flags: | rule-engine: ovirt-4.2+, rule-engine: blocker+ |
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | libvirt-guests is now required for the new graceful shutdown feature, but host-deploy was explicitly stopping it, which triggered a shutdown of all running VMs (including the HE one). |
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2018-02-12 11:53:41 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | Integration | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1458698 | ||
Bug Blocks: | 1478904 |
Description
Simone Tiraboschi
2018-01-26 13:20:02 UTC
I think it got introduced here: https://gerrit.ovirt.org/#/c/79840/

The commit message says:
- the setup introduced by this commit is non intrusive
  the libvirt-guests by default still remains disabled and stopped
  the libvirt-guests by default configuration cannot connect to the libvirt, because they do not have proper connection string, therefore they can't touch the running VMs in any way

This explains why libvirt-guests doesn't kick in on the first attempt, due to:
Jan 26 11:25:04 c74he20180108h1.localdomain libvirt-guests.sh[15210]: Please enter your authentication name: Please enter your password:
On subsequent attempts, however, it becomes effective because of the leftover configuration.

Petr, should we simply stop libvirt-guests while deploying hosted-engine? Is there any other way to prevent libvirt-guests from acting on HostedEngineLocal?

Please note that there is probably also something not working correctly on the host-deploy side. On the first attempt as well, at
[ INFO ] TASK [Wait for the host to be up]
host-deploy has already configured and started vdsm and libvirtd (with SASL authentication) and, directly or indirectly, libvirt-guests. In that case, though, libvirt-guests still has to be restarted to consume its new configuration, so on the first attempt we see:
Jan 26 11:25:04 c74he20180108h1.localdomain virsh[15215]: All-whitespace username.
Jan 26 11:25:04 c74he20180108h1.localdomain libvirt-guests.sh[15210]: Please enter your authentication name: Please enter your password:
simply because libvirt-guests doesn't correctly authenticate to libvirtd over SASL.

(In reply to Simone Tiraboschi from comment #2)
> Please note that probably there is also something not correctly working on
> host-deploy side.
This is probably because libvirt and vdsm are still configured and started by host-deploy, while libvirt-guests is configured in a second step via Ansible:
https://github.com/oVirt/ovirt-engine/blob/master/packaging/playbooks/roles/ovirt-host-deploy-libvirt-guests/tasks/main.yml

*** Bug 1539734 has been marked as a duplicate of this bug. ***

We have discussed this issue and could not find a suitable workaround that would provide a hot-fix.

The libvirt-guests service is configured so that it shuts down all running VMs before the host shuts down. The VM shutdown is triggered when the service is stopped. This is proper behavior and should not be changed.

Unfortunately, libvirt-guests is stopped during the ovirt-host-deploy phase, which results in stopping the running engine VM. We were able to pinpoint the source of this pressing issue to the otopi part of ovirt-host-deploy, namely these lines:
https://github.com/oVirt/ovirt-host-deploy/blob/master/src/plugins/ovirt-host-deploy/vdsm/packages.py#L122
https://github.com/oVirt/ovirt-host-deploy/blob/master/src/plugins/ovirt-host-deploy/vdsm/packages.py#L164

The right place to solve this issue is inside the legacy otopi code. This is not a trivial change: currently otopi only allows services to be stopped and started, not restarted in a single shot:
https://github.com/oVirt/otopi/blob/master/src/plugins/otopi/services/systemd.py#L134

*** Bug 1538938 has been marked as a duplicate of this bug. ***

*** Bug 1539563 has been marked as a duplicate of this bug. ***

Works for me on these components:
rhvm-appliance-4.2-20180202.0.el7.noarch
ovirt-hosted-engine-setup-2.2.9-1.el7ev.noarch
ovirt-hosted-engine-ha-2.2.4-1.el7ev.noarch
Red Hat Enterprise Linux Server release 7.4 (Maipo)

Moving to verified.

http://pastebin.test.redhat.com/552873

This bugzilla is included in the oVirt 4.2.1 release, published on Feb 12th 2018.

Since the problem described in this bug report should be resolved in the oVirt 4.2.1 release, it has been closed with a resolution of CURRENT RELEASE. If the solution does not work for you, please open a new bug report.
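
For readers following the thread, the stop/start problem discussed in the comments can be illustrated with a small sketch. This is not the actual ovirt-host-deploy or otopi code, and it is not necessarily how the fix was implemented. The names `STOP_SENSITIVE_UNITS` and `plan_service_actions` are hypothetical, and the unit names are only examples. The idea shown is simply that a deployment pass which stops and later starts services should leave out any unit whose stop action has side effects on running guests, such as libvirt-guests.

```python
from typing import Iterable, List, Tuple

# Assumption for illustration only: units whose "stop" action has side
# effects on running guests, so a deployment pass must never stop them.
STOP_SENSITIVE_UNITS = frozenset({"libvirt-guests"})

Action = Tuple[str, str]  # (systemctl verb, unit name)


def plan_service_actions(units: Iterable[str]) -> List[Action]:
    """Build the ordered stop/start plan for a reconfiguration pass.

    Ordinary units are stopped in the given order and started again in
    reverse order. Stop-sensitive units are left out of the plan entirely,
    so they keep running and no guest shutdown is triggered.
    """
    ordinary = [u for u in units if u not in STOP_SENSITIVE_UNITS]
    stops = [("stop", u) for u in ordinary]
    starts = [("start", u) for u in reversed(ordinary)]
    return stops + starts


if __name__ == "__main__":
    for verb, unit in plan_service_actions(["libvirt-guests", "vdsmd", "libvirtd"]):
        # A real deployment would hand these to the service manager;
        # here they are only printed.
        print(f"systemctl {verb} {unit}")
```

The sketch deliberately ignores unit dependencies and whether a skipped unit still needs to pick up new configuration, which is the harder part raised in the comments above.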