Bug 1647906
| Summary: | systemctl stop ovirt-guest-agent fails (stop-sigterm timed out. Killing) | | |
|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | Polina <pagranat> |
| Component: | BLL.Virt | Assignee: | Tomáš Golembiovský <tgolembi> |
| Status: | CLOSED DUPLICATE | QA Contact: | Polina <pagranat> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.3.0 | CC: | bugs, michal.skrivanek, pagranat, rbarry |
| Target Milestone: | --- | Keywords: | Automation, Regression |
| Target Release: | --- | | |
| Hardware: | x86_64 | | |
| OS: | Linux | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2018-11-15 16:31:40 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Attachments: |
This may not be fixed, as a transition to the qemu guest agent is expected in 4.3.

Actually a duplicate of bug 1640479, which is itself a duplicate of bug 1640193, fixed by bug 1635687.

Please add host version info and/or retest and close.

Updated the US environment to the latest version and the problem still happens.
ovirt-engine-4.3.0-0.0.master.20181101091940.git61310aa.el7.noarch
ovirt-release-master-4.3.0-0.1.master.20181003000238.git13297cd.el7.noarch
host info:
OS Version: RHEL - 7.6 - 4.el7
OS Description: Red Hat Enterprise Linux
Kernel Version: 3.10.0 - 957.el7.x86_64
KVM Version: 2.12.0 - 18.el7_6.2
LIBVIRT Version: libvirt-4.5.0-10.el7
VDSM Version: vdsm-4.30.1-49.gitcbf040c.el7
CPU Type: Intel Haswell-noTSX IBRS SSBD Family

You're using an old libvirt version; as per bug 1635687, the required version is libvirt-4.5.0-10.el7_6.2.x86_64.

Hi Michal, we are actually talking about the same version, libvirt-4.5.0-10.el7_6.2.x86_64, and I reproduce the problem with this version.

Then please specify the versions correctly in the bug report. libvirt-4.5.0-10.el7 and libvirt-4.5.0-10.el7_6.2 are (unfortunately) very different.

Can you provide the corresponding vdsm.log and/or access to the setup where this is happening, to double-check? Thanks.

Created attachment 1505698 [details]
vdsm.log
The vdsm debug log is attached.
The problem is reproduced on the setup:
engine: compute-ge-6.scl.lab.tlv.redhat.com
host: puma42.scl.lab.tlv.redhat.com
You are right about the versions:
in our upstream environments, libvirt is taken from the rhel-7.6-zstream repo, so it is currently libvirt-4.5.0-10.el7.x86_64.
As the setup is fixed, the issue could be re-tested.

I still see libvirt-4.5.0-10.el7 on puma42. What you need is libvirt-4.5.0-10.el7_6.2 (released by https://access.redhat.com/errata/RHBA-2018:3339). Whatever name your repo has, it apparently doesn't contain the right packages.

Re-tested on libvirt-4.5.0-10.el7_6.2. Works correctly.

*** This bug has been marked as a duplicate of bug 1635687 ***
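
The two libvirt builds discussed in the comments differ only in the release suffix (10.el7 versus 10.el7_6.2), which is easy to overlook when reading version strings by eye. A minimal Python sketch of a strict check follows, assuming package strings of the form name-version-release with an optional .x86_64 or .noarch suffix, as quoted in this report. The helper names `split_nvr` and `is_required_build` are hypothetical, and this is not how rpm itself orders versions; it only tests for an exact match against the build named in the comments.

```python
REQUIRED = "libvirt-4.5.0-10.el7_6.2"  # build named in the comments above


def split_nvr(package):
    """Split 'name-version-release[.arch]' into (name, version, release)."""
    # Only the two arch suffixes that appear in this report are handled.
    for arch in (".x86_64", ".noarch"):
        if package.endswith(arch):
            package = package[: -len(arch)]
            break
    name, version, release = package.rsplit("-", 2)
    return name, version, release


def is_required_build(installed, required=REQUIRED):
    """Return True only if name, version and release all match exactly."""
    return split_nvr(installed) == split_nvr(required)


print(is_required_build("libvirt-4.5.0-10.el7.x86_64"))
print(is_required_build("libvirt-4.5.0-10.el7_6.2.x86_64"))
```

An exact match is deliberately strict: a later build would presumably also carry the fix, but deciding that requires rpm's own version ordering, which this sketch does not attempt.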
Created attachment 1503359 [details]
journalctl -xe output

Description of problem:
An attempt to stop the ovirt-guest-agent service fails. Regression from 4.2.

Version-Release number of selected component (if applicable):
qemu-guest-agent-2.12.0-2.el7.x86_64
ovirt-guest-agent-common-1.0.14-1.20180608132658.gitdcb03c3.el7.noarch
ovirt-release-master-4.3.0-0.1.master.20181025000106.git023a723.el7.noarch

How reproducible: 100%

Steps to Reproduce:
1. Start a VM and log in to the running VM.
2. systemctl stop ovirt-guest-agent

Actual results:
The command hangs for about 90 seconds and then fails. The journalctl -xe output is attached.

systemctl status ovirt-guest-agent
● ovirt-guest-agent.service - oVirt Guest Agent
   Loaded: loaded (/usr/lib/systemd/system/ovirt-guest-agent.service; enabled; vendor preset: disabled)
   Active: active (running) since Thu 2018-11-08 13:35:28 IST; 2h 1min ago
  Process: 3068 ExecStartPre=/bin/chown ovirtagent:ovirtagent /run/ovirt-guest-agent.pid (code=exited, status=0/SUCCESS)
  Process: 3058 ExecStartPre=/bin/touch /run/ovirt-guest-agent.pid (code=exited, status=0/SUCCESS)
  Process: 3034 ExecStartPre=/sbin/modprobe virtio_console (code=exited, status=0/SUCCESS)
 Main PID: 3078 (python)
   CGroup: /system.slice/ovirt-guest-agent.service
           └─3078 /usr/bin/python /usr/share/ovirt-guest-agent/ovirt-guest-agent.py

[root@dhcp160-221 ~]# systemctl stop ovirt-guest-agent
[root@dhcp160-221 ~]# systemctl status ovirt-guest-agent
● ovirt-guest-agent.service - oVirt Guest Agent
   Loaded: loaded (/usr/lib/systemd/system/ovirt-guest-agent.service; enabled; vendor preset: disabled)
   Active: failed (Result: signal) since Thu 2018-11-08 15:38:08 IST; 53s ago
  Process: 3078 ExecStart=/usr/bin/python /usr/share/ovirt-guest-agent/ovirt-guest-agent.py (code=killed, signal=KILL)
  Process: 3068 ExecStartPre=/bin/chown ovirtagent:ovirtagent /run/ovirt-guest-agent.pid (code=exited, status=0/SUCCESS)
  Process: 3058 ExecStartPre=/bin/touch /run/ovirt-guest-agent.pid (code=exited, status=0/SUCCESS)
  Process: 3034 ExecStartPre=/sbin/modprobe virtio_console (code=exited, status=0/SUCCESS)
 Main PID: 3078 (code=killed, signal=KILL)

Nov 08 13:35:28 localhost.localdomain systemd[1]: Starting oVirt Guest Agent...
Nov 08 13:35:28 localhost.localdomain systemd[1]: Started oVirt Guest Agent.
Nov 08 15:36:38 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: Stopping oVirt Guest Agent...
Nov 08 15:38:08 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: ovirt-guest-agent.service stop-sigterm timed out. Killing.
Nov 08 15:38:08 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: ovirt-guest-agent.service: main process exited, code=killed, status=9/KILL
Nov 08 15:38:08 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: Stopped oVirt Guest Agent.
Nov 08 15:38:08 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: Unit ovirt-guest-agent.service entered failed state.
Nov 08 15:38:08 dhcp160-221.scl.lab.tlv.redhat.com systemd[1]: ovirt-guest-agent.service failed.

Expected results:
The service is stopped successfully.

Additional info:
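
The journal lines above show the escalation that produced the failure: the service was asked to stop at 15:36:38, did not exit, and was killed with signal 9 at 15:38:08. A minimal Python sketch of that SIGTERM, wait, SIGKILL pattern follows, assuming a POSIX system. `stop_with_timeout` is a hypothetical helper and is neither systemd's nor the agent's code; the child process here simply ignores SIGTERM to imitate a service that never exits, and a 1 second timeout stands in for the 90 seconds seen in the log.

```python
import signal
import subprocess
import sys


def stop_with_timeout(proc, timeout):
    """Send SIGTERM, wait up to `timeout` seconds, then fall back to SIGKILL."""
    proc.terminate()  # SIGTERM on POSIX
    try:
        proc.wait(timeout=timeout)
        return "terminated"
    except subprocess.TimeoutExpired:
        proc.kill()  # SIGKILL on POSIX; cannot be caught or ignored
        proc.wait()
        return "killed"


# Stand-in for a stuck service: ignore SIGTERM, report readiness, then sleep.
child_code = (
    "import signal, time\n"
    "signal.signal(signal.SIGTERM, signal.SIG_IGN)\n"
    "print('ready', flush=True)\n"
    "time.sleep(60)\n"
)

proc = subprocess.Popen([sys.executable, "-c", child_code], stdout=subprocess.PIPE)
proc.stdout.readline()  # block until the child has installed its handler
result = stop_with_timeout(proc, timeout=1)
proc.stdout.close()

print(result, proc.returncode)
```

On POSIX, a negative `returncode` is the number of the signal that ended the child, which corresponds to the `code=killed, signal=KILL` entries in the status output.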