Bug 981974 - Libvirt daemon no longer respawns after crash
Product: Virtualization Tools
Classification: Community
Component: libvirt
Hardware: Unspecified Linux
Priority: unspecified Severity: unspecified
Assigned To: Eric Blake
Blocks: 958367
Reported: 2013-07-07 08:17 EDT by Mooli Tayer
Modified: 2013-07-31 18:58 EDT
Doc Type: Bug Fix
Last Closed: 2013-07-31 18:58:55 EDT
Type: Bug
Description Mooli Tayer 2013-07-07 08:17:14 EDT
Description of problem:
The libvirt daemon no longer respawns after a crash as it used to.

Currently it does not respawn on Fedora 18 but does respawn on RHEL 6.4.

It should respawn after a crash, as it always has. This is important: we have seen it crash due to bugs before.

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. pkill libvirt
2. pgrep libvirt

Actual results:
Libvirt respawns on RHEL, but not on Fedora.

Expected results:
Libvirt should respawn on both.

Additional info:
Comment 1 Daniel Berrange 2013-07-29 05:36:15 EDT
Libvirt has never been set to respawn on crash, by default, in either RHEL or Fedora. The behaviour you describe in RHEL must be the custom handling that VDSM sets up with upstart.

That said, I've no objection to configuring the systemd unit to make it restart in Fedora.
Comment 2 Eric Blake 2013-07-29 11:15:13 EDT
Patch proposed upstream, although it will have to wait until after the 1.1.1 release as we are in freeze and it represents a new feature.
Comment 3 Mooli Tayer 2013-07-30 08:42:51 EDT
Second patch submitted upstream:
Comment 4 Eric Blake 2013-07-31 18:58:55 EDT
Now applied; will be in 1.1.2.

commit 6f4a0ebe8c83d9952c30b657e42a08760549eda7
Author: Mooli Tayer <mtayer@redhat.com>
Date:   Tue Jul 30 15:32:59 2013 +0300

    Configuring systemd to restart libvirt on failure
    This will create a respawn behaviour in case libvirt
    process exits due to nonzero exit code, is terminated
    by a signal, an operation times out or the configured
    watchdog timeout is triggered.
    see http://www.freedesktop.org/software/systemd/man/systemd.service.html
    and https://bugzilla.redhat.com/show_bug.cgi?id=981974
    Signed-off-by: Eric Blake <eblake@redhat.com>

This bug was originally filed against upstream; if this is needed in Fedora 19, we can reopen and move the bug there to get this patch backported.
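
For readers on a libvirt build that predates the fix, the behaviour described in the commit message can be approximated locally with a systemd drop-in. The following is a minimal sketch, not the upstream patch itself: it assumes the unit is named libvirtd.service, that the restart conditions listed in the commit message correspond to systemd's Restart=on-failure setting, and it uses a hypothetical DROPIN_DIR variable and restart.conf file name chosen for illustration. By default it writes to a scratch directory so that running it does not modify system configuration.

```shell
#!/bin/sh
# Sketch: write a systemd drop-in asking systemd to restart a service on failure.
# Assumptions (not taken from the bug report): the unit is libvirtd.service and
# the drop-in file is called restart.conf.

write_restart_dropin() {
    dir="$1"
    mkdir -p "$dir" || return 1
    # Restart=on-failure covers non-zero exit codes, termination by an
    # unclean signal, operation timeouts and watchdog timeouts.
    cat > "$dir/restart.conf" <<'EOF'
[Service]
Restart=on-failure
EOF
}

# DROPIN_DIR is a hypothetical override. It defaults to a scratch directory;
# on a real host it would be /etc/systemd/system/libvirtd.service.d
DROPIN_DIR="${DROPIN_DIR:-$(mktemp -d)/libvirtd.service.d}"
write_restart_dropin "$DROPIN_DIR"
echo "wrote $DROPIN_DIR/restart.conf"
```

To apply it for real, one would presumably run the script as root with DROPIN_DIR=/etc/systemd/system/libvirtd.service.d and then run `systemctl daemon-reload`. Note that, per the systemd.service documentation, termination by SIGTERM counts as a clean exit, so a plain `pkill` (which sends SIGTERM by default) is not expected to trigger a restart under on-failure; a crash is more closely simulated with something like `pkill -KILL libvirtd`.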
