Bug 1569614 - IOERROR pause code lost after resuming a VM while I/O error is still present
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: libvirt
Version: 7.5
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: Jiri Denemark
QA Contact: Yanqiu Zhang
URL:
Whiteboard:
Depends On:
Blocks: 1526025
Reported: 2018-04-19 15:04 UTC by Markus Armbruster
Modified: 2018-06-26 15:56 UTC
CC List: 22 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Under certain circumstances, resuming a paused guest generated redundant "VIR_DOMAIN_PAUSED_UNKNOWN" error messages in the libvirt log. This update corrects the event sending order when resuming guests, which prevents the errors from being logged.
Clone Of: 1566153
Environment:
Last Closed: 2018-05-29 21:51:27 UTC
Target Upstream Version:
Embargoed:


Comment 2 Markus Armbruster 2018-04-19 15:18:55 UTC
When a VM is paused due to a storage error, libvirt emits a
corresponding life cycle event with the VIR_DOMAIN_PAUSED_IOERROR reason
and then a VIR_DOMAIN_EVENT_ID_IO_ERROR_REASON event. The state of
the VM is also set appropriately:

  # virsh -r domstate 2 --reason
  paused (I/O error)

When the VM is then resumed manually while the I/O error still persists, it gets paused again immediately. However, in that case the life cycle event contains the VIR_DOMAIN_PAUSED_UNKNOWN reason. The I/O error is also no longer reported when asking for the VM state:

  # virsh -r domstate 2 --reason
  paused (unknown)

Additionally, the events arrive in an unexpected order:

- IO_ERROR event
- RESUME event
- PAUSED event

That means the real pause reason is lost.

This happens because libvirt gets confused by the QMP events it receives from qemu-kvm after the resume: first BLOCK_IO_ERROR, then RESUME, then STOP.

I guess libvirt would be fine if qemu-kvm sent them in the more natural order RESUME, BLOCK_IO_ERROR, STOP.
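
To illustrate why the ordering matters, here is a minimal Python sketch of a management layer that derives the pause reason from the QMP event stream. It is a toy model, not libvirt's actual code: the function name `replay`, the reason strings, and in particular the assumption that a RESUME event discards any previously remembered I/O error are all hypothetical and chosen only to reproduce the behavior described above.

```python
# Toy model only: NOT libvirt's implementation. It assumes (hypothetically)
# that a RESUME event clears any I/O error remembered from earlier events.

def replay(events):
    """Process QMP event names in order and return (state, pause_reason)."""
    state, reason = "running", None
    io_error_seen = False  # set by BLOCK_IO_ERROR, consumed by the next STOP

    for event in events:
        if event == "BLOCK_IO_ERROR":
            io_error_seen = True
        elif event == "RESUME":
            state, reason = "running", None
            io_error_seen = False  # assumption: resuming forgets old errors
        elif event == "STOP":
            state = "paused"
            reason = "I/O error" if io_error_seen else "unknown"
            io_error_seen = False
        else:
            raise ValueError(f"unexpected event: {event}")

    return state, reason


# Order reported in this bug: the error arrives before RESUME and is lost.
observed = replay(["BLOCK_IO_ERROR", "RESUME", "STOP"])

# The "more natural" order: the error arrives after RESUME and is kept.
natural = replay(["RESUME", "BLOCK_IO_ERROR", "STOP"])

print(observed)
print(natural)
```

Under these assumptions, the first sequence ends paused with an unknown reason, while the second ends paused with the I/O error reason preserved.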

Perhaps we can fix qemu-kvm to do that (bug 1566153), and perhaps making libvirt cope with the current order won't be necessary then.  This bug tracks possible libvirt work in case we can't fix qemu-kvm, or libvirt needs to cope with unfixed versions of qemu-kvm.

For detailed reproducers see bug 1566153.

Comment 3 Jiri Denemark 2018-05-29 21:51:27 UTC
QEMU fixed the order of emitted events and no additional libvirt work is needed.

