Bug 723198 - No timeout on migration when connection between hosts crashes
Summary: No timeout on migration when connection between hosts crashes
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: vdsm
Version: 6.2
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: rc
: ---
Assignee: Erez Shinan
QA Contact: Tomas Dosek
URL:
Whiteboard:
Depends On: 725373 799478
Blocks:
TreeView+ depends on / blocked
 
Reported: 2011-07-19 11:20 UTC by Tomas Dosek
Modified: 2012-03-02 17:39 UTC (History)
6 users (show)

Fixed In Version: vdsm-4.9-85
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2011-12-06 07:31:51 UTC
Target Upstream Version:


Attachments (Terms of Use)
vdsm.log (428.58 KB, application/octet-stream)
2011-07-19 11:20 UTC, Tomas Dosek
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHEA-2011:1782 0 normal SHIPPED_LIVE new packages: vdsm 2011-12-06 11:55:51 UTC

Description Tomas Dosek 2011-07-19 11:20:07 UTC
Created attachment 513773 [details]
vdsm.log

Description of problem:
When connection between hosts (SPM and non-SPM) crashes during a VM migration, there's no timeout - the VM stays in locked mode, vdsm on SPM host tries to migrate the vm repeatedly (possibly forever).

Version-Release number of selected component (if applicable):
vdsm-4.9-82.el6.x86_64
libvirt-0.9.2-1.el6.x86_64
qemu-kvm-0.12.1.2-2.171.el6.x86_64

How reproducible:
100 %

Steps to Reproduce:
1. Have a working setup - 2 hosts (without fencing), NFS storage (probably irrelevant) 
2. Start a vm (on SPM host), when it's up perform migration to the non-SPM host
3. Right after it starts block communication between SPM and non-SPM host using iptables (but not between hosts and storage!)
  
Actual results:
Vdsm keeps trying to migrate the vm (possibly forever), vm is in locked mode

Expected results:
Migration fails with proper error shown to user after some timeout - when data transferred to the second host stays on 1% for more then 10 mins it definitely means that something isn't ok.

Additional info:
Attaching vdsm.log

Comment 4 Erez Shinan 2011-07-21 13:52:38 UTC
http://gerrit.usersys/#change,732

Comment 6 Tomas Dosek 2011-07-29 09:09:37 UTC
Verified - vdsm-4.9-86.el6 - timeout was set in vdsm therefore migration (from vdsm point of view) can't be proceeding forever.

Comment 7 errata-xmlrpc 2011-12-06 07:31:51 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHEA-2011-1782.html


Note You need to log in before you can comment on or make changes to this bug.