Bug 1188854
Summary: | An HA VM ended up running on 2 hosts after the engine thought that an apparently successful migration had failed. | ||
---|---|---|---|
Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Gordon Watson <gwatson> |
Component: | ovirt-engine | Assignee: | Nobody <nobody> |
Status: | CLOSED DUPLICATE | QA Contact: | |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 3.3.0 | CC: | ecohen, gklein, iheim, lpeer, lsurette, ofrenkel, pstehlik, rbalakri, rgolan, Rhev-m-bugs, yeylon |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Linux | ||
Whiteboard: | virt | ||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2015-02-16 09:33:16 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Gordon Watson
2015-02-03 20:50:49 UTC
Created attachment 987834 [details]
vdsm log from host 'h0080d'
The issue here is that "migrating_to" field of the vm had the wrong host id, in this case it had the source host itself, so later, when migration succeeded, the hand-over process updated the "run_on" field with the wrong id (of the source) making it think the vm is missing (because it was not running on the source host anymore), and therefor re-starting it because its HA. this issue was solved by fixing the retry timing of maintenance in Bug 1104030 - Failed VM migrations do not release VM resource lock properly leading to failures in subsequent migration attempts and by clearing old migrations information in Bug 1112359 - Failed to remove host xxxxxxxx both bugs already merged to latest 3.4.z |