Bug 1647388
| Summary: | [downstream clone - 4.2.8] Power on on already powered on host sets VMs as down and results in split-brain | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Virtualization Manager | Reporter: | RHV bug bot <rhv-bugzilla-bot> |
| Component: | ovirt-engine | Assignee: | Eli Mesika <emesika> |
| Status: | CLOSED ERRATA | QA Contact: | Petr Matyáš <pmatyas> |
| Severity: | high | Docs Contact: | |
| Priority: | urgent | ||
| Version: | 4.2.6 | CC: | audgiri, gveitmic, lleistne, mgoldboi, mperina, ratamir, Rhev-m-bugs |
| Target Milestone: | ovirt-4.2.8 | Keywords: | ZStream |
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | ovirt-engine-4.2.8.1 | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | 1641882 | Environment: | |
| Last Closed: | 2019-01-22 12:44:51 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | Infra | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1641882 | ||
| Bug Blocks: | |||
|
Description
RHV bug bot
2018-11-07 11:44:13 UTC
Customer hit this on 4.1.6. On newer RHEL versions, and depending on the storage, qemu-kvm improved image locking saves the VM from running twice - "Failed to get "write" lock Is another process using the image?.". A VM lease would do the same). However, the engine bug is still there should be fixed. (Originally by Germano Veit Michel) Customer hit this on 4.1.6. On newer RHEL versions, and depending on the storage, qemu-kvm improved image locking saves the VM from running twice - "Failed to get "write" lock Is another process using the image?.". A VM lease would do the same). However, the engine bug is still there should be fixed. (Originally by Germano Veit Michel) Please attach the full log , it is mandatory for fully understanding the flow (Originally by Eli Mesika) (In reply to Eli Mesika from comment #2) > Please attach the full log , it is mandatory for fully understanding the flow I'm attaching the customer's logs (4.1.6) because mine (4.2.6 from comment #0) are gone as I redeployed. Look for this correlation ID - 20bb556f-05f5-43ad-80e0-514b14042b59 You will see a PM START on an already ON host, which triggered VM_WAS_SET_DOWN_DUE_TO_HOST_REBOOT_OR_MANUAL_FENCE on some VMs, which the customer lost due to corruption. This is easily reproducible on 4.2.6 too. (Originally by Germano Veit Michel) Verified on ovirt-engine-4.2.8.1-0.1.el7ev.noarch Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2019:0121 |