Description of problem:
When a merge snapshot operation fails, the snapshot is left in status BROKEN and is unusable. Instead, we could merge the disk snapshots we know have succeeded, and mark only the disk whose merge operation failed as broken.

Version-Release number of selected component (if applicable):
ovirt-3.4-beta1

How reproducible:
100%

Steps to Reproduce:
1. Create a DC
2. Create a cluster with compatibility level >=3.4
3. Add a host
4. Create a storage domain
5. Create a VM with three disks
6. Fill up one of the disks so it's considerably larger than the others
7. Take a snapshot
8. Fill up the same disk, so its snapshot is also considerably larger than the others
9. Merge the entire snapshot
10. Wait for the two small disks' merge operations to complete, and restart the host (note: since it's the only host, it's also the SPM)

Actual results:
The entire snapshot is broken.

Expected results:
The two smaller disks should be removed from the snapshot (as they have been merged successfully), and only the larger one should be marked as broken.
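The expected results above can be sketched as follows. This is only an illustration of the requested per-disk handling, not the engine's actual code: `Status`, `Snapshot`, and `apply_merge_results` are hypothetical names, and the real engine tracks images rather than a plain dictionary.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    OK = "OK"
    BROKEN = "BROKEN"


@dataclass
class Snapshot:
    # Hypothetical model: maps a disk name to the status of its image
    # in this snapshot.
    disks: dict = field(default_factory=dict)

    @property
    def status(self) -> Status:
        # The snapshot counts as BROKEN only while it still holds a broken disk.
        if any(s is Status.BROKEN for s in self.disks.values()):
            return Status.BROKEN
        return Status.OK


def apply_merge_results(snapshot, results):
    """Apply per-disk merge outcomes; results maps disk name -> success flag."""
    for disk, merged in results.items():
        if merged:
            # Merged successfully: the disk no longer belongs to the snapshot.
            snapshot.disks.pop(disk, None)
        else:
            # Only the disk whose merge failed is marked as broken.
            snapshot.disks[disk] = Status.BROKEN
    return snapshot


# Scenario from the steps above: two small disks merge, the large one fails.
snap = Snapshot({"small1": Status.OK, "small2": Status.OK, "large": Status.OK})
apply_merge_results(snap, {"small1": True, "small2": True, "large": False})
print(sorted(snap.disks))  # ['large']
```

With this model, a retry of the removal would only need to deal with the one remaining broken disk instead of the whole snapshot.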
This is an automated message. Re-targeting all non-blocker bugs still open on 3.4.0 to 3.4.1.
*** Bug 996945 has been marked as a duplicate of this bug. ***
*** Bug 1096508 has been marked as a duplicate of this bug. ***
*** Bug 1082655 has been marked as a duplicate of this bug. ***
verified on rc1
Hello,

I can also verify this problem on RHEV 3.4.

During snapshot removal, the load on our SPM node skyrocketed and the node was thrown out of the cluster.

After rebooting and activating the node, 3 VMs have BROKEN snapshots, which we can't remove.

We get this error in the user interface:
Error while executing action RemoveSnapshot: Image does not exist in domain

I will provide the ovirt-engine log from this problem.

I really need some sort of solution. Is there any workaround?
Created attachment 944265
log of failed BROKEN snapshot removal

Attached is the ovirt-engine log during failed removal of BROKEN snapshot.
(In reply to gody1 from comment #6)
> Hello,
>
> I can also verify this problem on RHEV 3.4.
>
> During snapshot removal our SPM node load skyrocket and it was thrown out of
> cluster.
>
> After rebooting and activating the node 3 VMs have BROKEN snapshots, which
> we can't remove.
>
> We get this error in user interface:
> Error while executing action RemoveSnapshot: Image does not exist in domain
>
> Will provide log of ovirt-engine during this problem.
>
> I really need some sort of solution, is there any workaround ?

Daniel?
(In reply to gody1 from comment #6)
> Hello,
>
> I can also verify this problem on RHEV 3.4.
>
> During snapshot removal our SPM node load skyrocket and it was thrown out of
> cluster.
>
> After rebooting and activating the node 3 VMs have BROKEN snapshots, which
> we can't remove.
>
> We get this error in user interface:
> Error while executing action RemoveSnapshot: Image does not exist in domain
>
> Will provide log of ovirt-engine during this problem.
>
> I really need some sort of solution, is there any workaround ?

This issue seems more closely related to bug 996945, which addresses failure to remove a broken snapshot.
Up until 3.5, the behavior has been to mark a snapshot as broken on any failure while removing it. Deleting the broken snapshot can fail as well, for the same reason it was marked as broken in the first place. Hence, the alternative is a manual cleanup of the snapshot.

Are you looking for a workaround just to remove it from the snapshots list in the UI/REST API, or do the snapshot's disks still exist on the storage?
oVirt 3.5 has been released and should include the fix for this issue.
*** Bug 1149770 has been marked as a duplicate of this bug. ***
Works on 4.3