Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 949310

Summary: [engine-backend] Data corruption in master SD metadata after setting SPM to maintenance and reactivate it
Product: Red Hat Enterprise Virtualization Manager Reporter: Elad <ebenahar>
Component: ovirt-engineAssignee: Ayal Baron <abaron>
Status: CLOSED NOTABUG QA Contact: Elad <ebenahar>
Severity: high Docs Contact:
Priority: unspecified    
Version: 3.2.0CC: acathrow, dyasny, hateya, iheim, lpeer, Rhev-m-bugs, yeylon, ykaplan, ykaul
Target Milestone: ---   
Target Release: 3.2.0   
Hardware: x86_64   
OS: Unspecified   
Whiteboard: storage
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-04-08 19:30:36 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
logs none

Description Elad 2013-04-07 16:37:05 UTC
Created attachment 732409 [details]
logs

Description of problem:
Data corruption after setting SPM to maintenance and reactivate it again

Version-Release number of selected component (if applicable):
RHEVM - rhevm-backend-3.2.0-10.18.beta2.el6ev.noarch
VDSM - vdsm-4.10.2-14.0.el6ev.x86_64
Libvirt - libvirt-0.10.2-18.el6_4.2.x86_64
Qemu-KVM - qemu-kvm-rhev-0.12.1.2-2.348.el6.x86_64
Sanlock - sanlock-2.6-2.el6.x86_64


How reproducible:
100%

Steps to Reproduce: in 2 hosts setup with 2 iSCSI SD's 
1. Maintenance to SPM and activate it right after

  
Actual results:
2013-04-07 19:17:39,459 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStartVDSCommand] (QuartzScheduler_Worker-16) [4ad2cee9] Start SPM Task failed - result: cleanSuccess, message: VDSGenericException: VDSErrorException: Failed in vdscommand to HSMGetTaskStatusVDS, error = BlockSD master file system FSCK error

There is a corruption in master domain metadata and Engine fails to select SPM  

Expected results:
There should not be data corruption and engine should be able to select SPM

Additional info: see logs attached

Comment 4 Haim 2013-04-08 19:30:36 UTC
configuration issue with storage server, closing till reproduced again after storage is fixed.