Bug 783930

Summary: [VDSM][Storage]: Unable to move SPM to an Active host, when original host lost connectivity
Product: [Retired] oVirt Reporter: zvi <zfridler>
Component: vdsmAssignee: Dan Kenigsberg <danken>
Status: CLOSED WONTFIX QA Contact:
Severity: medium Docs Contact:
Priority: unspecified    
Version: unspecifiedCC: abaron, acathrow, amureini, bazulay, iheim, yeylon, ykaul
Target Milestone: ---   
Target Release: 3.3.4   
Hardware: i686   
OS: Unspecified   
Whiteboard: storage
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-03-12 09:37:43 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
engine + vdsm logs none

Description zvi 2012-01-23 09:51:00 UTC
Created attachment 556884 [details]
engine + vdsm logs

Description of problem:

Unable to move SPM to an Active host.

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Working setup with 2 hosts. host A is Active and SPM. host B is in maintenence .
2.Disconnect host A from network .
3.Activate host B. Verify it is active.
4.On host A - Confirm host has been rebooted 

See attached vdsm and engine logs. 


Actual results:
SPM is locked on a "dead" host A.
Error messages

Expected results:
Host B is SPM

Additional info:
Host A dosn't have PM configuration in engine.

SecureError
Thread-1959::DEBUG::2012-01-23 11:02:08,850::task::874::TaskManager.Task::(_run) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::Task._run: 9a8feb84-fd3f-45b7-b656-2c2ecd12c994 ('eedadac2-0559-4037-98
c5-669b059f7ab2', 5, '14') {} failed - stopping task
Thread-1959::DEBUG::2012-01-23 11:02:08,850::task::1201::TaskManager.Task::(stop) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::stopping in state preparing (force False)
Thread-1959::DEBUG::2012-01-23 11:02:08,851::task::980::TaskManager.Task::(_decref) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::ref 1 aborting True
Thread-1959::INFO::2012-01-23 11:02:08,851::task::1159::TaskManager.Task::(prepare) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::aborting: Task is aborted: '' - code 100
Thread-1959::DEBUG::2012-01-23 11:02:08,851::task::1164::TaskManager.Task::(prepare) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::Prepare: aborted: 
Thread-1959::DEBUG::2012-01-23 11:02:08,852::task::980::TaskManager.Task::(_decref) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::ref 0 aborting True
Thread-1959::DEBUG::2012-01-23 11:02:08,852::task::915::TaskManager.Task::(_doAbort) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::Task._doAbort: force False
Thread-1959::DEBUG::2012-01-23 11:02:08,853::resourceManager::841::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-1959::DEBUG::2012-01-23 11:02:08,853::task::588::TaskManager.Task::(_updateState) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::moving from state preparing -> state aborting
Thread-1959::DEBUG::2012-01-23 11:02:08,853::task::537::TaskManager.Task::(__state_aborting) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::_aborting: recover policy none
Thread-1959::DEBUG::2012-01-23 11:02:08,854::task::588::TaskManager.Task::(_updateState) Task=`9a8feb84-fd3f-45b7-b656-2c2ecd12c994`::moving from state aborting -> state failed
Thread-1959::DEBUG::2012-01-23 11:02:08,854::resourceManager::806::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {'Storage.eedadac2-0559-4037-98c5-669b059f7ab2': < Reso
urceRef 'Storage.eedadac2-0559-4037-98c5-669b059f7ab2', isValid: 'True' obj: 'None'>}
Thread-1959::DEBUG::2012-01-23 11:02:08,854::resourceManager::841::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-1959::DEBUG::2012-01-23 11:02:08,855::resourceManager::535::ResourceManager::(releaseResource) Trying to release resource 'Storage.eedadac2-0559-4037-98c5-669b059f7ab2'
Thread-1959::DEBUG::2012-01-23 11:02:08,855::resourceManager::550::ResourceManager::(releaseResource) Released resource 'Storage.eedadac2-0559-4037-98c5-669b059f7ab2' (0 active users)
Thread-1959::DEBUG::2012-01-23 11:02:08,856::resourceManager::555::ResourceManager::(releaseResource) Resource 'Storage.eedadac2-0559-4037-98c5-669b059f7ab2' is free, finding out if anyone is waiti
ng for it.
Thread-1959::DEBUG::2012-01-23 11:02:08,856::resourceManager::562::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.eedadac2-0559-4037-98c5-669b059f7ab2', Clearing records
.
Thread-1959::ERROR::2012-01-23 11:02:08,857::dispatcher::93::Storage.Dispatcher.Protect::(run) 
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/dispatcher.py", line 85, in run
    result = ctask.prepare(self.func, *args, **kwargs)
  File "/usr/share/vdsm/storage/task.py", line 1166, in prepare
    raise self.error
SecureError
Thread-1963::DEBUG::2012-01-23 11:02:15,578::task::588::TaskManager.Task::(_updateState) Task=`77f1960c-90d9-42b5-8575-dd51fc228527`::moving from state init -> state preparing
Thread-1963::INFO::2012-01-23 11:02:15,578::logUtils::37::dispatcher::(wrapper) Run and protect: repoStats(options=None)
Thread-1963::INFO::2012-01-23 11:02:15,579::logUtils::39::dispatcher::(wrapper) Run and protect: repoStats, Return response: {'924de01f-187f-411b-8615-36ba2a6a5e0e': {'delay': '0.000598192214966', 
'lastCheck': 1327309325.865981, 'code': 0, 'valid': True}, '183d2408-1f0c-4ea4-ae8e-00dd3eca15f8': {'delay': '0.00192904472351', 'lastCheck': 1327309334.789749, 'code': 0, 'valid': True}}
Thread-1963::DEBUG::2012-01-23 11:02:15,579::task::1174::TaskManager.Task::(prepare) Task=`77f1960c-90d9-42b5-8575-dd51fc228527`::finished: {'924de01f-187f-411b-8615-36ba2a6a5e0e': {'delay': '0.000
598192214966', 'lastCheck': 1327309325.865981, 'code': 0, 'valid': True}, '183d2408-1f0c-4ea4-ae8e-00dd3eca15f8': {'delay': '0.00192904472351', 'lastCheck': 1327309334.789749, 'code': 0, 'valid': T
rue}}
Thread-1963::DEBUG::2012-01-23 11:02:15,580::task::588::TaskManager.Task::(_updateState) Task=`77f1960c-90d9-42b5-8575-dd51fc228527`::moving from state preparing -> state finished

Comment 2 Itamar Heim 2013-03-12 09:37:43 UTC
Closing old bugs. If this issue is still relevant/important in current version, please re-open the bug.