Hide Forgot
Description of problem: in deactivateStorageDomain on migrate master scenario, vdsm takes old values of master version, and thus, fails to connect storage pool (pool not connected). Thread-2904::INFO::2011-01-17 15:15:44,269::dispatcher::95::irs::Run and protect: deactivateStorageDomain, args: ( sdUUID=ae0b976c-83b0-458c-be2a-265637529d78 spUUID=04422aa0-39e6-475c-adac-ffb2ddf1e40c msdUUID=29b93fd7-1a68-406e-bfcf-3e85828575b7 masterVersion=2) MainThread::ERROR::2011-01-17 16:15:17,891::misc::65::irs::Wrong Master domain or its version: 'SD=ae0b976c-83b0-458c-be2a-265637529d78, pool=04422aa0-39e6-475c-adac-ffb2ddf1e40c' MainThread::ERROR::2011-01-17 16:15:17,892::misc::66::irs::Traceback (most recent call last): File "/usr/share/vdsm/storage/hsm.py", line 223, in __init__ self._restorePool(spUUID) File "/usr/share/vdsm/storage/hsm.py", line 417, in _restorePool pool.reconnect() File "/usr/share/vdsm/storage/sp.py", line 514, in reconnect return self.connect(hostId, scsiKey, msdUUID, masterVersion) File "/usr/share/vdsm/storage/sp.py", line 411, in connect mDom = self.getMasterDomain(msdUUID=msdUUID, masterVersion=masterVersion) File "/usr/share/vdsm/storage/sp.py", line 1223, in getMasterDomain self.masterDomain = self.findMasterDomain(msdUUID=msdUUID, masterVersion=masterVersion) File "/usr/share/vdsm/storage/sp.py", line 1293, in findMasterDomain raise e StoragePoolWrongMaster: Wrong Master domain or its version: 'SD=ae0b976c-83b0-458c-be2a-265637529d78, pool=04422aa0-39e6-475c-adac-ffb2ddf1e40c' MainThread::INFO::2011-01-17 16:15:17,910::dispatcher::139::irs::Starting StorageDispatcher... Thread-17::INFO::2011-01-17 16:15:18,411::dispatcher::95::irs::Run and protect: getSpmStatus, args: ( spUUID=04422aa0-39e6-475c-adac-ffb2ddf1e40c) Thread-17::DEBUG::2011-01-17 16:15:18,411::task::577::irs::Task 459b58b1-297d-493a-9963-eb170ad729bb: moving from state init -> state preparing Thread-17::ERROR::2011-01-17 16:15:18,412::misc::65::irs::Unknown pool id, pool not connected: ('04422aa0-39e6-475c-adac-ffb2ddf1e40c',) Thread-17::ERROR::2011-01-17 16:15:18,414::misc::66::irs::Traceback (most recent call last): File "/usr/share/vdsm/storage/task.py", line 978, in _run return fn(*args, **kargs) File "/usr/share/vdsm/storage/spm.py", line 578, in public_getSpmStatus hsm.HSM.validateConnectedPool(spUUID) File "/usr/share/vdsm/storage/hsm.py", line 86, in validateConnectedPool raise se.StoragePoolUnknown(spUUID) StoragePoolUnknown: Unknown pool id, pool not connected: ('04422aa0-39e6-475c-adac-ffb2ddf1e40c',) backend then send connectStorageServer and connectStoragePool again, and host is connected to the pool. repro steps: 1) work with several storage domains 2) put master domain in maintenance notes: 1) Ayal reviewed this bug on rhel 5.5.6 and asked to open a bug on rhel6 (2.3) 2) see attached log
note: restart means restart of vdsm service.
Created attachment 474036 [details] vdsm log.
Patches in gerrit: http://gerrit.usersys.redhat.com/247
verified on vdsm-4.9-62, migrated master domain several times, restarted service, and operation passed as expected.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHEA-2011-1782.html