Bug 690079

Summary: [vdsm][storage]ConnectStoragePool (HSM) fails on a large scale deployment
Product: Red Hat Enterprise Linux 6 Reporter: Moran Goldboim <mgoldboi>
Component: vdsm Assignee: Saggi Mizrahi <smizrahi>
Status: CLOSED ERRATA QA Contact: Haim <hateya>
Severity: high Docs Contact:
Priority: unspecified    
Version: 6.1 CC: abaron, bazulay, danken, iheim, sgrinber, yeylon, ykaul
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: vdsm-4.9-66 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-12-06 07:09:49 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Bug Depends On: 664432    
Bug Blocks:    
Attachments: vdsm log (flags: none)

Description Moran Goldboim 2011-03-23 09:50:11 UTC
Created attachment 486991
vdsm log

Description of problem:
Topology: 45 storage domains (SDs), 3 LUNs each, over Fibre Channel (FC).
connectStoragePool fails on the HSM host:
Thread-159::ERROR::2011-03-23 11:35:38,313::sp::1059::Storage.StoragePool::(__rebuild) Unexpected error
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/sp.py", line 1054, in __rebuild
    self.refreshDomain(d, msdUUID, linkName)
  File "/usr/share/vdsm/storage/sp.py", line 988, in refreshDomain
    d.refresh()
  File "/usr/share/vdsm/storage/blockSD.py", line 918, in refresh
    self.refreshDirTree()
  File "/usr/share/vdsm/storage/blockSD.py", line 902, in refreshDirTree
    if self.isISO():
  File "/usr/share/vdsm/storage/sd.py", line 563, in isISO
    return name2class(self.getMetaParam(CLASS)) == ISO_DOMAIN
  File "/usr/share/vdsm/storage/sd.py", line 499, in getMetaParam
    for i in self.Metadata(cache=cache):
  File "/usr/share/vdsm/storage/sd.py", line 482, in Metadata
    return self._metadata.metadata
  File "/usr/share/vdsm/storage/metadata.py", line 110, in __getmetadata
    md = self._get()
  File "/usr/share/vdsm/storage/blockSD.py", line 165, in _get
    raise se.VolumeMetadataReadError("%s/%s: %s %s" % (self._vgName, self._lvName, msg, str(metadata)))
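
For triaging a log like the attached one, here is a minimal Python sketch that pulls ERROR entries out of a vdsm log. The field layout is inferred only from the single log line quoted above, so other lines in the log may not match it. The names parse_log_line and error_entries are hypothetical helpers, not part of vdsm.

```python
import re

# Field layout inferred from the one log line quoted in this report:
#   thread::LEVEL::timestamp::module::lineno::logger::(function) message
# This is an assumption; other vdsm log lines may use a different layout.
LOG_LINE = re.compile(
    r"^(?P<thread>[^:]+)::(?P<level>[A-Z]+)::"
    r"(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})::"
    r"(?P<module>[^:]+)::(?P<lineno>\d+)::"
    r"(?P<logger>[^:]+)::\((?P<func>[^)]*)\) (?P<message>.*)$"
)


def parse_log_line(line):
    """Return a dict of fields for a matching log line, or None otherwise."""
    match = LOG_LINE.match(line.strip())
    return match.groupdict() if match else None


def error_entries(lines):
    """Yield parsed entries whose level is ERROR; traceback lines are skipped."""
    for line in lines:
        entry = parse_log_line(line)
        if entry is not None and entry["level"] == "ERROR":
            yield entry
```

It could be used as `for e in error_entries(open(path)): print(e["timestamp"], e["func"], e["message"])`, where path points at a local copy of the log.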


Version-Release number of selected component (if applicable):
vdsm-4.9-55.el6.x86_64

How reproducible:
always

Comment 3 Dan Kenigsberg 2011-03-28 15:42:12 UTC
This flow will undergo major changes with bug 664432; please re-test once that is done.

Comment 6 Haim 2011-09-11 09:12:00 UTC
Verified as part of the pool refactoring and performance improvement process.
Tested several setups with different topologies, such as:

- 1 pool with 1 SD with 60 LUNs via 2 paths.
- 1 pool with 100 SDs.
- 1 pool with 4 SDs, where each SD is composed of 25 LUNs.

Comment 7 errata-xmlrpc 2011-12-06 07:09:49 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHEA-2011-1782.html