Bug 1125958
Summary: | DHT + NFS :- if Directory is created when sub-volume was down then unable to access Directory and its data when sub-volume is up again | ||||||
---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rachana Patel <racpatel> | ||||
Component: | distribute | Assignee: | Susant Kumar Palai <spalai> | ||||
Status: | CLOSED ERRATA | QA Contact: | amainkar | ||||
Severity: | high | Docs Contact: | |||||
Priority: | unspecified | ||||||
Version: | rhgs-3.0 | CC: | nbalacha, nsathyan, spalai, srangana, ssamanta | ||||
Target Milestone: | --- | Keywords: | Regression | ||||
Target Release: | RHGS 3.0.0 | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | glusterfs-3.6.0.28-1 | Doc Type: | Bug Fix | ||||
Doc Text: |
Cause: The directory is missing on some bricks because it was created while those bricks were down. If a caller bypasses lookup and calls access using saved or cached inode information (as the NFS server does), dht_access fails the operation when ENOENT is returned.
Fix: If the directory is not found on one sub-volume, fetch the information from the next sub-volume.
|
Story Points: | --- | ||||
Clone Of: | Environment: | ||||||
Last Closed: | 2014-09-22 19:45:15 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | 1121099, 1125824, 1138393, 1139997, 1140338 | ||||||
Bug Blocks: | |||||||
Attachments: |
|
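The cause and fix recorded in Doc Text can be illustrated with a small simulation. This is a sketch in Python, not the GlusterFS C code: `Subvolume`, `access_first_only`, and `access_with_fallback` are hypothetical names, each sub-volume is modeled as a plain set of directory paths, and the assumption that the pre-fix path consults only one sub-volume is a simplification made for illustration.

```python
import errno


class Subvolume:
    """Hypothetical stand-in for one DHT sub-volume (not a GlusterFS type)."""

    def __init__(self, name, directories):
        self.name = name
        self.directories = set(directories)

    def access(self, path):
        """Return 0 if the directory exists on this sub-volume, else ENOENT."""
        return 0 if path in self.directories else errno.ENOENT


def access_first_only(subvolumes, path):
    """Simplified pre-fix behavior: an ENOENT from the sub-volume asked fails the op."""
    return subvolumes[0].access(path)


def access_with_fallback(subvolumes, path):
    """Simplified post-fix behavior: on ENOENT, ask the next sub-volume."""
    result = errno.ENOENT
    for subvol in subvolumes:
        result = subvol.access(path)
        if result != errno.ENOENT:
            return result  # success, or an error other than ENOENT
    return result  # every sub-volume reported ENOENT


# The brick that was down during mkdir never received the directory.
was_down = Subvolume("brick-that-was-down", [])
stayed_up = Subvolume("brick-that-stayed-up", ["/down"])
subvolumes = [was_down, stayed_up]

print(access_first_only(subvolumes, "/down") == errno.ENOENT)
print(access_with_fallback(subvolumes, "/down"))
```

In this model the fallback still reports ENOENT when no sub-volume has the directory, so a path that is missing everywhere continues to fail.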
Description
Rachana Patel
2014-08-01 12:57:27 UTC
Created attachment 926543: Brief test case

Tested this using the attached test script after the fix presented here was applied to upstream code: http://review.gluster.org/#/c/8462/

The test case passed. Much of the test case is commented out because the kill was not working properly, so the steps after the commented-out point were run manually.

Susant, can we try this test case before and after the dht_access fix, so that we know we have fixed the regression?

Shyam, here is an update on the patch. Tried without the patch, and here is the result:

    [root@vm50 mnt1]# kill -9 5881
    [root@vm50 mnt1]# mkdir down
    [root@vm50 mnt1]# ls down
    [root@vm50 mnt1]# touch down/f{1..100}
    [root@vm50 mnt1]# gluster v start test1 force
    volume start: test1: success
    [root@vm50 mnt1]# cd down
    [root@vm50 down]# ls
    ls: cannot open directory .: No such file or directory
    [root@vm50 down]# ls
    ls: cannot open directory .: No such file or directory
    [root@vm50 down]# ls
    ls: cannot open directory .: No such file or directory

And with the patch:

    [root@vm50 mnt1]# kill -9 10866
    [root@vm50 mnt1]# mkdir down
    [root@vm50 mnt1]# touch down/f{1..100}
    [root@vm50 mnt1]# ls down
    [root@vm50 mnt1]# cd down/^C
    [root@vm50 mnt1]# gluster v start test1 force
    volume start: test1: success
    [root@vm50 mnt1]# cd down/
    [root@vm50 down]# ls
    f1 f12 f16 f2 f23 f27 f30 f34 f38 f41 f45 f49 f52 f56 f6 f63 f67 f70 f74 f78 f81 f85 f89 f92 f96
    f10 f13 f17 f20 f24 f28 f31 f35 f39 f42 f46 f5 f53 f57 f60 f64 f68 f71 f75 f79 f82 f86 f9 f93 f97
    f100 f14 f18 f21 f25 f29 f32 f36 f4 f43 f47 f50 f54 f58 f61 f65 f69 f72 f76 f8 f83 f87 f90 f94 f98
    f11 f15 f19 f22 f26 f3 f33 f37 f40 f44 f48 f51 f55 f59 f62 f66 f7 f73 f77 f80 f84 f88 f91 f95 f99
    [root@vm50 down]#

So everything looks good :)

Verified with 3.6.0.28-1.el6rhs.x86_64; working as expected, hence moving to verified.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHEA-2014-1278.html