Bug 1318093

Summary: [GSS] Client's App is having issues retrieving files from share 1002976973
Product: [Red Hat Storage] Red Hat Gluster Storage Reporter: Oonkwee Lim <olim>
Component: quota Assignee: Raghavendra G <rgowdapp>
Status: CLOSED INSUFFICIENT_DATA QA Contact: storage-qa-internal <storage-qa-internal>
Severity: urgent Docs Contact:
Priority: urgent    
Version: rhgs-3.1 CC: amukherj, bkunal, hchen, olim, rgowdapp, rhs-bugs, rnalakka, skoduri, smohan, vbellur
Target Milestone: --- Keywords: ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 1318158 (view as bug list) Environment:
Last Closed: 2017-11-06 22:34:26 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1318170, 1320925, 1320926, 1324040    
Bug Blocks: 1318158, 1320024    

Comment 3 Vijaikumar Mallikarjuna 2016-03-16 07:56:51 UTC
Hi Oonkwee,

The error "ctx for the node ... is NULL" can be ignored.

Regarding the quota list showing wrong usage: were any rename operations performed?

There was an issue with rename operations that caused quota usage to be miscalculated; it has been fixed in 3.1.2.

Here is the upstream patch: http://review.gluster.org/#/c/11578/


Thanks,
Vijay

Comment 4 Vijaikumar Mallikarjuna 2016-03-16 08:32:39 UTC
For quota error "ctx for the node ... is NULL", we have submitted a patch upstream: http://review.gluster.org/#/c/13748/2

Comment 5 Vijaikumar Mallikarjuna 2016-03-16 08:40:35 UTC
Bug# 1318170 filed to track quota error "ctx for the node ... is NULL"

Comment 11 Vijaikumar Mallikarjuna 2016-03-18 07:15:04 UTC
Hi Oonkwee,

There are 3 different issues mentioned in this bug:

Issue-1)
The brick logs show the error "ctx for the node ... is NULL".
This is not actually an error and has no impact on quota accounting; it can be ignored.
The patch http://review.gluster.org/#/c/13748/ will fix this error message.

Issue-2)
The quota list shows wrong usage:
Somehow the quota accounting was miscalculated. I suspect this is the same issue as bug# 1240991.


Issue-3)
You mentioned in comment# 6:
>> The client suspects there is some data corruption, hence, their application is 
>> facing issues while retrieving files.

Quota only sets and gets metadata (xattrs). Could you please provide more information on what kind of data corruption is seen?

I suspect that, because the usage is miscalculated as "16384.0PB", the quota enforcer may not be allowing writes to happen.
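
The figure "16384.0PB" is itself a hint. Assuming PB here means 2^50 bytes (binary units, an assumption about how the quota list formats sizes), 16384 PB is exactly 2^64 bytes, the full range of a 64-bit counter. One plausible reading, not confirmed in this bug, is that the accounted size went slightly negative and is being displayed as an unsigned value. The arithmetic can be checked with a small sketch (the function name is hypothetical):

```shell
#!/bin/sh
# Assumption: "PB" in the quota list output means 2^50 bytes.
# A 64-bit counter wraps at 2^64 bytes; expressed in 2^50-byte units
# that is 2^(64-50).
wrapped_counter_in_pb() {
    echo $(( 1 << (64 - 50) ))
}

wrapped_counter_in_pb   # prints 16384
```

A small negative byte count shown as unsigned would land just below 2^64 bytes, which rounds to 16384.0PB at one decimal place.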


Thanks,
Vijay

Comment 13 Vijaikumar Mallikarjuna 2016-03-22 07:01:12 UTC
Hi Oonkwee,

Regarding the quota usage shown, a couple of issues related to quota miscalculation were fixed in 3.1.2. I will find the list of patches submitted for this and update the bug with them.

Please let us know if you need a workaround to correct the quota usage.


Thanks,
Vijay

Comment 14 Vijaikumar Mallikarjuna 2016-03-22 09:28:17 UTC
Hi Oonkwee,


We have a couple of fixes related to quota accounting:

Fixed in 3.1.1
    http://review.gluster.org/#/c/11863/
    http://review.gluster.org/#/c/11995/
    http://review.gluster.org/#/c/12032/
    http://review.gluster.org/#/c/11403/

Fixed in 3.1.2
    http://review.gluster.org/#/c/11578/


Thanks,
Vijay

Comment 17 Oonkwee Lim 2016-03-24 15:39:20 UTC
Hi Vijay,

They are open to a workaround for the quota issue if you can provide one.

Comment 19 Vijaikumar Mallikarjuna 2016-03-28 04:51:56 UTC
Hi Oonkwee Lim,

Which Gluster version is the customer running? If they are running 3.1.0,
then multiple fixes for quota size miscalculation went into 3.1.1.
Does the customer plan to upgrade Gluster? If so, it is better to apply the workaround after the upgrade.


Here is the workaround to correct quota size:
 
1) Find a directory whose size is calculated incorrectly.
   In the description there is one directory showing an incorrect value:

Path          Hard-limit  Soft-limit  Used       Available  Soft-limit exceeded?  Hard-limit exceeded?
/1002976973   3.0TB       80%         16384.0PB  3.0TB      No                    No


2) Execute the command below on all nodes, for every brick
   #find /brickpath/1002976973/ -type d | xargs /usr/bin/setfattr -n trusted.glusterfs.quota.dirty -v 0x3100


3) Mount the volume over FUSE with readdirp disabled
   #mount -t glusterfs -o use-readdirp=no localhost:/volname /mnt


4) Send a lookup on all sub-directories of the affected directory (1002976973 in this example)
   #find /mnt/1002976973 -type d -exec stat {} \;


5) After the lookups complete, verify that the quota usage shows correct values
   #gluster volume quota volname list /1002976973
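
The steps above can be collected into a small helper script. This is only a sketch: the function names and the DRY_RUN switch are hypothetical and not part of Gluster, and the paths and volume name are the placeholders used in the steps. The commands themselves are taken from steps 2 to 5, except that find's own -exec is used in place of the find | xargs pipeline so that directory names containing whitespace are passed through intact. By default nothing is executed; each function only prints the command it would run.

```shell
#!/bin/sh
# Sketch of the workaround steps above. Function names and the DRY_RUN
# switch are hypothetical helpers, not part of Gluster.

# Print the command by default; set DRY_RUN=0 to actually execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

# Step 2: on every node, for every brick, mark each directory under the
# affected path as dirty. $1 is the affected directory on the brick.
mark_dirty() {
    run find "$1" -type d -exec /usr/bin/setfattr \
        -n trusted.glusterfs.quota.dirty -v 0x3100 {} +
}

# Step 3: mount the volume over FUSE with readdirp disabled.
# $1 is the volume name, $2 the mount point.
mount_no_readdirp() {
    run mount -t glusterfs -o use-readdirp=no "localhost:/$1" "$2"
}

# Step 4: stat every sub-directory through the mount to send lookups.
# $1 is the affected directory as seen through the mount point.
send_lookups() {
    run find "$1" -type d -exec stat {} +
}

# Step 5: list quota usage for the directory.
# $1 is the volume name, $2 the directory path within the volume.
show_quota() {
    run gluster volume quota "$1" list "$2"
}
```

For example, after sourcing the script, mark_dirty /brickpath/1002976973 would be run on each node for each brick, followed on a single client by mount_no_readdirp volname /mnt, send_lookups /mnt/1002976973, and show_quota volname /1002976973. Reviewing the printed commands first and then repeating with DRY_RUN=0 keeps the xattr change deliberate.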


Thanks,
Vijay

Comment 21 Vijaikumar Mallikarjuna 2016-03-28 06:44:27 UTC
Hi Bipin,

The workaround provided is for correcting the quota size.
Regarding the data corruption, we are working on the RCA.

Thanks,
Vijay