Bug 763998 (GLUSTER-2266)

Summary: stale file handle errors after volume expansion and half way through rebalance
Product: [Community] GlusterFS Reporter: Harshavardhana <fharshav>
Component: distributeAssignee: Amar Tumballi <amarts>
Status: CLOSED WORKSFORME QA Contact:
Severity: low Docs Contact:
Priority: low    
Version: 3.1.1CC: cww, gluster-bugs, renee, vijay, vraman
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: ---
Regression: --- Mount Type: ---
Documentation: DNR CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Harshavardhana 2010-12-31 22:40:51 UTC
Reproducibility :- Easy

A replicated volume of 2 nodes and 2 backends with replica count 2. Is expanded to distributed replicated volume of 3 x 2. 

After the expansion the VM's were running during the expansion and there was seamless reflection of additional space dynamically without a remount. 

Started the rebalance and after like 30 files transferred stopped in bet'n to see if the files are reflected properly.  

Started seeing stale file handle errors for certain directories which were still being accessed by the VM's. After a remount the issue goes away. 

Why would a user see this error? does rebalance change the gfids for files? Does directory layout change not properly reflected on to clients?

Comment 1 Harshavardhana 2011-01-10 20:14:30 UTC
(In reply to comment #0)
> Reproducibility :- Easy
> 
> A replicated volume of 2 nodes and 2 backends with replica count 2. Is expanded
> to distributed replicated volume of 3 x 2. 
> 
> After the expansion the VM's were running during the expansion and there was
> seamless reflection of additional space dynamically without a remount. 

Will this be fixed in 3.1.2 release? This is tracked for a customer issue.

Comment 2 Amar Tumballi 2011-02-25 05:03:20 UTC
(In reply to comment #0)
> Reproducibility :- Easy
> 

Reproducibility was not at all easy for us. Instead, we couldn't even reproduce it (with latest git (say from last 7 days)). Will be closing it with 'works for me'. Please reopen if it exist in codebase after two pending rebalance patches get in.