Bug 1361518
Summary: | Files not able to heal after arbiter and data bricks were rebooted | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Pranith Kumar K <pkarampu> |
Component: | arbiter | Assignee: | Ravishankar N <ravishankar> |
Status: | CLOSED UPSTREAM | QA Contact: | Karan Sandha <ksandha> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | rhgs-3.1 | CC: | amukherj, atumball, bkunal, bmohanra, bugs, ksandha, nchilaka, ravishankar, rcyriac, rhinduja, rhs-bugs |
Target Milestone: | --- | Keywords: | Triaged, ZStream |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Known Issue | |
Doc Text: |
If a file create is wound to all bricks, and it succeeds only on arbiter, the application will get a failure. But during self-heal, the file gets created on the data bricks with arbiter marked as source. Since data self-heal can never happen from arbiter, 'heal-info' will list the entries forever.
Workaround:
If 'gluster vol heal <volname> info` shows the pending heals for a file forever, then check if the issue is the same as mentioned above by:
i) checking that trusted.afr.volname-client* xattrs are zero on the data bricks
ii)checking that trusted.afr.volname-client* xattrs is non-zero on the arbiter brick *only* for the data part (first 4 bytes)
Example:
#getfattr -d -m . -e hex /bricks/arbiterbrick/file |grep trusted.afr.testvol*
getfattr: Removing leading '/' from absolute path names
trusted.afr.testvol-client-0=0x000000540000000000000000
trusted.afr.testvol-client-1=0x000000540000000000000000
If it is in this state, then delete the xattr:
#for i in $(getfattr -d -m . -e hex /bricks/arbiterbrick/file |grep trusted.afr.testvol*|cut -f1 -d'='); do setfattr -x $i file; done
|
Story Points: | --- |
Clone Of: | 1340032 | Environment: | |
Last Closed: | 2018-04-16 18:16:27 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1340032 | ||
Bug Blocks: | 1351530 |
Description
Pranith Kumar K
2016-07-29 09:21:00 UTC
http://review.gluster.org/14769 posted upstream for review. Increasing the priority of this bug as i am hitting this issue on pretty much every brick down scenarios. Bipin, Idea is not about we WONTFIX always, but that was done with looking the activity on the bugzilla, and we had not picked the particular bug for previous 2 releases. We will keep it as an open bug in Upstream and fix it, and will get it to downstream when we get it in releases as backports. |