Bug 1360331
Summary: | default timeout of 5min not honored for analyzing split-brain files post setfattr replica.split-brain-heal-finalize | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Nag Pavan Chilakam <nchilaka> | |
Component: | replicate | Assignee: | Karthik U S <ksubrahm> | |
Status: | CLOSED ERRATA | QA Contact: | Vijay Avuthu <vavuthu> | |
Severity: | medium | Docs Contact: | ||
Priority: | high | |||
Version: | rhgs-3.1 | CC: | amukherj, ravishankar, rhinduja, rhs-bugs, sheggodu | |
Target Milestone: | --- | Keywords: | ZStream | |
Target Release: | RHGS 3.4.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | glusterfs-3.12.2-2 | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1503519 (view as bug list) | Environment: | ||
Last Closed: | 2018-09-04 06:29:40 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1503519 | |||
Bug Blocks: | 1503134 |
Description
Nag Pavan Chilakam
2016-07-26 12:35:05 UTC
FYI, If i clear the client cache using "free && sync && echo 3 > /proc/sys/vm/drop_caches && free" then the 5min is honoured. similar behavior even if we change the timeout So, now the question is , shouldn't the cache invalidation kick in post timeout instead of user needing to clear cache? Hi Karthik, could you take a look at this bug? Check if the problem is with AFR timer expiry logic (unlikely) or due to caching in perf xlators or in the fuse kernel. Sure Ravi. Will check. Upstream patch: https://review.gluster.org/18546 Update:
========
Build Used : glusterfs-3.12.2-7.el7rhgs.x86_64
1) create data-split brain files
2) set the replica.split-brain-choice for accessing the file for 5 min ( default time ) from mount point
3) validate whether file is accessed ONLY for 5 min. After 5th min, it should throw I/O error
# date;setfattr -n replica.split-brain-choice -v "12-client-1" file_2
Thu Apr 19 02:36:39 EDT 2018
#
# date;cat file_2
Thu Apr 19 02:41:39 EDT 2018
Initial contnet
Appending contnet while b0 is down
# date;cat file_2
Thu Apr 19 02:41:40 EDT 2018
cat: file_2: Input/output error
#
> Also Tried accessing file in a loop continuously and after 5 min its throwing I/O error as expected.
Changing status to Verified.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2018:2607 |