Bug 1347257
Summary: | spurious heal info as pending heal entries never end on an EC volume while IOs are going on | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Nag Pavan Chilakam <nchilaka> | |
Component: | disperse | Assignee: | Ashish Pandey <aspandey> | |
Status: | CLOSED ERRATA | QA Contact: | Nag Pavan Chilakam <nchilaka> | |
Severity: | urgent | Docs Contact: | ||
Priority: | unspecified | |||
Version: | rhgs-3.1 | CC: | amukherj, aspandey, nchilaka, pkarampu, ravishankar, rcyriac, rhinduja, rhs-bugs | |
Target Milestone: | --- | |||
Target Release: | RHGS 3.2.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | glusterfs-3.8.4-3 | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1366815 | Environment: | |
Last Closed: | 2017-03-23 05:36:57 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1351522, 1366815, 1383913 |
Description
Nag Pavan Chilakam
2016-06-16 11:57:09 UTC
I was not able to hit the issue when I did a rolling upgrade of a 2-node 4+2 disperse volume from rhgs-3.1.2 to 3.1.3 with IO (dd'ing to a file) happening from a 3.1.2 client. The heal info was showing entries as long as IO was happening (shd was waiting for locks). Once the IO stopped, healing resumed and came to zero entries.

Nag, do you have a more consistent reproducer for the issue? Also, please provide the getfattr outputs of the files from all bricks and the logs. If the heal-info entries are spurious, then the trusted.ec* attributes of the file must be the same on all bricks (indicating no heal is pending).

The patch has been posted and merged upstream: http://review.gluster.org/#/c/15543/

3.9 upstream patch: http://review.gluster.org/15627

QA verification:
I am not seeing this issue of spurious entries anymore on 3.8.4-13.
Note: the file that is currently being written can be seen in the heal info due to a timing issue, which is acceptable.
Hence moving to verified.

(In reply to nchilaka from comment #9)
> QA verification:
> I am not seeing this issue of spurious entries anymore on 3.8.4-13
> Note, the file that is being written currently can be seen in the heal info
> due to timing issue, which is acceptable

Note: I am not seeing the issue of the file being written showing up as heal pending (the whole purpose of the bz). However, such entries can still be seen in a network partition case, which is expected.

> Hence moving to verified

So the fix is working.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2017-0486.html
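
The check requested earlier in the thread (comparing the trusted.ec* attributes of a file across all bricks) could be scripted roughly as below. This is only a sketch and not part of the fix: it assumes the attributes were dumped per brick in the usual `name=value` text form that getfattr prints with hex encoding, and the helper names, brick labels, and sample values are all hypothetical. It follows the rule of thumb stated in the thread (identical values on every brick suggest no heal is pending). On a file that is still being written, the values may differ briefly, so a mismatch alone does not prove a real pending heal.

```python
def parse_getfattr(text):
    """Parse getfattr-style text output into a {name: value} dict.

    Comment lines such as "# file: ..." and blank lines are skipped.
    """
    attrs = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, sep, value = line.partition("=")
        if sep:
            attrs[name] = value
    return attrs


def mismatched_ec_xattrs(per_brick):
    """Return the trusted.ec.* attributes whose values differ across bricks.

    per_brick maps a brick label to the dict produced by parse_getfattr().
    The result maps each differing attribute name to {brick: value}, with
    None for a brick that lacks the attribute. An empty result means all
    trusted.ec.* attributes match on every brick.
    """
    names = set()
    for attrs in per_brick.values():
        names.update(n for n in attrs if n.startswith("trusted.ec."))

    mismatches = {}
    for name in sorted(names):
        values = {brick: attrs.get(name) for brick, attrs in per_brick.items()}
        if len(set(values.values())) > 1:
            mismatches[name] = values
    return mismatches


if __name__ == "__main__":
    # Hypothetical dumps from two bricks; the values are made up.
    brick1 = parse_getfattr(
        "# file: bricks/brick1/testfile\n"
        "trusted.ec.dirty=0x00\n"
        "trusted.ec.version=0x05\n"
    )
    brick2 = parse_getfattr(
        "# file: bricks/brick2/testfile\n"
        "trusted.ec.dirty=0x00\n"
        "trusted.ec.version=0x04\n"
    )
    print(mismatched_ec_xattrs({"brick1": brick1, "brick2": brick2}))
```

An empty result for a file that heal info still lists would point toward a spurious entry, while a non-empty result shows which attribute differs and on which brick.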