Bug 1110730
Summary: | brick process crashed when rebalance and rename was in progress | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rachana Patel <racpatel> | |
Component: | distribute | Assignee: | Nithya Balachandran <nbalacha> | |
Status: | CLOSED ERRATA | QA Contact: | amainkar | |
Severity: | high | Docs Contact: | ||
Priority: | high | |||
Version: | rhgs-3.0 | CC: | achauras, amainkar, asrivast, nbalacha, nlevinki, nsathyan, rcyriac, sauchter, sdharane, shmohan, smohan, ssaha, ssamanta, vagarwal, vbellur | |
Target Milestone: | --- | Keywords: | ZStream | |
Target Release: | RHGS 3.0.4 | |||
Hardware: | x86_64 | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | glusterfs-3.6.0.46-1 | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1113960 1165897 | Environment: | |
Last Closed: | 2015-03-26 06:34:21 UTC | Type: | Bug | |
Bug Blocks: | 1113960, 1182947 |
Description
Rachana Patel
2014-06-18 10:49:24 UTC
Able to reproduce even without the remove-brick step: create, start and mount a volume, then add bricks. Do I/O. Start rebalance and, while rebalance is in progress, do renames. The brick process crashed.

(gdb) bt
#0 0x000000376740c380 in pthread_spin_lock () from /lib64/libpthread.so.0
#1 0x00007f2669124af6 in dict_get () from /usr/lib64/libglusterfs.so.0
#2 0x00007f265a9eff1b in posix_lookup_xattr_fill () from /usr/lib64/glusterfs/3.6.0.19/xlator/storage/posix.so
#3 0x00007f265a9dd21e in posix_entry_xattr_fill () from /usr/lib64/glusterfs/3.6.0.19/xlator/storage/posix.so
#4 0x00007f265a9ec0bf in posix_readdirp_fill () from /usr/lib64/glusterfs/3.6.0.19/xlator/storage/posix.so
#5 0x00007f265a9ec5b3 in posix_do_readdir () from /usr/lib64/glusterfs/3.6.0.19/xlator/storage/posix.so
#6 0x00007f265a9ed32e in posix_readdirp () from /usr/lib64/glusterfs/3.6.0.19/xlator/storage/posix.so
#7 0x00007f266912ff63 in default_readdirp () from /usr/lib64/libglusterfs.so.0
#8 0x00007f265a5b64bd in posix_acl_readdirp () from /usr/lib64/glusterfs/3.6.0.19/xlator/features/access-control.so
#9 0x00007f265a39eef4 in pl_readdirp () from /usr/lib64/glusterfs/3.6.0.19/xlator/features/locks.so
#10 0x00007f2669132832 in default_readdirp_resume () from /usr/lib64/libglusterfs.so.0
#11 0x00007f266914c631 in call_resume () from /usr/lib64/libglusterfs.so.0
#12 0x00007f265a193348 in iot_worker () from /usr/lib64/glusterfs/3.6.0.19/xlator/performance/io-threads.so
#13 0x00000037674079d1 in start_thread () from /lib64/libpthread.so.0
#14 0x0000003766ce8b7d in clone () from /lib64/libc.so.6

I think we should consider this a blocker for Denali if it is frequently reproducible, or else document that people should not execute a rename operation while rebalance is in progress and fix it in U1.

We have not managed to reproduce the bug after it was reported.
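For anyone retrying the reproduction, the steps from the description (create, start and mount a volume, add bricks, do I/O, start rebalance, rename during rebalance) can be sketched as a dry-run script. This is only an illustration: the volume name, host name, brick paths, mount point, and file counts are hypothetical placeholders, not values from the original report, and the script only prints the commands rather than running them.

```shell
#!/bin/sh
# Hypothetical placeholders; none of these names come from the bug report.
REPRO_VOL="${REPRO_VOL:-testvol}"
REPRO_HOST="${REPRO_HOST:-server1}"
REPRO_MNT="${REPRO_MNT:-/mnt/testvol}"

# Print the reproduction sequence instead of executing it (dry run),
# so it can be reviewed and adapted before use on a test cluster.
repro_commands() {
    cat <<EOF
gluster volume create $REPRO_VOL $REPRO_HOST:/bricks/b1 $REPRO_HOST:/bricks/b2
gluster volume start $REPRO_VOL
mount -t glusterfs $REPRO_HOST:/$REPRO_VOL $REPRO_MNT
for i in \$(seq 1 1000); do echo data > $REPRO_MNT/file\$i; done
gluster volume add-brick $REPRO_VOL $REPRO_HOST:/bricks/b3
gluster volume rebalance $REPRO_VOL start
for i in \$(seq 1 1000); do mv $REPRO_MNT/file\$i $REPRO_MNT/renamed\$i; done
gluster volume rebalance $REPRO_VOL status
EOF
}

repro_commands
```

The rename loop is listed right after `rebalance start` so that the renames overlap with the running rebalance, which is the window in which the crash was reported. On a real setup the file count would need to be large enough for rebalance to still be in progress when the renames run.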
I'm not sure documenting it will help because, IIUC, rebalance can take days to complete, and restricting the operations allowed for that long might not be possible.

Based on the comment above, removing the blocker flag. To be targeted for U1.

This issue is also reproducible on 3.6.0.41-1.el6rhs.x86_64.

Tried on NFS as well as FUSE mount points while looking in progress. No crash was seen while performing renames, even during rebalance. Marking it verified.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2015-0682.html