Bug 1451280
Summary: | [Bitrot]: Brick process crash observed while trying to recover a bad file in disperse volume | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Sweta Anandpara <sanandpa> | |
Component: | bitrot | Assignee: | Kotresh HR <khiremat> | |
Status: | CLOSED ERRATA | QA Contact: | Sweta Anandpara <sanandpa> | |
Severity: | high | Docs Contact: | ||
Priority: | unspecified | |||
Version: | rhgs-3.3 | CC: | amukherj, rhinduja, rhs-bugs, storage-qa-internal | |
Target Milestone: | --- | |||
Target Release: | RHGS 3.3.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | glusterfs-3.8.4-27 | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1454317 | Environment: | ||
Last Closed: | 2017-09-21 04:43:23 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1454317, 1456331 | |||
Bug Blocks: | 1417151 |
Description
Sweta Anandpara
2017-05-16 09:56:09 UTC
[qe@rhsqe-repo 1451280]$ hostname
rhsqe-repo.lab.eng.blr.redhat.com
[qe@rhsqe-repo 1451280]$ pwd
/home/repo/sosreports/1451280
[qe@rhsqe-repo 1451280]$ ll
total 708976
-rwxr-xr-x. 1 qe qe 157433856 May 16 15:13 core.5950
-rwxr-xr-x. 1 qe qe 157433856 May 16 15:13 core.9730
-rwxr-xr-x. 1 qe qe  73012628 May 16 15:12 sosreport-sysreg-prod-20170516050748.tar.xz_dhcp47_121
-rwxr-xr-x. 1 qe qe  69134612 May 16 15:12 sosreport-sysreg-prod-20170516050917.tar.xz_dhcp47_113
-rwxr-xr-x. 1 qe qe  69795020 May 16 15:12 sosreport-sysreg-prod-20170516051025.tar.xz_dhcp47_114
-rwxr-xr-x. 1 qe qe  69256712 May 16 15:12 sosreport-sysreg-prod-20170516051259.tar.xz_dhcp47_115
-rwxr-xr-x. 1 qe qe  65140528 May 16 15:12 sosreport-sysreg-prod-20170516051545.tar.xz_dhcp47_116
-rwxr-xr-x. 1 qe qe  64772920 May 16 15:12 sosreport-sysreg-prod-20170516051639.tar.xz_dhcp47_117
[qe@rhsqe-repo 1451280]$

Following is the bt:

Program terminated with signal 11, Segmentation fault.
#0  list_add_tail (head=0x7f0e28001908, new=0x18) at ../../../../../libglusterfs/src/list.h:40
40              new->next = head;
Missing separate debuginfos, use: debuginfo-install glibc-2.17-157.el7_3.1.x86_64 keyutils-libs-1.5.8-3.el7.x86_64 krb5-libs-1.14.1-27.el7_3.x86_64 libacl-2.2.51-12.el7.x86_64 libaio-0.3.109-13.el7.x86_64 libattr-2.4.46-12.el7.x86_64 libcom_err-1.42.9-9.el7.x86_64 libgcc-4.8.5-11.el7.x86_64 libselinux-2.5-6.el7.x86_64 libuuid-2.23.2-33.el7_3.2.x86_64 openssl-libs-1.0.1e-60.el7_3.1.x86_64 pcre-8.32-15.el7_2.1.x86_64 sqlite-3.7.17-8.el7.x86_64 sssd-client-1.14.0-43.el7_3.14.x86_64 zlib-1.2.7-17.el7.x86_64
(gdb) bt
#0  list_add_tail (head=0x7f0e28001908, new=0x18) at ../../../../../libglusterfs/src/list.h:40
#1  br_stub_add_fd_to_inode (this=this@entry=0x7f0e6c012440, fd=fd@entry=0x7f0e6c0a5050, ctx=ctx@entry=0x0)
    at bit-rot-stub.c:2398
#2  0x00007f0e7174fe56 in br_stub_open (frame=0x7f0e28000ca0, this=0x7f0e6c012440, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at bit-rot-stub.c:2352
#3  0x00007f0e71535815 in posix_acl_open (frame=0x7f0e280014b0, this=0x7f0e6c013d70, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at posix-acl.c:1129
#4  0x00007f0e71312dc8 in pl_open (frame=frame@entry=0x7f0e28000ac0, this=this@entry=0x7f0e6c015320, loc=loc@entry=0x7f0e6c0ccf90, flags=flags@entry=2, fd=fd@entry=0x7f0e6c0a5050, xdata=xdata@entry=0x0)
    at posix.c:1698
#5  0x00007f0e71106e59 in worm_open (frame=0x7f0e28000ac0, this=<optimized out>, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at worm.c:43
#6  0x00007f0e70efb478 in ro_open (frame=0x7f0e28001740, this=0x7f0e6c018130, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at read-only-common.c:341
#7  0x00007f0e70ce70b4 in leases_open (frame=0x7f0e28001b50, this=0x7f0e6c019880, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at leases.c:75
#8  0x00007f0e70ad7143 in up_open (frame=0x7f0e28002250, this=0x7f0e6c01af20, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at upcall.c:75
#9  0x00007f0e805b1269 in default_open_resume (frame=0x7f0e6c002020, this=0x7f0e6c01c690, loc=0x7f0e6c0ccf90, flags=2, fd=0x7f0e6c0a5050, xdata=0x0)
    at defaults.c:1726
#10 0x00007f0e80542b25 in call_resume (stub=0x7f0e6c0ccf40) at call-stub.c:2508
#11 0x00007f0e708c1957 in iot_worker (data=0x7f0e6c0550e0) at io-threads.c:220
#12 0x00007f0e7f37fdc5 in start_thread () from /lib64/libpthread.so.0
#13 0x00007f0e7ecc473d in clone () from /lib64/libc.so.6

Upstream Patch: https://review.gluster.org/17357

Upstream Patches:
https://review.gluster.org/17357 (master)
https://review.gluster.org/#/c/17406/ (release-3.11)

Downstream Patches:
https://code.engineering.redhat.com/gerrit/#/c/107534/

Tested and verified this on the build glusterfs-3.8.4-35. A round of bitrot testing has taken place on that build, and I have not seen this crash again in my logs. Moving this to verified in 3.3.0.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:2774
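
A note on reading the backtrace in the description: frame #1 shows `ctx=0x0`, and frame #0 faults on `new->next = head` with `new=0x18`. That small non-zero address is what one would expect if the list node passed as `new` is a member embedded in the context structure and its address was computed from a NULL `ctx`. The sketch below illustrates that reading only. The `demo_*` names and the structure layout are hypothetical (the layout was chosen so that the member lands at offset 0x18 on a typical LP64 Linux x86_64 build); it is not the GlusterFS definition, and the NULL guard is not a description of the upstream or downstream patch.

```c
#include <assert.h>
#include <stddef.h>

/* Minimal circular doubly linked list node (kernel-style). */
struct list_head {
    struct list_head *next;
    struct list_head *prev;
};

/* Make head an empty list that points at itself. */
void demo_init_list_head(struct list_head *head)
{
    head->next = head;
    head->prev = head;
}

/* Link entry just before head, i.e. at the tail of head's list. */
void demo_list_add_tail(struct list_head *entry, struct list_head *head)
{
    assert(entry != NULL && head != NULL);
    entry->next = head;        /* same statement as the faulting line in frame #0 */
    entry->prev = head->prev;
    entry->prev->next = entry;
    entry->next->prev = entry;
}

/* HYPOTHETICAL per-inode context; not the real bit-rot stub structure. */
struct demo_inode_ctx {
    int need_writeback;
    unsigned long current_version;
    int info_sign;
    struct list_head fd_list;  /* embedded list node */
};

/*
 * Offset of the embedded node. With a NULL ctx, the address of
 * ctx->fd_list would equal this offset, which is why a crash address
 * such as 0x18 hints at a NULL base pointer.
 */
size_t demo_fd_list_offset(void)
{
    return offsetof(struct demo_inode_ctx, fd_list);
}

/*
 * Guarded variant (an assumption for illustration): refuse to touch the
 * list when no context is available instead of dereferencing NULL.
 * Returns 0 on success, -1 if ctx is NULL.
 */
int demo_add_fd_to_inode(struct demo_inode_ctx *ctx, struct list_head *fd_head)
{
    if (ctx == NULL)
        return -1;
    demo_list_add_tail(&ctx->fd_list, fd_head);
    return 0;
}
```

Calling `demo_add_fd_to_inode(NULL, &head)` returns -1 instead of writing through address 0x18; with a valid context, the embedded node is linked onto the list as usual.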