Bug 1183988
Summary: | DHT:Quota:- brick process crashed after deleting .glusterfs from backend | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | shylesh <shmohan> | |
Component: | quota | Assignee: | Vijaikumar Mallikarjuna <vmallika> | |
Status: | CLOSED ERRATA | QA Contact: | Anil Shah <ashah> | |
Severity: | medium | Docs Contact: | ||
Priority: | high | |||
Version: | 2.1 | CC: | achauras, annair, asrivast, nlevinki, nsathyan, rcyriac, rhs-bugs, smohan, storage-qa-internal, vagarwal, vbellur | |
Target Milestone: | --- | |||
Target Release: | RHGS 3.1.0 | |||
Hardware: | x86_64 | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | glusterfs-3.7.1-3 | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1203629 (view as bug list) | Environment: | ||
Last Closed: | 2015-07-29 04:37:56 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1203629 | |||
Bug Blocks: | 1202842, 1217419 |
Description
shylesh
2015-01-20 11:27:06 UTC
A little indicates that the issue is only with a folder: ls -ltrh /rhs/brick1/gv0/data1/shd/gluster/test/ The getfattr of the folder shows that there is no gfid for the folder. [root@dht-rhs-19 ~]# getfattr -d -m . -e hex /rhs/brick1/gv0/data1/shd/gluster/test/ getfattr: Removing leading '/' from absolute path names # file: rhs/brick1/gv0/data1/shd/gluster/test/ trusted.glusterfs.dht=0x0000000100000000000000007ffffffe trusted.glusterfs.quota.d11e1ed1-88d4-4cf2-9d10-8c77a63cb206.contri=0x00000000000f9e00 trusted.glusterfs.quota.dirty=0x3100 trusted.glusterfs.quota.size=0x00000000000f9e00 As part of one my testcase, I deleted the this gfid using setfattr. Ideally, this xattr should have been healed through lookup. But, instead of healing, it dumps core. Patch submitted upstream: http://review.gluster.com/#/c/3842/ Below upstream patch fixes the issue: http://review.gluster.org/#/c/9941/ Delete the .glusterfs folder from backend. Didn't not see any core. However the brick process was kill. This is expected behaviour as discussed with developer. Earlier brick process was getting killed unexpectedly with core file generated. Now its fails gracefully. bricks logs ======================================================== [2015-07-07 06:19:32.297714] I [MSGID: 115029] [server-handshake.c:610:server_setvolume] 0-vol0-server: accepted client from darkknight-3012-2015/07/07-06:19:31:178889-vol0-client-0-0-0 (version: 3.7.1) [2015-07-07 06:20:27.232896] W [MSGID: 113075] [posix-helpers.c:1676:posix_fs_health_check] 0-vol0-posix: open() on /rhs/brick1/b001/.glusterfs/health_check returned [No such file or directory] [2015-07-07 06:20:27.232989] W [MSGID: 113075] [posix-helpers.c:1741:posix_health_check_thread_proc] 0-vol0-posix: health_check on /rhs/brick1/b001 returned [No such file or directory] [2015-07-07 06:20:27.233018] M [MSGID: 113075] [posix-helpers.c:1762:posix_health_check_thread_proc] 0-vol0-posix: health-check failed, going down [2015-07-07 06:20:57.234507] M [MSGID: 113075] [posix-helpers.c:1768:posix_health_check_thread_proc] 0-vol0-posix: still alive! -> SIGTERM Bug verified on build glusterfs-3.7.1-7.el6rhs.x86_64 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHSA-2015-1495.html |