Bug 1273728
| Summary: | Crash while bringing down the bricks and self heal | | |
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Bhaskarakiran <byarlaga> |
| Component: | tier | Assignee: | Joseph Elwin Fernandes <josferna> |
| Status: | CLOSED ERRATA | QA Contact: | Neha <nerawat> |
| Severity: | urgent | Docs Contact: | |
| Priority: | urgent | | |
| Version: | rhgs-3.1 | CC: | asrivast, dlambrig, mzywusko, nchilaka, rhs-bugs, sankarshan, sashinde, storage-qa-internal |
| Target Milestone: | --- | Keywords: | ZStream |
| Target Release: | RHGS 3.1.2 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | glusterfs-3.7.5-7 | Doc Type: | Bug Fix |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2016-03-01 05:43:55 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 1260783, 1260923 | | |
Description
Bhaskarakiran
2015-10-21 06:41:34 UTC
1) Tested the following but couldn't reproduce this.

a) Created a volume with 1000 files already in it.

b) Attached a hot tier and created another 1000 files.

[root@fedora1 test]# gluster vol info
Volume Name: test
Type: Tier
Volume ID: bb7a3b77-063d-4334-9e60-862ce4f90bd0
Status: Started
Number of Bricks: 10
Transport-type: tcp
Hot Tier :
Hot Tier Type : Distributed-Replicate
Number of Bricks: 2 x 2 = 4
Brick1: fedora1:/home/ssd/small_brick3/s3
Brick2: fedora1:/home/ssd/small_brick2/s2
Brick3: fedora1:/home/ssd/small_brick1/s1
Brick4: fedora1:/home/ssd/small_brick0/s0
Cold Tier:
Cold Tier Type : Disperse
Number of Bricks: 1 x (4 + 2) = 6
Brick5: fedora1:/home/disk/d1
Brick6: fedora1:/home/disk/d2
Brick7: fedora1:/home/disk/d3
Brick8: fedora1:/home/disk/d4
Brick9: fedora1:/home/disk/d5
Brick10: fedora1:/home/disk/d6
Options Reconfigured:
diagnostics.brick-log-level: TRACE
cluster.self-heal-daemon: enable
cluster.disperse-self-heal-daemon: enable
cluster.tier-mode: test
features.record-counters: on
features.ctr-enabled: on
performance.readdir-ahead: on
[root@fedora1 test]#

c) During promotion and demotion, stopped and restarted the EC bricks. Didn't find any crash.

2) The code path where this crash was seen previously has changed completely in this patch:
https://code.engineering.redhat.com/gerrit/#/c/61006/

Similar crashes were seen previously in:
https://bugzilla.redhat.com/show_bug.cgi?id=1258144
https://bugzilla.redhat.com/show_bug.cgi?id=1273347

The above fix is expected to resolve these crashes as well. Changing the status to ON_QA.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2016-0193.html
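
For anyone re-running the reproduction attempt in steps 1(a) and 1(b), the file-population part can be sketched as below. This is only a rough illustration, not the script used in the original test: the function name `create_files`, the file naming scheme, the 1 KiB payload, and the mount point `/mnt/test` in the usage comment are all assumptions.

```python
import os


def create_files(directory, count, prefix="file", payload=b"x" * 1024):
    """Create `count` small files named <prefix>_<NNNN> under `directory`.

    Returns the list of paths written. All names and sizes here are
    hypothetical; the bug report only says "1000 files".
    """
    os.makedirs(directory, exist_ok=True)
    paths = []
    for i in range(count):
        path = os.path.join(directory, f"{prefix}_{i:04d}")
        with open(path, "wb") as f:
            f.write(payload)
        paths.append(path)
    return paths


# Usage sketch (assumes the volume is mounted at the hypothetical /mnt/test):
#   create_files("/mnt/test", 1000, prefix="cold")   # step a, before attaching the hot tier
#   ... attach the hot tier with the gluster CLI ...
#   create_files("/mnt/test", 1000, prefix="hot")    # step b, after attaching the hot tier
```

Using distinct prefixes for the two batches makes it easier to tell which files were written before and after the hot tier was attached when checking promotion and demotion.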