Bug 1582526 - [Tracker-RHGS-BZ#1519105]On inducing node remove and gluster pod scale down to 2 in parallel, glusterd fails to start on 1 pod upon scaling back to 4 pods
Summary: [Tracker-RHGS-BZ#1519105]On inducing node remove and gluster pod scale down t...
Keywords:
Status: CLOSED DUPLICATE of bug 1598340
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: heketi
Version: cns-3.10
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: ---
Assignee: Raghavendra Talur
QA Contact: Neha Berry
URL:
Whiteboard:
Depends On: 1519105 1593865 1631664
Blocks: OCS-3.11.1-devel-triage-done
TreeView+ depends on / blocked
 
Reported: 2018-05-25 13:38 UTC by Neha Berry
Modified: 2019-03-27 11:51 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-03-27 11:51:59 UTC
Embargoed:


Attachments (Terms of Use)

Comment 13 Raghavendra Talur 2018-07-11 12:52:07 UTC
There is a bug in glusterd where a replace-brick operation returns 1 but updates the list of bricks on the glusterd side. This is what has happened here. I am not able to find the bugzilla entry.

From the cmd_history.log
cmd_history (1).log:34:[2018-05-25 11:10:17.850306]  : volume replace-brick nb_glusterfs_mongodb-5_22a87523-5ffb-11e8-9dde-005056a5aac9 10.70.46.75:/var/lib/heketi/mounts/vg_4011ccd7b65237d7ebd2dbba56a8886b/brick_221779a6b7afce4311fba85b61443c07/brick 10.70.47.89:/var/lib/heketi/mounts/vg_7581c8554e4ea2c452eb90a2c3e01390/brick_ac3ed65c973eef921b00e5f71ab874e9/brick commit force : FAILED : Commit failed on dhcp46-175.lab.eng.blr.redhat.com. Please check log file for details.

From gluster vol info
Volume Name: nb_glusterfs_mongodb-5_22a87523-5ffb-11e8-9dde-005056a5aac9
Type: Replicate
Volume ID: 5cad9830-3892-4859-9257-47b262b372cd
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x 3 = 3
Transport-type: tcp
Bricks:
Brick1: 10.70.46.1:/var/lib/heketi/mounts/vg_8f5c1d1963326127326f9b33a732691a/brick_a3dc1a0846779ac709a347a2428ba8d6/brick
Brick2: 10.70.46.175:/var/lib/heketi/mounts/vg_e2dc26ec7ea1a6eca27eb7a719d5b92e/brick_03b3fdbe13bd1c43b4b85dc0f11fa2ab/brick
Brick3: 10.70.47.89:/var/lib/heketi/mounts/vg_7581c8554e4ea2c452eb90a2c3e01390/brick_ac3ed65c973eef921b00e5f71ab874e9/brick
Options Reconfigured:
nfs.disable: on
transport.address-family: inet
cluster.brick-multiplex: on


Putting Needinfo on Atin for the same.

The fix for it has to come in glusterd and is an intensive change. I propose to move this out of CNS 3.10.

Comment 21 Raghavendra Talur 2019-03-27 11:51:59 UTC

*** This bug has been marked as a duplicate of bug 1598340 ***


Note You need to log in before you can comment on or make changes to this bug.