Verified the bug on build: glusterfs-server-3.7.9-1.el7rhgs.x86_64

Steps followed to verify:

Test 1:
1) Created a 4 x 2 distributed-replicate volume (say brick-1 through brick-8)
2) Created a directory and, under it, 10k files
3) Added 4 more bricks
4) Initiated the rebalance process
5) Killed brick-1

The rebalance process halted on the replica pair of brick-1 and brick-2, while rebalance on the other bricks ran on to completion. There was no inconsistency in the rebalance status. This is expected behavior: when readdirp fails on a directory, rebalance of all files under that directory fails. To validate this, performed Test 2.

Test 2:
1) Created a 4 x 2 distributed-replicate volume (say brick-1 through brick-8)
2) Created 100 directories: dir-{1..100}
3) Created 1k files under each directory
4) Added 4 more bricks
5) Initiated the rebalance process
6) Killed brick-1

The rebalance process continued on all replica pairs: when readdirp failed on one directory, it moved on to subsequent directories. This is as expected, and the rebalance status was consistent across all nodes.

Hence, marking this bug as verified.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2016:1240