Bug 1090298 - Addition of new server after upgrade from 3.3 results in peer rejected
Summary: Addition of new server after upgrade from 3.3 results in peer rejected
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: GlusterFS
Classification: Community
Component: core
Version: 3.4.3
Hardware: All
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Ravishankar N
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks: 1095324
 
Reported: 2014-04-23 05:56 UTC by Awktane
Modified: 2015-12-01 16:45 UTC
CC List: 3 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-05-21 14:54:37 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:



Description Awktane 2014-04-23 05:56:52 UTC
Description of problem:
Upgrading from 3.3 to 3.4 and then adding a new node results in the new peer being shown as Peer Rejected (Connected), because the volume info files do not match.

How reproducible:
Appears to be every time

Steps to Reproduce:
1. Take a 3.3 or earlier installation and upgrade it to 3.4
2. Set up a brand new 3.4 server and peer probe it from a trusted member
3. The new server shows as Peer Rejected (connected), as illustrated below

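A hedged sketch of steps 2 and 3 as run from an existing trusted member (the hostname is a placeholder, and the exact status output wording may differ slightly between releases):

# gluster peer probe newserver.example.com
# gluster peer status
Hostname: newserver.example.com
State: Peer Rejected (Connected)
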
The volume info file on the new server(s) contains two extra lines:
op-version=2
client-op-version=2

These lines are absent from the old servers' info files, so the files mismatch and the new peer is rejected. The current workaround is to add these lines to each old server's /var/lib/glusterd/vols/{volume}/info file (or perhaps to delete them from the new one?).
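
For illustration only, a hedged sketch of that workaround on an upgraded (old) server. The path and the volume name "myvol" are assumptions, and restarting glusterd so it re-reads the info file and regenerates its checksum is likewise an assumption rather than something stated in this report:

# echo "op-version=2" >> /var/lib/glusterd/vols/myvol/info
# echo "client-op-version=2" >> /var/lib/glusterd/vols/myvol/info
# service glusterd restart
# gluster peer status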

Comment 1 Anand Avati 2014-05-09 11:27:48 UTC
REVIEW: http://review.gluster.org/7729 (glusterd: update op-version info during upgrades.) posted (#1) for review on release-3.4 by Ravishankar N (ravishankar)

Comment 2 Ravishankar N 2014-05-16 04:26:14 UTC
The patch to fix this is being abandoned for reasons described in the review comments. Proposed solution (sic):

Once all the peers have been upgraded, the user must do a dummy volume set operation on all volumes. This ensures that the volume information and checksums are updated correctly, which allows new peers to be probed without any problem. For example:

# gluster volume set <name> brick-log-level INFO

(This has no effect on the volume's operation, as the default log level is already INFO, but it does update the volume info and checksums.)
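
For clusters with more than one volume, a hedged sketch of running that dummy set on every volume (it assumes the "gluster volume list" subcommand is available in the installed release; the volume names could otherwise be read from "gluster volume info"):

# for vol in $(gluster volume list); do gluster volume set $vol brick-log-level INFO; done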

Comment 3 Awktane 2014-05-16 09:05:43 UTC
Alright, for existing folks like me who just added those two lines, are there any ramifications? I think I did do a volume set to remove lookup-unhashed, as it was causing a bunch of files/folders to error out. I assumed this was due to the rebalancing state.

Comment 4 Ravishankar N 2014-05-16 09:18:43 UTC
(In reply to Awktane from comment #3)
> Alright, for existing folks like me who just added those two lines, are
> there any ramifications? I think I did do a volume set to remove
> lookup-unhashed, as it was causing a bunch of files/folders to error out. I
> assumed this was due to the rebalancing state.

I don't think it should matter. If peer status shows all peers in the Connected state, then we are good.

