Bug 1694010 - peer gets disconnected during a rolling upgrade.
Summary: peer gets disconnected during a rolling upgrade.
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: GlusterFS
Classification: Community
Component: glusterd
Version: 6
Hardware: Unspecified
OS: Unspecified
unspecified
low
Target Milestone: ---
Assignee: bugs@gluster.org
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2019-03-29 09:42 UTC by hari gowtham
Modified: 2019-04-02 10:17 UTC (History)
5 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2019-04-01 13:39:22 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:


Attachments (Terms of Use)

Description hari gowtham 2019-03-29 09:42:53 UTC
Description of problem:
When we do a rolling upgrade of the cluster from 3.12, 4.1 or 5.5 to 6, the upgraded node goes into disconnected state. 

Version-Release number of selected component (if applicable):
6.0

How reproducible:
100%

Steps to Reproduce:
1.create a replica 3 cluster
2.kill gluster process on one node
3.upgrade the node and start glusterd

Actual results:
the upgrade node goes into disconnected state

Expected results:
the peer shouldnt get disconnected.

Additional info:

Comment 1 hari gowtham 2019-03-29 09:45:05 UTC
To over come this issue the following steps were performed.

Upgrade all the node in the cluster one after other.
Once all the nodes are upgraded, kill glusterd process alone.
Let other process keep running.
Now do a "iptables -F"
And then restart the glusterd on all the nodes
Try gluster peer state after this to check if the nodes are connected.

Comment 2 Nithya Balachandran 2019-04-01 06:50:09 UTC
Please explain why this happens and how the workaround solves the issue.

Comment 3 Atin Mukherjee 2019-04-01 07:58:40 UTC
> When we do a rolling upgrade of the cluster from 3.12, 4.1 or 5.5 to 6, the upgraded node goes into disconnected state. 

Isn't this only seen from 3.12 to 6 upgrade?

Comment 4 hari gowtham 2019-04-01 08:01:25 UTC
Hi Nithya,

The RCA for this is yet to be done.
I didn't find anything fishy in the logs.
As I had to move forward with the testing, I tried the usually way of flushing the iptables 
to check if it fixes the disconnects and it yes, it did connect the peers back.
The reason why this is happening is yet to be discovered.

Regards,
Hari.

Comment 5 hari gowtham 2019-04-01 08:04:41 UTC
(In reply to Atin Mukherjee from comment #3)
> > When we do a rolling upgrade of the cluster from 3.12, 4.1 or 5.5 to 6, the upgraded node goes into disconnected state. 
> 
> Isn't this only seen from 3.12 to 6 upgrade?

No, Atin. I issue happened with all the versions.
It could as well be some network issue with the machines I tried it on.
Not sure of it.

The point to note here is: some times just a glusterd restart fixed and 
in some scenarios it needed a iptables flush followed with glusterd restart.

But I found that the iptables flush with glusterd restart fixed it in every scenario i tried.

I could find time to debug this further.

Comment 6 Atin Mukherjee 2019-04-01 08:22:32 UTC
FYI.. I tested the rolling upgrade from glusterfs 3.12 latest to glusterfs-6 with out any issues.

Can some one else please try as well?

Comment 7 Sanju 2019-04-01 13:39:22 UTC
I've tested rolling upgrade from 3.12 to 6, but haven't seen any issue. The cluster is in a healthy state and all peers are in connected state. Based on my experience and comment 6, I'm closing this as not a bug. Please, feel free to re-open the bug if you face it.

Thanks,
Sanju

Comment 8 Nithya Balachandran 2019-04-02 06:57:53 UTC
(In reply to Sanju from comment #7)
> I've tested rolling upgrade from 3.12 to 6, but haven't seen any issue. The
> cluster is in a healthy state and all peers are in connected state. Based on
> my experience and comment 6, I'm closing this as not a bug. Please, feel
> free to re-open the bug if you face it.
> 
> Thanks,
> Sanju

What about the upgrades from the other versions? This BZ refers to upgrades to release 6 from 3.12, 4 and 5.

Comment 9 Sanju 2019-04-02 10:17:39 UTC
(In reply to Nithya Balachandran from comment #8) 
> What about the upgrades from the other versions? This BZ refers to upgrades
> to release 6 from 3.12, 4 and 5.

I did test upgrade to release 6 from 4 and 5. Haven't seen any issue.

Thanks,
Sanju


Note You need to log in before you can comment on or make changes to this bug.