Bug 1511903
Summary: | 3.8 -> 3.12 rolling upgrade fails | ||
---|---|---|---|
Product: | [Community] GlusterFS | Reporter: | Denis Chaplygin <dchaplyg> |
Component: | glusterd | Assignee: | bugs <bugs> |
Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 3.12 | CC: | amukherj, bugs, dchaplyg, ederevea |
Target Milestone: | --- | Keywords: | Triaged |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2017-11-14 06:03:06 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Denis Chaplygin
2017-11-10 11:59:54 UTC
After upgrading have you ensured that you have bumped up the op-version? If no, please do the same and then restart glusterd service of all the nodes to see if they can get accepted in the cluster. If the above is ensured and still the issue persists, can you please share the following file from all the nodes? cat /var/lib/glusterd/vols/remote/info No, i didn't as didn't finished upgrade of my cluster. All nodes was still 3.8, while just a single node became 3.12 Please provide the following: Output of cat /var/lib/glusterd/vols/remote/info from 172.19.11.7 & the node where the peer got rejected i.e. the new node from where you have attached the log. Sorry, I can't reproduce this on a clean environment and the one where we found this bug was already rebuilt. Basically the steps I did were: 1) set up a HyperConverged oVirt 4.1 environment with 3 NGN hosts, create some VMs and let them run for a few weeks 2) upgrade one host to 4.2 beta Not sure how we can proceed here without a stable reproducer, but just to clarify - what needs to mismatch to get the "Version of Cksums data differ"? What file is checksummed and how is this checksum computed? Here's the code that seems to be responsible: [1] https://github.com/gluster/glusterfs/blob/master/xlators/mgmt/glusterd/src/glusterd-utils.c#L3386 |