Bug 1108018
Summary: | peers go into `Peer Rejected' state while doing rolling upgrade | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Sachidananda Urs <surs> |
Component: | core | Assignee: | Kaushal <kaushal> |
Status: | CLOSED ERRATA | QA Contact: | Sachidananda Urs <surs> |
Severity: | urgent | Docs Contact: | |
Priority: | high | ||
Version: | rhgs-3.0 | CC: | amukherj, kaushal, kparthas, lmohanty, nsathyan, psriniva, rcyriac, rhs-bugs, ssamanta, storage-qa-internal |
Target Milestone: | --- | ||
Target Release: | RHGS 3.0.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | glusterfs-3.6.0.19-1 | Doc Type: | Bug Fix |
Doc Text: |
Previously, the glusterFS management service was not backward compatible with the Red Hat Storage 2.1 version. As a result, the peers entered the peer reject state during the rolling upgrade from Red Hat Storage 2.1. With this fix, the glusterFS management service is made backward compatible and the peers no longer enter a 'peer reject' state.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2014-09-22 19:41:02 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1067342 |
Description
Sachidananda Urs
2014-06-11 10:07:48 UTC
Please find sosreports at: http://rhsqe-repo.lab.eng.blr.redhat.com/sosreports/1108018/ RHS-2.1 doesn't store a volume's calculated operating versions (volinfo.op-version and volinfo.client-op-version), where as RHS-3.0 does this. This can lead to peers entering the rejected state during rolling upgrades. I'll be sending a fix for RHS-3.0 for this issue right away. Please, mark this bug as a blocker. After discussing with Engineering Leads in RHS3.0 Status meeting providing the qa_ack. Sending an additional patch to with a small fix to the earlier patch. New patch is under review at https://code.engineering.redhat.com/gerrit/27112 Patch has been merged. It should be available in the next build. BVT tests related to peer probe failed on build glusterfs-server-3.6.0.18-1.el6rhs.x86_64. The peer probe on a new host stays in "Probe Sent to Peer (Connected)" and did not go to "Peer in Cluster (Connected)" [root@rhsauto056 ~]# gluster peer status Number of Peers: 2 Hostname: rhsauto057.lab.eng.blr.redhat.com Uuid: 168b3f1c-2720-4cce-a1cd-d929d32aa032 State: Peer in Cluster (Connected) Hostname: rhsauto022.lab.eng.blr.redhat.com Uuid: 68bf9e48-b239-45d0-ae8f-56cc9abb6c4a State: Probe Sent to Peer (Connected) The glusterd log in the PEER node reported error about "op-version"as copied below. After taking to Kaushal, we concluded that the above patch in #7 fixes the issue. [2014-06-17 20:55:13.311898] I [glusterd-handshake.c:1014:__glusterd_mgmt_hndsk_versions_ack] 0-management: using the op-version 30000 [2014-06-17 20:55:13.311979] E [store.c:432:gf_store_handle_retrieve] 0-: Path corresponding to /var/lib/glusterd/glusterd.info, returned error: (No such file or directory) [2014-06-17 20:55:13.312051] I [glusterd.c:176:glusterd_uuid_generate_save] 0-management: generated UUID: 68bf9e48-b239-45d0-ae8f-56cc9abb6c4a [2014-06-17 20:55:13.318232] I [glusterd-handler.c:2603:__glusterd_handle_probe_query] 0-glusterd: Received probe from uuid: dfa8e32d-e270-478e-bfa0-30de617a01db [2014-06-17 20:55:13.320686] I [glusterd-handler.c:2631:__glusterd_handle_probe_query] 0-glusterd: Unable to find peerinfo for host: 10.70.40.131 (24007) [2014-06-17 20:55:13.323241] I [rpc-clnt.c:969:rpc_clnt_connection_init] 0-management: setting frame-timeout to 600 [2014-06-17 20:55:13.326728] I [glusterd-handler.c:3172:glusterd_friend_add] 0-management: connect returned 0 [2014-06-17 20:55:13.326937] I [glusterd-handler.c:2655:__glusterd_handle_probe_query] 0-glusterd: Responded to 10.70.40.131, op_ret: 0, op_errno: 0, ret: 0 [2014-06-17 20:55:13.329205] I [glusterd-handler.c:2307:__glusterd_handle_incoming_friend_req] 0-glusterd: Received probe from uuid: dfa8e32d-e270-478e-bfa0-30de617a01db [2014-06-17 20:55:13.346039] E [glusterd-utils.c:4159:gd_import_volume_op_versions] 0-management: volume1.op-version missing in payload for hosdu [2014-06-17 20:55:13.346152] E [glusterd-utils.c:4449:glusterd_import_volinfo] 0-glusterd: Failed to import op-versions for volume hosdu [2014-06-17 20:55:13.346202] E [glusterd-sm.c:1084:glusterd_friend_sm] 0-glusterd: handler returned: -2 Verified on: glusterfs 3.6.0.22 Looks good. Hi Kaushal, Please review the edited doc text and sign off on the technical accuracy. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHEA-2014-1278.html |