Bug 1405004

Summary:	[Perf] : pcs cluster resources went into stopped state during Multithreaded perf tests on RHGS layered over RHEL 6
Product:	[Community] GlusterFS	Reporter:	Kaleb KEITHLEY <kkeithle>
Component:	common-ha	Assignee:	Kaleb KEITHLEY <kkeithle>
Status:	CLOSED CURRENTRELEASE	QA Contact:
Severity:	urgent	Docs Contact:
Priority:	unspecified
Version:	3.8	CC:	amukherj, asoman, bturner, bugs, dang, ffilz, jthottan, kkeithle, mbenjamin, rcyriac, rhinduja, rhs-bugs, skoduri, storage-qa-internal
Target Milestone:	---	Keywords:	Triaged
Target Release:	---
Hardware:	x86_64
OS:	Linux
Whiteboard:
Fixed In Version:	glusterfs-3.8.8	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:	1405002	Environment:
Last Closed:	2017-01-16 12:27:19 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:	1403587, 1404410, 1405002
Bug Blocks:

Comment 1 Worker Ant 2016-12-15 11:23:44 UTC

REVIEW: http://review.gluster.org/16140 (common-ha: explicitly set udpu transport for corosync) posted (#1) for review on release-3.8 by Kaleb KEITHLEY (kkeithle)

Comment 2 Worker Ant 2016-12-15 18:17:12 UTC

COMMIT: http://review.gluster.org/16140 committed in release-3.8 by Kaleb KEITHLEY (kkeithle) 
------
commit 846737955b7a42a79327f6c9076248eb1fd97b4d
Author: Kaleb S. KEITHLEY <kkeithle>
Date:   Thu Dec 15 06:22:02 2016 -0500

    common-ha: explicitly set udpu transport for corosync
    
    On RHEL7 corosync uses udpu (UDP unicast) by default. On RHEL6 the
    default is (now) UDP multicast. In network environments that don't
    support UDP multicast this causes ever-growing lists of
    [TOTEM ] Retransmit errors.
    
    Always specifying --transport udpu is thus a no-op on RHEL7.
    
    Using the same transport on both RHEL6 and RHEL7 may (or may not)
    give similar behavior and performance--it's hard to say.
    
    It remains a mystery why things have always worked on RHEL6 prior to
    now. Further investigation is required to uncover why this is the
    case.
    
    main http://review.gluster.org/16122
    main BZ 1404410
    release-3.9 http://review.gluster.org/16139/
    release-3.9 BZ 1405002
    
    Change-Id: I4d0de97fe4425c47f249beaaf51aeca3e91731fa
    BUG: 1405004
    Signed-off-by: Kaleb S. KEITHLEY <kkeithle>
    Reviewed-on: http://review.gluster.org/16140
    Reviewed-by: soumya k <skoduri>
    Smoke: Gluster Build System <jenkins.org>
    NetBSD-regression: NetBSD Build System <jenkins.org>
    CentOS-regression: Gluster Build System <jenkins.org>
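
For readers unfamiliar with the option named in the commit message, the following is a minimal sketch of what passing --transport udpu to cluster setup might look like. It is not the actual common-ha change. The helper name build_setup_cmd, the cluster name ha-demo, and the node names are hypothetical, and the --name form assumes the pcs 0.9-style command line. The helper only prints the command so it can be inspected without touching a cluster.

```shell
#!/bin/sh
# Sketch only: prints the pcs command line instead of executing it.
# build_setup_cmd is a hypothetical helper, not part of common-ha.
build_setup_cmd() {
    cluster_name="$1"
    shift
    # --transport udpu requests UDP unicast explicitly, so the result
    # does not depend on the platform default (udp multicast on RHEL6).
    echo "pcs cluster setup --name ${cluster_name} --transport udpu $*"
}

# Hypothetical cluster and node names, for illustration only.
build_setup_cmd ha-demo node1 node2
```

Because RHEL7 already defaults to udpu, the same command line should behave identically there, which is why the commit describes the option as a no-op on that platform.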

Comment 3 Worker Ant 2016-12-21 23:22:14 UTC

REVIEW: http://review.gluster.org/16260 (common-ha: explicitly set udpu transport for corosync) posted (#1) for review on release-3.8-fb by Kevin Vigor (kvigor)

Comment 4 Niels de Vos 2017-01-16 12:27:19 UTC

This bug is being closed because a release has been made available that should address the reported issue. If the problem is still not fixed with glusterfs-3.8.8, please open a new bug report.

glusterfs-3.8.8 has been announced on the Gluster mailing lists [1]; packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailing list [2] and the update infrastructure for your distribution.

[1] https://lists.gluster.org/pipermail/announce/2017-January/000064.html
[2] https://www.gluster.org/pipermail/gluster-users/