Bug 1694948

Summary:	The OCS upgrade playbook does not enable server.tcp-user-timeout on all volumes
Product:	[Red Hat Storage] Red Hat Gluster Storage	Reporter:	Ashmitha Ambastha <asambast>
Component:	cns-ansible	Assignee:	Jose A. Rivera <jarrpa>
Status:	CLOSED WONTFIX	QA Contact:	Prasanth <pprakash>
Severity:	high	Docs Contact:
Priority:	unspecified
Version:	ocs-3.11	CC:	akhakhar, hchiramm, jarrpa, jmulligan, knarra, kramdoss, madam, pprakash, rhs-bugs, rtalur, sankarshan, sarumuga
Target Milestone:	---	Keywords:	ZStream
Target Release:	---
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2019-04-18 12:28:02 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks:	1703695

Description Ashmitha Ambastha 2019-04-02 06:55:30 UTC

Description of problem:
The OCS upgrade playbook does not enable server.tcp-user-timeout on all volumes. The Ansible playbook should set this option, as it is an important part of OCS upgrades.

The "server.tcp-user-timeout" option specifies the maximum amount of time (in seconds) that data transmitted from the application can remain unacknowledged by the brick.
It is used to detect forced disconnections and dead connections early (for example, if a node dies unexpectedly or a firewall is activated), which makes it possible for applications to reduce the overall failover time.
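
To illustrate the mechanism described above, here is a minimal Python sketch. It assumes the Gluster option corresponds to the Linux TCP_USER_TIMEOUT socket option, which takes its value in milliseconds; that mapping is an assumption and is not stated in this bug. The helper names seconds_to_ms and apply_user_timeout are hypothetical.

```python
import socket

# Assumption: Linux defines TCP_USER_TIMEOUT as 18 in <linux/tcp.h>; fall back
# to that number if this Python build does not expose the constant.
TCP_USER_TIMEOUT = getattr(socket, "TCP_USER_TIMEOUT", 18)


def seconds_to_ms(timeout_seconds):
    """Convert a timeout in seconds (the unit used by the Gluster option)
    to milliseconds (the unit used by the socket option)."""
    return int(timeout_seconds * 1000)


def apply_user_timeout(sock, timeout_seconds):
    """Hypothetical helper: ask the kernel to drop the connection when
    transmitted data stays unacknowledged longer than timeout_seconds.
    Only meaningful on platforms that support TCP_USER_TIMEOUT (Linux)."""
    sock.setsockopt(socket.IPPROTO_TCP, TCP_USER_TIMEOUT,
                    seconds_to_ms(timeout_seconds))
```

With a value of 42, a peer that stops acknowledging data would be treated as dead after roughly 42 seconds instead of waiting for the much longer default TCP retransmission limit.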

The OCS upgrade playbook is at the path:

../openshift-ansible/playbooks/openshift-glusterfs/upgrade.yml

Version-Release number of selected component (if applicable): All OCS versions

How reproducible: Always

Actual results: The OCS upgrade playbook does not set server.tcp-user-timeout to 42 on all volumes.

Expected results: The OCS upgrade playbook should set server.tcp-user-timeout to 42 on all volumes.
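
As a rough illustration of the expected result, the following Python sketch builds one `gluster volume set` command per volume. It is not the playbook's implementation. The function names and the example volume names are hypothetical, and it assumes the gluster CLI is reachable where the script runs (in an OCS deployment that would typically mean inside a glusterfs pod).

```python
import subprocess

OPTION = "server.tcp-user-timeout"
VALUE = "42"


def build_set_commands(volumes, option=OPTION, value=VALUE):
    """Return one 'gluster volume set' argument list per volume name."""
    return [["gluster", "volume", "set", vol, option, value]
            for vol in volumes]


def list_volumes():
    """Return volume names reported by 'gluster volume list'.
    Assumes the gluster CLI is installed and on PATH."""
    result = subprocess.run(["gluster", "volume", "list"],
                            check=True, capture_output=True, text=True)
    return [line.strip() for line in result.stdout.splitlines()
            if line.strip()]


if __name__ == "__main__":
    # Hypothetical volume names, used only to show the generated commands.
    for cmd in build_set_commands(["vol_example_1", "vol_example_2"]):
        print(" ".join(cmd))
```

Passing the output of list_volumes() to build_set_commands() and running each command with subprocess.run would apply the option to every existing volume.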

Comment 2 Jose A. Rivera 2019-04-04 20:11:43 UTC

Seems like a legit bug.

Comment 3 Raghavendra Talur 2019-04-18 12:28:02 UTC

The requirement to set server.tcp-user-timeout arose because RHGS 3.4.0 did not include it in the code defaults.

With https://bugzilla.redhat.com/show_bug.cgi?id=1623874 and https://code.engineering.redhat.com/gerrit/#/c/150699/, it is part of the RHGS defaults.

If we ask all users to upgrade to v3.11.0 or later, then we don't have to make any changes in openshift-ansible.

Closing the bug for the same reason.