Bug 1694948

Summary: The OCS upgrade playbook does not enable server.tcp-user-timeout on all volumes
Product: [Red Hat Storage] Red Hat Gluster Storage Reporter: Ashmitha Ambastha <asambast>
Component: cns-ansibleAssignee: Jose A. Rivera <jarrpa>
Status: CLOSED WONTFIX QA Contact: Prasanth <pprakash>
Severity: high Docs Contact:
Priority: unspecified    
Version: ocs-3.11CC: akhakhar, hchiramm, jarrpa, jmulligan, knarra, kramdoss, madam, pprakash, rhs-bugs, rtalur, sankarshan, sarumuga
Target Milestone: ---Keywords: ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-04-18 12:28:02 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1703695    

Description Ashmitha Ambastha 2019-04-02 06:55:30 UTC
Description of problem:
The OCS upgrade playbook misses enabling server.tcp-user-timeout on all volumes. The ansible playbook should be enabling the server.tcp-user-timeout as it is an important part of OCS upgrades. 

The "server.tcp-user-timeout" option specifies the maximum amount of the time (in seconds) the transmitted data from the application can remain unacknowledged from the brick.
It is used to detect force disconnections and dead connections (if a node dies unexpectedly, a firewall is activated, etc.,) early and make it possible for applications to reduce the overall failover time.

OCS upgrade playbook at the path : 

../openshift-ansible/playbooks/openshift-glusterfs/upgrade.yml

Version-Release number of selected component (if applicable): All OCS versions

How reproducible: always. 

Actual results: OCS upgrade playbook does not enable server.tcp-user-timeout to 42 for all the volumes. 

Expected results: OCS upgrade playbook should enable server.tcp-user-timeout to 42 for all the volumes.

Comment 2 Jose A. Rivera 2019-04-04 20:11:43 UTC
Seems like a legit bug.

Comment 3 Raghavendra Talur 2019-04-18 12:28:02 UTC
The requirement to set the server.tcp-user-timeout came because RHGS 3.4.0 was missing it in the code defaults.

With https://bugzilla.redhat.com/show_bug.cgi?id=1623874 and https://code.engineering.redhat.com/gerrit/#/c/150699/, it is part of the RHGS defaults.

If we ask all users to upgrade to v3.11.0 or onwards then we don't have to make any changes in the openshift-ansible.

Closing the bug for the same reason.