Bug 1351323

Summary: Improve MMR replication convergence
Product: Red Hat Enterprise Linux 7 Reporter: Marcel Kolaja <mkolaja>
Component: 389-ds-baseAssignee: Noriko Hosoi <nhosoi>
Status: CLOSED ERRATA QA Contact: Viktor Ashirov <vashirov>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 7.3CC: arubin, ekeck, mkolaja, mreynolds, nhosoi, nkinder, pbokoc, rmeggins, salmy, snagar
Target Milestone: rcKeywords: ZStream
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: 389-ds-base-1.3.4.0-33.el7_2 Doc Type: Enhancement
Doc Text:
In a multi-master replication environment where multiple masters receive updates at the same time, it was previously possible for a single master to obtain exclusive access to a replica and hold it for a very long time due to problems such as a slow network connection. During this time, other masters were blocked from accessing the same replica, which considerably slowed down the replication process. This update adds a new configuration attribute, "nsds5ReplicaReleaseTimeout", which can be used to specify a timeout in seconds. After the specified timeout period passes, the master releases the replica, allowing other masters to access it and send their updates.
Story Points: ---
Clone Of: 1349571 Environment:
Last Closed: 2016-08-02 18:37:42 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1349571    
Bug Blocks:    

Description Marcel Kolaja 2016-06-29 17:56:44 UTC
This bug has been copied from bug #1349571 and has been proposed
to be backported to 7.2 z-stream (EUS).

Comment 5 Viktor Ashirov 2016-07-25 06:42:29 UTC
Build tested:
389-ds-base-1.3.4.0-33.el7_2

Reliability15 test was executed with replicas configured to use nsds5ReplicaReleaseTimeout: 60
Results: http://storm.idmqe.lab.eng.bos.redhat.com/qa/archive/ds/rhel72/reliab15/run4_1.3.4.0-33-convergence-fix/out/repl-monitor-output/

No issues found during the test. Convergence was achieved much sooner. Time lag was < 10 mins, compared to upto 5 hours with the default settings.

Marking as verified.

Comment 7 errata-xmlrpc 2016-08-02 18:37:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2016-1544.html