Bug 900481 (JBPAPP6-1418)

Summary:	CLONE - Shutdown in REPL SYNC results in TimeoutException: Unable to acquire lock after
Product:	[JBoss] JBoss Enterprise Application Platform 6	Reporter:	Radoslav Husar <rhusar>
Component:	Clustering	Assignee:	Paul Ferraro <paul.ferraro>
Status:	CLOSED CURRENTRELEASE	QA Contact:	Jitka Kozana <jkudrnac>
Severity:	high	Docs Contact:	Russell Dickenson <rdickens>
Priority:	high
Version:	6.0.0	CC:	atangrin, jkudrnac, lthon, misty, paul.ferraro, rhusar, sgilda, simone.gotti
Target Milestone:	---
Target Release:	EAP 6.1.0
Hardware:	Unspecified
OS:	Unspecified
URL:	http://jira.jboss.org/jira/browse/JBPAPP6-1418
Whiteboard:
Fixed In Version:		Doc Type:	Bug Fix
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2013-08-27 10:44:56 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks:	900611

Description Radoslav Husar 2012-05-21 14:37:03 UTC

Affects: Release Notes
project_key: JBPAPP6

Description track on AS7-4813.

Comment 1 Radoslav Husar 2012-05-21 14:37:04 UTC

Link: Added: This issue Cloned from AS7-4813

Comment 2 Radoslav Husar 2012-05-21 14:37:05 UTC

Link: Added: This issue is incorporated by JBPAPP-7577

Comment 3 Radoslav Husar 2012-05-21 14:37:39 UTC

Workflow: Removed: GIT Pull Request workflow  Added: jira
Security: Added: Public
Docs QE Status: Added: NEW

Comment 4 Rajesh Rajasekaran 2012-05-21 16:39:26 UTC

Labels: Added: eap6_need_triage

Comment 5 Rajesh Rajasekaran 2012-06-06 20:50:30 UTC

Link: Added: This issue is a dependency of JBPAPP-9290

Comment 6 Rajesh Rajasekaran 2012-06-06 20:54:08 UTC

Link: Removed: This issue is incorporated by JBPAPP-7577

Comment 7 Misty Stanley-Jones 2012-06-12 10:04:12 UTC

Release Notes Docs Status: Added: Documented as Known Issue
Release Notes Text: Added: A <code>TimeoutException</code> can occur during node shutdown when using REPL SYNC cache mode. The root cause of this issue is still under investigation.
Affects: Added: Release Notes

Comment 8 Rajesh Rajasekaran 2012-07-11 19:43:37 UTC

Labels: Removed: eap6_need_triage Added: eap601candidate

Comment 9 Rajesh Rajasekaran 2012-09-20 19:43:07 UTC

Labels: Removed: eap601candidate Added: eap601-qe-triage

Comment 10 Jitka Kozana 2012-10-02 12:10:46 UTC

I did not see this exception during EAP 6.0.1.ER2 testing.

Comment 11 Anne-Louise Tangring 2012-10-08 15:06:15 UTC

Not approved for EAP 6.0.1. If this should be reconsidered, please add the label: eap601-qe-triage

Comment 12 Anne-Louise Tangring 2012-10-08 15:06:15 UTC

Labels: Removed: eap601-qe-triage

Comment 13 Dana Mison 2012-10-16 05:52:08 UTC

Writer: Added: mistysj

Comment 14 Misty Stanley-Jones 2012-10-18 02:12:06 UTC

Jitka, is it fixed then? This affects how we release-note it.

Comment 15 Misty Stanley-Jones 2012-10-18 02:23:56 UTC

Release Notes Docs Status: Removed: Documented as Known Issue Added: Needs More Info

Comment 16 Jitka Kozana 2012-10-18 06:09:43 UTC

We need feedback from development here.
Paul, can you please comment?

Comment 17 sgilda 2012-10-29 19:06:44 UTC

I pinged Paul Ferraro on IRC. He said he would comment on this JIRA today.

Comment 18 Paul Ferraro 2012-10-30 14:55:31 UTC

I'm fairly certain that this issue was made obsolete by the fix for https://issues.jboss.org/browse/AS7-5379.
Previously, commits were sent asynchronously, regardless of cache mode.  Now, when using synchronous mode, commits will be sent synchronously.  Consequently, the condition described by Dan's last comment in AS7-4813 will not happen.

Comment 20 Paul Ferraro 2012-10-30 19:20:40 UTC

Not quite - because commits were sent asynchronously, it meant that a request for a given session will return before the cache operation completes - thus a subsequent request can be received by another server (because the server is being shutdown, requests for the particular session will get redirected to another node), and the lock acquisition for that session could contend with lock acquisition by the original node's commit.

Comment 21 sgilda 2012-10-30 19:36:50 UTC

Paul, thanks for the clarification.

Comment 22 Misty Stanley-Jones 2012-11-07 06:17:50 UTC

Release Notes Docs Status: Removed: Needs More Info Added: Documented as Resolved Issue
Release Notes Text: Removed: A <code>TimeoutException</code> can occur during node shutdown when using REPL SYNC cache mode. The root cause of this issue is still under investigation. Added: A `TimeoutException` with the message `Undable to acquire lock` could sometimes occur during node shutdown when using REPL SYNC cache mode. This occurred because commits were sent asynchronously regardless of cache mode. Therefore, request for a given session could return before the cache operation completed. A subsequent request could be received by a different server, since requests to a server which is shutting down are redirected to a different node. The lock acquisition for that session could contend with lock acquisition by the original node's commit. Commits are now done synchronously so there should not be any issues acquiring locks.

Comment 23 Misty Stanley-Jones 2012-11-07 06:19:03 UTC

I've documented this as a resolved issue, based upon Paul's feedback. It seems like the JIRA should be "Resolved" in that case, but I will leave it up to others.

Comment 24 Anne-Louise Tangring 2012-11-13 20:57:57 UTC

Release Notes Docs Status: Removed: Documented as Resolved Issue 
Writer: Removed: mistysj 
Release Notes Text: Removed: A `TimeoutException` with the message `Undable to acquire lock` could sometimes occur during node shutdown when using REPL SYNC cache mode. This occurred because commits were sent asynchronously regardless of cache mode. Therefore, request for a given session could return before the cache operation completed. A subsequent request could be received by a different server, since requests to a server which is shutting down are redirected to a different node. The lock acquisition for that session could contend with lock acquisition by the original node's commit. Commits are now done synchronously so there should not be any issues acquiring locks. 
Docs QE Status: Removed: NEW

Comment 27 Radoslav Husar 2013-08-27 10:44:56 UTC

Closing as per last comment.