Bug 900481 (JBPAPP6-1418)

Summary: CLONE - Shutdown in REPL SYNC results in TimeoutException: Unable to acquire lock after
Product: [JBoss] JBoss Enterprise Application Platform 6 Reporter: Radoslav Husar <rhusar>
Component: ClusteringAssignee: Paul Ferraro <paul.ferraro>
Status: CLOSED CURRENTRELEASE QA Contact: Jitka Kozana <jkudrnac>
Severity: high Docs Contact: Russell Dickenson <rdickens>
Priority: high    
Version: 6.0.0CC: atangrin, jkudrnac, lthon, misty, paul.ferraro, rhusar, sgilda, simone.gotti
Target Milestone: ---   
Target Release: EAP 6.1.0   
Hardware: Unspecified   
OS: Unspecified   
URL: http://jira.jboss.org/jira/browse/JBPAPP6-1418
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-08-27 10:44:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 900611    

Description Radoslav Husar 2012-05-21 14:37:03 UTC
Affects: Release Notes
project_key: JBPAPP6

Description track on AS7-4813.

Comment 1 Radoslav Husar 2012-05-21 14:37:04 UTC
Link: Added: This issue Cloned from AS7-4813


Comment 2 Radoslav Husar 2012-05-21 14:37:05 UTC
Link: Added: This issue is incorporated by JBPAPP-7577


Comment 3 Radoslav Husar 2012-05-21 14:37:39 UTC
Workflow: Removed: GIT Pull Request workflow  Added: jira
Security: Added: Public
Docs QE Status: Added: NEW


Comment 4 Rajesh Rajasekaran 2012-05-21 16:39:26 UTC
Labels: Added: eap6_need_triage


Comment 5 Rajesh Rajasekaran 2012-06-06 20:50:30 UTC
Link: Added: This issue is a dependency of JBPAPP-9290


Comment 6 Rajesh Rajasekaran 2012-06-06 20:54:08 UTC
Link: Removed: This issue is incorporated by JBPAPP-7577 


Comment 7 Misty Stanley-Jones 2012-06-12 10:04:12 UTC
Release Notes Docs Status: Added: Documented as Known Issue
Release Notes Text: Added: A <code>TimeoutException</code> can occur during node shutdown when using REPL SYNC cache mode. The root cause of this issue is still under investigation.
Affects: Added: Release Notes


Comment 8 Rajesh Rajasekaran 2012-07-11 19:43:37 UTC
Labels: Removed: eap6_need_triage Added: eap601candidate


Comment 9 Rajesh Rajasekaran 2012-09-20 19:43:07 UTC
Labels: Removed: eap601candidate Added: eap601-qe-triage


Comment 10 Jitka Kozana 2012-10-02 12:10:46 UTC
I did not see this exception during EAP 6.0.1.ER2 testing.

Comment 11 Anne-Louise Tangring 2012-10-08 15:06:15 UTC
Not approved for EAP 6.0.1. If this should be reconsidered, please add the label: eap601-qe-triage

Comment 12 Anne-Louise Tangring 2012-10-08 15:06:15 UTC
Labels: Removed: eap601-qe-triage 


Comment 13 Dana Mison 2012-10-16 05:52:08 UTC
Writer: Added: mistysj


Comment 14 Misty Stanley-Jones 2012-10-18 02:12:06 UTC
Jitka, is it fixed then? This affects how we release-note it.

Comment 15 Misty Stanley-Jones 2012-10-18 02:23:56 UTC
Release Notes Docs Status: Removed: Documented as Known Issue Added: Needs More Info


Comment 16 Jitka Kozana 2012-10-18 06:09:43 UTC
We need feedback from development here.
Paul, can you please comment? 

Comment 17 sgilda 2012-10-29 19:06:44 UTC
I pinged Paul Ferraro on IRC. He said he would comment on this JIRA today.

Comment 18 Paul Ferraro 2012-10-30 14:55:31 UTC
I'm fairly certain that this issue was made obsolete by the fix for https://issues.jboss.org/browse/AS7-5379.
Previously, commits were sent asynchronously, regardless of cache mode.  Now, when using synchronous mode, commits will be sent synchronously.  Consequently, the condition described by Dan's last comment in AS7-4813 will not happen.

Comment 20 Paul Ferraro 2012-10-30 19:20:40 UTC
Not quite - because commits were sent asynchronously, it meant that a request for a given session will return before the cache operation completes - thus a subsequent request can be received by another server (because the server is being shutdown, requests for the particular session will get redirected to another node), and the lock acquisition for that session could contend with lock acquisition by the original node's commit.

Comment 21 sgilda 2012-10-30 19:36:50 UTC
Paul, thanks for the clarification. 

Comment 22 Misty Stanley-Jones 2012-11-07 06:17:50 UTC
Release Notes Docs Status: Removed: Needs More Info Added: Documented as Resolved Issue
Release Notes Text: Removed: A <code>TimeoutException</code> can occur during node shutdown when using REPL SYNC cache mode. The root cause of this issue is still under investigation. Added: A `TimeoutException` with the message `Undable to acquire lock` could sometimes occur during node shutdown when using REPL SYNC cache mode. This occurred because commits were sent asynchronously regardless of cache mode. Therefore, request for a given session could return before the cache operation completed. A subsequent request could be received by a different server, since requests to a server which is shutting down are redirected to a different node. The lock acquisition for that session could contend with lock acquisition by the original node's commit. Commits are now done synchronously so there should not be any issues acquiring locks.


Comment 23 Misty Stanley-Jones 2012-11-07 06:19:03 UTC
I've documented this as a resolved issue, based upon Paul's feedback. It seems like the JIRA should be "Resolved" in that case, but I will leave it up to others.

Comment 24 Anne-Louise Tangring 2012-11-13 20:57:57 UTC
Release Notes Docs Status: Removed: Documented as Resolved Issue 
Writer: Removed: mistysj 
Release Notes Text: Removed: A `TimeoutException` with the message `Undable to acquire lock` could sometimes occur during node shutdown when using REPL SYNC cache mode. This occurred because commits were sent asynchronously regardless of cache mode. Therefore, request for a given session could return before the cache operation completed. A subsequent request could be received by a different server, since requests to a server which is shutting down are redirected to a different node. The lock acquisition for that session could contend with lock acquisition by the original node's commit. Commits are now done synchronously so there should not be any issues acquiring locks. 
Docs QE Status: Removed: NEW 


Comment 27 Radoslav Husar 2013-08-27 10:44:56 UTC
Closing as per last comment.