Bug 957171

Summary: Client does not retry the invocation if a node is shutting down or undeploying and an EJB invocation reach the node in that state
Product: [JBoss] JBoss Enterprise Application Platform 6 Reporter: wfink
Component: EJBAssignee: Jaikiran Pai <jpai>
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 6.0.1CC: dandread, dimitris, jkudrnac, jmartisk, lcestari, lthon, mlittle, mnovak, myarboro, rjanik, rsvoboda
Target Milestone: ER7   
Target Release: EAP 6.1.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Deadline: 2013-05-03   

Description wfink 2013-04-26 14:01:44 UTC
Description of problem:
The EJB client should do a retry if the invocation can not be processed because it reached the node in a state where the EJB is not available.
This might happen during start/deploy/undeploy/shutdown.

How reproducible:
Due to another issue BZ900999 this might happen more often.
I use a simple application and script to call the EJB in a loop during permanent server restarts or deployments.

The problem happen with bz900999 in about 1 of 20 server stops



Steps to Reproduce:
1.
 I've used the ejb-multi-server quickstart [1], use the test EJB of app-one or app-two deployed at two nodes.

2.
 The client application call the EJB in a endles loop.
3.
The server is started/stopped by CLI with script:

   while [ 1 -eq 1 ]; do
     bin/jboss-cli.sh --connect --command="/host=master/server-config=app-oneA:stop(blocking=true)"
     bin/jboss-cli.sh --connect --command="/host=master/server-config=app-oneA:start(blocking=true)"
   done


  
[1] https://github.com/wfink/jboss-as-quickstart/tree/ejb-multi-server/ejb-multi-server

Comment 2 Jaikiran Pai 2013-04-30 14:08:18 UTC
Update - I've a solution to this which is currently being tested before I send out a PR.

Comment 9 Dimitris Andreadis 2013-05-01 16:42:16 UTC
Skynet must have started its career as a bugbot.

Comment 10 JBoss JIRA Server 2013-05-02 13:43:22 UTC
jaikiran pai <jpai> made a comment on jira WFLY-1308

Pull request sent

Comment 15 Jaikiran Pai 2013-05-06 07:10:16 UTC
Pull request has been merged

Comment 17 JBoss JIRA Server 2013-06-14 11:29:35 UTC
Paul Ferraro <paul.ferraro> made a comment on jira WFLY-1306

Fix is just a workaround.  Need to discuss a better way to improve transaction commit reliability with infinispan team.