Description of problem: If a multithreaded client fire put operations in parallel and one server of the cluster (here one of two) will be stopped the client will see sporadical an Exception but might continue after the server is down completely. org.infinispan.client.hotrod.exceptions.HotRodClientException:Request for message id[19416] returned server error (status=0x85): org.infinispan.commons.CacheException: Could not commit implicit transaction at org.infinispan.client.hotrod.impl.protocol.Codec20.checkForErrorsInResponseStatus(Codec20.java:343) at org.infinispan.client.hotrod.impl.protocol.Codec20.readPartialHeader(Codec20.java:133) at org.infinispan.client.hotrod.impl.protocol.Codec20.readHeader(Codec20.java:119) at org.infinispan.client.hotrod.impl.operations.HotRodOperation.readHeaderAndValidate(HotRodOperation.java:56) at org.infinispan.client.hotrod.impl.operations.AbstractKeyValueOperation.sendPutOperation(AbstractKeyValueOperation.java:56) at org.infinispan.client.hotrod.impl.operations.PutIfAbsentOperation.executeOperation(PutIfAbsentOperation.java:37) at org.infinispan.client.hotrod.impl.operations.RetryOnFailureOperation.execute(RetryOnFailureOperation.java:56) at org.infinispan.client.hotrod.impl.RemoteCacheImpl.putIfAbsent(RemoteCacheImpl.java:263) at org.infinispan.client.hotrod.impl.RemoteCacheSupport.putIfAbsent(RemoteCacheSupport.java:54) at HotRodTestClient.updateCache(HotRodTestClient.java:69) at HotRodTestClient.lambda$queuePut$0(HotRodTestClient.java:89) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745)
Created attachment 1133828 [details] client/server log and jstack files (2sec delay) - several times stop/start server2
The client stuck for about 30sec until server2 is shutting down and show the following exception after continue: 2016-03-07 16:38:05,070 WARN [Codec21] (pool-1-thread-3:) ISPN004005: Error received from the server: org.infinispan.commons.CacheException: Could not commit implicit transaction Exception in thread 14, thread will die. org.infinispan.client.hotrod.exceptions.HotRodClientException:Request for message id[19416] returned server error (status=0x85): org.infinispan.commons.CacheException: Could not commit implicit transaction at org.infinispan.client.hotrod.impl.protocol.Codec20.checkForErrorsInResponseStatus(Codec20.java:343) at org.infinispan.client.hotrod.impl.protocol.Codec20.readPartialHeader(Codec20.java:133) at org.infinispan.client.hotrod.impl.protocol.Codec20.readHeader(Codec20.java:119) at org.infinispan.client.hotrod.impl.operations.HotRodOperation.readHeaderAndValidate(HotRodOperation.java:56) at org.infinispan.client.hotrod.impl.operations.AbstractKeyValueOperation.sendPutOperation(AbstractKeyValueOperation.java:56) at org.infinispan.client.hotrod.impl.operations.PutIfAbsentOperation.executeOperation(PutIfAbsentOperation.java:37) at org.infinispan.client.hotrod.impl.operations.RetryOnFailureOperation.execute(RetryOnFailureOperation.java:56) at org.infinispan.client.hotrod.impl.RemoteCacheImpl.putIfAbsent(RemoteCacheImpl.java:263) at org.infinispan.client.hotrod.impl.RemoteCacheSupport.putIfAbsent(RemoteCacheSupport.java:54) at HotRodTestClient.updateCache(HotRodTestClient.java:69) at HotRodTestClient.lambda$queuePut$0(HotRodTestClient.java:89) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745)
Created attachment 1133890 [details] second try server with ispn=TRACE This time the client and server1 did not recover after server2 finished shutdown
This bug was accidentally moved from POST to MODIFIED via an error in automation, please see mmccune with any questions
Sebastian Łaskawiec <slaskawi> updated the status of jira ISPN-5507 to Closed
Sebastian Łaskawiec <slaskawi> updated the status of jira ISPN-6341 to Closed
Tristan Tarrant <ttarrant> updated the status of jira ISPN-5507 to Reopened
Tristan Tarrant <ttarrant> updated the status of jira ISPN-5507 to Resolved
Tristan Tarrant <ttarrant> updated the status of jira ISPN-6341 to Reopened
Tristan Tarrant <ttarrant> updated the status of jira ISPN-6341 to Resolved
Dan Berindei <dberinde> updated the status of jira JDG-83 to Resolved