EAP 6.1.0.ER2: At server shutdown, we are seeing the following errors: perf20: 13:34:02,045 ERROR [org.infinispan.statetransfer.StateConsumerImpl] (OOB-60,shared=udp) ISPN000208: No live owners found for segment 17 of cache default-host/clusterbench. Current owners are: [perf21/web]. Faulty owners: [perf21/web] perf21 was at the time shutting down too and complaining about the same thing too (No live owners found). See the server logs here: https://jenkins.mw.lab.eng.bos.redhat.com/hudson/view/EAP6/view/EAP6-Clustering/view/EAP6-Failover/job/eap-6x-failover-http-session-jvmkill-dist-async/35/artifact/report/config/jboss-perf20/server.log https://jenkins.mw.lab.eng.bos.redhat.com/hudson/view/EAP6/view/EAP6-Clustering/view/EAP6-Failover/job/eap-6x-failover-http-session-jvmkill-dist-async/35/artifact/report/config/jboss-perf21/server.log This upstream jira seems to be dealing with something similar: https://issues.jboss.org/browse/ISPN-1239
Seen again in 6.1.0.ER8. Link to server log: https://jenkins.mw.lab.eng.bos.redhat.com/hudson/job/eap-6x-failover-http-session-undeploy-dist-async/28/artifact/report/config/jboss-perf18/server.log Link to job: https://jenkins.mw.lab.eng.bos.redhat.com/hudson/job/eap-6x-failover-http-session-undeploy-dist-async/28/
Still seeing this with EAP 6.1.1.ER7. For example: https://jenkins.mw.lab.eng.bos.redhat.com/hudson/job/eap-6x-failover-http-session-undeploy-dist-sync/28/ https://jenkins.mw.lab.eng.bos.redhat.com/hudson/job/eap-6x-failover-http-session-jvmkill-dist-sync/47/
This happens when all of the owners of a given leave the group, so that data can no longer be rebalanced. When using DIST, you need to either: * Increase the number of owners according to the expected availability * Stagger shutdown of nodes according to the number of owners * Ignore exceptions like this when you plan to shutdown everything
Marking for exclusion from Release Notes documentation as not a bug.