1192324 – Inconsistent logging for starting and finished rebalance

Bug 1192324 - Inconsistent logging for starting and finished rebalance

Summary: Inconsistent logging for starting and finished rebalance

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	JBoss Data Grid 6
Classification:	JBoss
Component:	Infinispan
Sub Component:
Version:	6.4.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	ER1
Target Release:	6.4.1
Assignee:	Dan Berindei
QA Contact:	Martin Gencur
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1200947
TreeView+	depends on / blocked

Reported:	2015-02-13 07:31 UTC by Osamu Nagano
Modified:	2019-04-16 14:36 UTC (History)
CC List:	9 users (show)
Fixed In Version:
Clone Of:
Clones:	1200947 (view as bug list)
Environment:
Last Closed:	2015-04-02 12:14:01 UTC
Type:	Bug
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Issue Tracker	ISPN-5287	0	Major	Resolved	Inconsistent logging for starting and finished rebalance	2017-10-06 04:03:51 UTC
Red Hat Issue Tracker	PRODMGT-930	0	Major	Resolved	Rebalancing log level	2017-10-06 04:03:50 UTC

Description Osamu Nagano 2015-02-13 07:31:14 UTC

Description of problem:
In JDG 6.4.0, starting and finished cluster-wide (and local, too) rebalance events are logged in INFO level for better administration.

~~~
15:51:46,624 INFO  [org.infinispan.CLUSTER] (remote-thread-0) ISPN000310: Starting cluster-wide rebalance for cache default, topology CacheTopology{id=1, rebalanceId=1, currentCH=DefaultConsistentHash{ns = 20, owners = (1)[svr01/clustered: 20+0]}, pendingCH=DefaultConsistentHash{ns = 20, owners = (2)[svr01/clustered: 10+10, svr02/clustered: 10+10]}, unionCH=null, actualMembers=[svr01/clustered, svr02/clustered]}
15:51:46,629 INFO  [org.infinispan.CLUSTER] (undefined) ISPN000328: Finished local rebalance for cache default on node svr01/clustered, topology id = 1
15:51:46,696 INFO  [org.infinispan.CLUSTER] (remote-thread-2) ISPN000328: Finished local rebalance for cache default on node svr02/clustered, topology id = 1
~~~

But finished cluster-wide rebalance event is logged in DEBUG level, which is not visible by default.

~~~
15:51:46,697 DEBUG [org.infinispan.topology.ClusterCacheStatus] (remote-thread-2) Finished cluster-wide rebalance for cache default, topology id = 1
~~~


Version-Release number of selected component (if applicable):
infinispan-core-6.2.0.Final-redhat-4.jar


How reproducible:
Always.


Steps to Reproduce:
Just start two out-of-the-box clustered instances.  The following debug category should have been set.
~~~
            <logger category="org.infinispan.topology.ClusterCacheStatus">
                <level name="DEBUG"/>
            </logger>
~~~


Actual results:
See "Description of problem".


Expected results:
Finished cluster-wide rebalance event should be in INFO and org.infinispan.CLUSTER category as well.


Additional info:
Thread named "(undefined)" reduces debuggability.  This should be addressed as well.

Comment 4 Alan Field 2015-02-13 08:22:49 UTC

Marking for JDG 6.4.1

Comment 6 Dan Berindei 2015-03-11 17:03:00 UTC

PR: https://github.com/infinispan/jdg/pull/554

The PR does not address the (undefined) thread name, as I couldn't find the cause. The thread is from the "transport executor" thread pool, which seems to be properly defined in the configuration.

Comment 7 Martin Gencur 2015-03-12 13:54:40 UTC

Osamu, could you please check with the customer whether the fix is sufficient for them? Thanks

Comment 8 Vojtech Juranek 2015-03-12 14:02:50 UTC

Hi,
I'm not bale to reproduce "undefined" thread name in log (in my log thread names appear correctly). If customer wants this to be fixed as well, it would be nice to provide a reproducer. Thanks!

Comment 10 Alan Field 2015-03-12 14:21:46 UTC

I saw these log messages while verifying BZ1198211:

10:17:07,284 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(1)-127.0.0.1) ISPN000309: Rebalancing suspended
10:17:22,450 INFO  [org.infinispan.remoting.transport.jgroups.JGroupsTransport] (Incoming-2,shared=udp) ISPN000094: Received new cluster view: [node0/clustered|1] (2) [node0/clustered, node1/clustered]
10:17:38,429 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000308: Rebalancing enabled
10:17:38,434 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000310: Starting cluster-wide rebalance for cache default, topology CacheTopology{id=1, rebalanceId=1, currentCH=DefaultConsistentHash{ns = 20, owners = (1)[node0/clustered: 20+0]}, pendingCH=DefaultConsistentHash{ns = 20, owners = (2)[node0/clustered: 10+10, node1/clustered: 10+10]}, unionCH=null, actualMembers=[node0/clustered, node1/clustered]}
10:17:38,446 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000310: Starting cluster-wide rebalance for cache ___hotRodTopologyCache, topology CacheTopology{id=1, rebalanceId=1, currentCH=ReplicatedConsistentHash{ns = 60, owners = (1)[node0/clustered: 60]}, pendingCH=ReplicatedConsistentHash{ns = 60, owners = (2)[node0/clustered: 30, node1/clustered: 30]}, unionCH=null, actualMembers=[node0/clustered, node1/clustered]}
10:17:38,453 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000310: Starting cluster-wide rebalance for cache ___protobuf_metadata, topology CacheTopology{id=1, rebalanceId=1, currentCH=ReplicatedConsistentHash{ns = 60, owners = (1)[node0/clustered: 60]}, pendingCH=ReplicatedConsistentHash{ns = 60, owners = (2)[node0/clustered: 30, node1/clustered: 30]}, unionCH=null, actualMembers=[node0/clustered, node1/clustered]}
10:17:38,456 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000310: Starting cluster-wide rebalance for cache memcachedCache, topology CacheTopology{id=1, rebalanceId=1, currentCH=DefaultConsistentHash{ns = 20, owners = (1)[node0/clustered: 20+0]}, pendingCH=DefaultConsistentHash{ns = 20, owners = (2)[node0/clustered: 10+10, node1/clustered: 10+10]}, unionCH=null, actualMembers=[node0/clustered, node1/clustered]}
10:17:38,460 INFO  [org.infinispan.CLUSTER] (RMI TCP Connection(5)-127.0.0.1) ISPN000310: Starting cluster-wide rebalance for cache namedCache, topology CacheTopology{id=1, rebalanceId=1, currentCH=DefaultConsistentHash{ns = 80, owners = (1)[node0/clustered: 80+0]}, pendingCH=DefaultConsistentHash{ns = 80, owners = (2)[node0/clustered: 40+40, node1/clustered: 40+40]}, unionCH=null, actualMembers=[node0/clustered, node1/clustered]}
10:17:38,496 INFO  [org.infinispan.CLUSTER] (remote-thread-0) ISPN000336: Finished cluster-wide rebalance for cache default, topology id = 1
10:17:38,504 INFO  [org.infinispan.CLUSTER] (remote-thread-0) ISPN000336: Finished cluster-wide rebalance for cache memcachedCache, topology id = 1
10:17:38,506 INFO  [org.infinispan.CLUSTER] (remote-thread-1) ISPN000336: Finished cluster-wide rebalance for cache ___hotRodTopologyCache, topology id = 1
10:17:38,520 INFO  [org.infinispan.CLUSTER] (remote-thread-1) ISPN000336: Finished cluster-wide rebalance for cache ___protobuf_metadata, topology id = 1
10:17:38,523 INFO  [org.infinispan.CLUSTER] (remote-thread-0) ISPN000336: Finished cluster-wide rebalance for cache namedCache, topology id = 1

Comment 12 Osamu Nagano 2015-03-13 00:27:50 UTC

I also confirmed the fix in JDG_6.4.1.ER1.  "Finished cluster-wide rebalance" is INFO and "Finished local rebalance" is DEBUG now.

For "undefined" thread name, I found an upstream fix, https://issues.jboss.org/browse/ISPN-4989, which may be related to this.

Note You need to log in before you can comment on or make changes to this bug.