Bug 745857 (EDG-101) - HotRod: InvalidResponseException: Invalid magic number. Expected a1 and received 6e
Summary: HotRod: InvalidResponseException: Invalid magic number. Expected a1 and recei...
Keywords:
Status: CLOSED NEXTRELEASE
Alias: EDG-101
Product: JBoss Data Grid 5
Classification: JBoss
Component: Infinispan
Version: EAP 5.1.0 EDG TP
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: EAP 5.1.0 EDG TP
Assignee: Default User
QA Contact:
URL: http://jira.jboss.org/jira/browse/EDG...
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2011-03-08 14:37 UTC by Radoslav Husar
Modified: 2011-10-11 17:06 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2011-05-03 09:53:26 UTC
Type: Bug


Attachments (Terms of Use)
hotrod-client.zip (58.70 KB, application/zip)
2011-03-31 15:03 UTC, Michal Linhard
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 745867 0 urgent CLOSED HotRod: InvalidResponseException:: Invalid message id. Expected cfec and received 84b6> 2021-02-22 00:41:40 UTC
Red Hat Issue Tracker EDG-101 0 None None None Never

Internal Links: 745867

Description Radoslav Husar 2011-03-08 14:37:51 UTC
project_key: EDG

Strange thing happening while some nodes failed and rehashing is going on. Regardless, this looks like a result of a bigger problem.

{code}
        org.infinispan.client.hotrod.exceptions.InvalidResponseException: Invalid magic number. Expected a1 and received 6e
	at org.infinispan.client.hotrod.impl.operations.HotRodOperation.readHeaderAndValidate(HotRodOperation.java:84)
	at org.infinispan.client.hotrod.impl.operations.AbstractKeyValueOperation.sendPutOperation(AbstractKeyValueOperation.java:48)
	at org.infinispan.client.hotrod.impl.operations.PutOperation.executeOperation(PutOperation.java:27)
	at org.infinispan.client.hotrod.impl.operations.RetryOnFailureOperation.execute(RetryOnFailureOperation.java:38)
	at org.infinispan.client.hotrod.impl.RemoteCacheImpl.put(RemoteCacheImpl.java:166)
	at org.infinispan.CacheSupport.put(CacheSupport.java:28)
	at org.jboss.smartfrog.edg.adapter.HotRodAdapter$HotRodRemoteCacheAdapter.put(HotRodAdapter.java:150)
	at org.jboss.smartfrog.edg.loaddriver.CacheRequestProcessorFactory$InfinispanRequestProcessor.processRequest(CacheRequestProcessorFactory.java:163)
	at org.jboss.smartfrog.loaddriver.CompoundRequestProcessorFactoryImpl$CompoundRequestProcessor.processRequest(CompoundRequestProcessorFactoryImpl.java:51)
	at org.jboss.smartfrog.loaddriver.Runner.run(Runner.java:87)
	at java.lang.Thread.run(Thread.java:619)
{code}

Comment 1 Radoslav Husar 2011-03-08 14:49:04 UTC
Link: Added: This issue is incorporated by JBPAPP-6054


Comment 2 Galder Zamarreño 2011-03-08 15:21:13 UTC
No idea

Comment 3 Galder Zamarreño 2011-03-16 14:48:40 UTC
Rado, it's difficult to say what the cause of this is. The first I'd try doing, if the failure can be replicated easily, would be to generate some TRACE logs for both the client and server. We could start with having:

For the client side: TRACE on org.infinispan.client
For the server side: TRACE on org.infinispan.server

Comment 4 Radoslav Husar 2011-03-16 15:13:14 UTC
Yeah, its easy to reproduce. I will turn on the logging but it will be very verbose. I belive you can easily reproduce by starting few nodes, start hotrod client and kill a node.

Comment 5 Radoslav Husar 2011-03-22 09:52:58 UTC
With TRACE logging, the preceeding event (client side) to this issue is:

Mar 22, 2011 5:49:08 AM org.infinispan.client.hotrod.impl.operations.HotRodOperation readHeaderAndValidate
SEVERE: Invalid magic number. Expected a1 and received 72


Comment 6 Galder Zamarreño 2011-03-22 10:11:06 UTC
Hmmm, that means little to me. Could you send me the TRACE log for the client and can you make sure you use standard log formats used in AS? i.e.

{code:xml}<layout class="org.apache.log4j.PatternLayout">
  <!-- The full pattern: Date MS Priority [Category] (Thread:NDC) Message\n -->
  <param name="ConversionPattern" value="%d %-5r %-5p [%c] (%t:%x) %m%n"/>
</layout>{code}


Comment 7 Radoslav Husar 2011-03-22 11:31:19 UTC
I know its a problem, I have a workaround in mind I will try now.

Update: due to infrastructure problems JBQA-4431 we wont be able to provide this logging on *client side* atm.

Comment 10 Michal Linhard 2011-03-31 11:47:03 UTC
I started to see a lot of these when tried to switch to changed hotrod client (adapted TcpTransportFactory) see JBQA-4479.

http://hudson.qa.jboss.com/hudson/view/EDG/job/edg-51x-stress-client-size4-hotrod/20/console-perf01/consoleText
(beware: it is a big output)

Comment 11 Michal Linhard 2011-03-31 11:56:39 UTC
here I fixed a problem in smartfrog, which leads to much less garbage in the output, but you can still see the problems:
http://hudson.qa.jboss.com/hudson/view/EDG/job/edg-51x-stress-client-size4-hotrod/21/console-perf01/consoleText

Comment 12 Michal Linhard 2011-03-31 15:03:44 UTC
miraculously I managed to reproduce this on my localhost using 100 threads against two EDG instances. I was able to reproduce it only with the first run of the testcase after fresh boot of both instances.

the testcase: testHotRodDisableSmartRouting() in https://svn.devel.redhat.com/repos/jboss-qa/load-testing/sf-components/project-edg/trunk/src/test/java/org/jboss/smartfrog/edg/adapter/BasicAdapterManualTest.java

the trace log: attached hotrod-client.zip

Comment 13 Michal Linhard 2011-03-31 15:03:44 UTC
Attachment: Added: hotrod-client.zip


Comment 14 Galder Zamarreño 2011-03-31 15:10:28 UTC
Wicked Michal. I'll have a look to them once I've fixed JBPAPP-6113. It could well be the case that the problem with both this and JBPAPP-6113, originate at the same place but can't tell for sure yet.

Comment 15 Michal Linhard 2011-04-01 08:16:36 UTC
Link: Added: This issue is a dependency of JBQA-4479


Comment 16 Galder Zamarreño 2011-04-04 08:47:00 UTC
Michal, do you have the corresponding TRACE logs for the two EDG servers? That was key for JBPAPP-6113 and I think it's key here as well. In fact, JBPAPP-6113 could have solved this issue as well cos I see several "Invalid message id" messages too. So, if after using the new snapshot uploaded last Thursday you see this issue again, please make sure you upload both the client and server TRACE logs.

Comment 17 Michal Linhard 2011-04-04 08:56:37 UTC
OK.

Comment 18 Michal Linhard 2011-04-04 09:43:39 UTC
The issue disappeared in my runs, when I ran with the new snapshot(20110331.163428) containing fix for JBPAPP-6113.

Comment 19 Galder Zamarreño 2011-04-04 14:08:02 UTC
Hmmm, Ok. Let's be optimistic and mark it as resolved and if it appears again, just reopen with some TRACE logs if possible :)

Comment 20 Galder Zamarreño 2011-04-04 14:08:35 UTC
Link: Added: This issue is related to JBPAPP-6113


Comment 22 Galder Zamarreño 2011-05-03 09:52:20 UTC
Rado, let's close this cos this JIRA was for "Invalid Magic Number", and let's focus on JBPAPP-6113

Comment 23 Michal Linhard 2011-05-03 10:06:14 UTC
note: rado is on PTO May 2-10 and 18-27

galder, we also saw "Invalid magic number" problem in the logs from JBPAPP-6113.

but I'm OK with treating them as one issue, because they seem to be connected...



Note You need to log in before you can comment on or make changes to this bug.