1106393 – Managed server shutdown unexpectedly when timeout during connection request to HC

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	JBoss Enterprise Application Platform 6
Classification:	JBoss
Component:	Domain Management
Sub Component:
Version:	6.2.3
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	urgent
Target Milestone:	DR1
Target Release:	EAP 6.4.0
Assignee:	John Allen
QA Contact:	Petr Kremensky
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1140453 1153383 1186949

Reported:	2014-06-09 08:52 UTC by Takayoshi Kimura
Modified:	2020-03-11 14:50 UTC
CC List:	12 users
Fixed In Version:
Clone Of:
Clones:	1140453 1153383 1186949
Environment:
Last Closed:
Type:	Bug
Embargoed:

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Issue Tracker	WFLY-3468	0	Minor	Closed	Managed server shutdown unexpectedly when timeout during connection request to HC	2018-10-26 18:27:46 UTC
Red Hat Knowledge Base (Solution)	744793	0	None	None	None	Never

Description Takayoshi Kimura 2014-06-09 08:52:49 UTC

Sometimes a managed server shuts down unexpectedly when it tries to connect to the HC and hits a lengthy Full GC.

java.io.IOException: JBAS012175: Channel closed
	at org.jboss.as.server.mgmt.domain.HostControllerConnection.getChannel(HostControllerConnection.java:101)
	at org.jboss.as.protocol.mgmt.ManagementChannelHandler.executeRequest(ManagementChannelHandler.java:117)
	at org.jboss.as.protocol.mgmt.ManagementChannelHandler.executeRequest(ManagementChannelHandler.java:100)
	at org.jboss.as.server.mgmt.domain.HostControllerConnection.reConnect(HostControllerConnection.java:171)
	at org.jboss.as.server.mgmt.domain.HostControllerClient.reconnect(HostControllerClient.java:98)
	at org.jboss.as.server.DomainServerMain.main(DomainServerMain.java:138)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.jboss.modules.Module.run(Module.java:292)
	at org.jboss.modules.Main.main(Main.java:455)

The current DomainServerMain exits on the connection error.

The scenario is:

1. The server tries to connect to the HC.
2. The server goes into a Full GC.
3. The HC, waiting for a request from the server, hits a read timeout and closes the socket.
4. The server resumes from GC, fails to send the request to the HC, and exits.
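
The four steps above can be sketched with plain sockets. This is only an illustration, not the EAP management protocol: the class and method names are hypothetical, `Thread.sleep` stands in for the Full GC pause, and `setSoTimeout` stands in for the HC read timeout.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;

// Hypothetical demo class; it does not use any JBoss/WildFly API.
final class PauseTimeoutDemo {

    /**
     * Returns true if the "server" finds its connection closed by the "HC"
     * after pausing for pauseMs, false if the request still gets a reply.
     */
    public static boolean connectionLostAfterPause(int hcReadTimeoutMs, long pauseMs) {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        try (ServerSocket hc = new ServerSocket(0, 1, loopback)) {
            hc.setSoTimeout(5000); // do not wait forever in accept()
            Thread hcThread = new Thread(() -> {
                try (Socket s = hc.accept()) {
                    s.setSoTimeout(hcReadTimeoutMs);
                    s.getInputStream().read();    // step 3: wait for the request
                    s.getOutputStream().write(1); // request arrived in time: reply
                } catch (IOException e) {
                    // Read timed out; try-with-resources closes the socket.
                }
            });
            hcThread.start();
            try (Socket server = new Socket(loopback, hc.getLocalPort())) { // step 1
                server.setSoTimeout(5000);
                Thread.sleep(pauseMs);                        // step 2: simulated pause
                server.getOutputStream().write(42);           // step 4: send the request
                return server.getInputStream().read() == -1;  // EOF: peer closed
            } catch (SocketTimeoutException e) {
                return false; // no reply, but no evidence of a close either
            } catch (IOException e) {
                return true;  // e.g. connection reset after the peer closed
            } finally {
                hcThread.join();
            }
        } catch (IOException | InterruptedException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Calling `connectionLostAfterPause(100, 500)` models a pause longer than the read timeout, while `connectionLostAfterPause(2000, 0)` models the normal case without a pause.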

Comment 2 kylin 2014-07-10 07:17:22 UTC

I have a customer who hit this issue again. I extracted some logs as [1]; this may be helpful for resolving the issue.

The log [1] contains the following info:

1. The server was almost exhausted: from 15:04:15 to 15:11:30, more than 7 minutes, there was no log output. This may have been caused by GC, an OS-level issue, or a VM hypervisor issue (the customer uses a VMware virtual platform).

2. The server has no shutdown log output after 15:14:12, but the server exit time should be 15:18:12; we can find evidence of this in the PC log.

3. The HC hit "Read timed out" at 15:10:01, while the server was still stuck as described in step 1.

4. The PC monitors server exits and received the server exit at 15:18:12, which suggests the server exited at 15:18:12.


[1] https://github.com/kylinsoong/wildfly-samples/blob/master/domain/bug-1106393-log.md

Comment 3 Emanuel Muckenhuber 2014-07-10 09:46:29 UTC

Which JVM version are you using? Apparently using JDK 7 helps with this issue; we are going to fix it in a future release, though.

Comment 8 James Livingston 2014-09-03 22:56:50 UTC

Merged upstream PR: https://github.com/wildfly/wildfly-core/pull/150
Upstream commit: https://github.com/wildfly/wildfly-core/commit/70e2286fa6e737df2c4daa5b7f2330a8bd6d43fb

Comment 13 Patrick 2014-09-19 09:23:12 UTC

Adding a new case 01183081 to the list of impacted customers.

Thanks

Comment 14 Petr Kremensky 2014-10-07 08:29:05 UTC

Server process now waits until the host-controller is available again and reconnects.
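
The behaviour described above (wait and reconnect instead of exiting) can be illustrated with a generic retry loop. This is a minimal sketch under stated assumptions, not the code from the upstream commit: `ReconnectLoop`, `Connector` and the backoff parameters are hypothetical names chosen for the example.

```java
import java.io.IOException;

// Hypothetical helper; the real fix lives in the upstream wildfly-core commit.
final class ReconnectLoop {

    /** A single connection attempt that may fail with an IOException. */
    public interface Connector {
        void connect() throws IOException;
    }

    /**
     * Keeps retrying until connect() succeeds, sleeping between attempts
     * with a delay that doubles up to maxDelayMs. Returns the attempt count.
     */
    public static int reconnectUntilAvailable(Connector connector,
                                              long initialDelayMs,
                                              long maxDelayMs) {
        int attempts = 0;
        long delay = initialDelayMs;
        while (true) {
            attempts++;
            try {
                connector.connect();
                return attempts; // connected: stop retrying
            } catch (IOException e) {
                // Connection failed (e.g. "Channel closed"): wait, then retry
                // instead of exiting the process.
                try {
                    Thread.sleep(delay);
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("interrupted while waiting to reconnect", ie);
                }
                delay = Math.min(delay * 2, maxDelayMs);
            }
        }
    }
}
```

The loop deliberately has no attempt limit, mirroring "waits until the host-controller is available again"; a production version would likely also need a shutdown signal to break out of it.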

Verified on EAP 6.4.0.DR3

Comment 18 Federico Bellizia 2017-03-22 13:59:10 UTC

Good morning, 
     we analyzed a similar problem on our EAP 6 installation and found a correlation between high swap usage and HC-DC disconnection.

A Full GC on a low-memory machine causes this problem.

Solution: upgrade the machine's RAM.
