Bug 1106393
Summary: | Managed server shutdown unexpectedly when timeout during connection request to HC | |||
---|---|---|---|---|
Product: | [JBoss] JBoss Enterprise Application Platform 6 | Reporter: | Takayoshi Kimura <tkimura> | |
Component: | Domain Management | Assignee: | John Allen <joallen> | |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Petr Kremensky <pkremens> | |
Severity: | urgent | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 6.2.3 | CC: | brian.stansberry, dandread, federico, hnaram, jlee, joallen, kkhan, klape, pyadav, sjadhav, wfink, yqu | |
Target Milestone: | DR1 | |||
Target Release: | EAP 6.4.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | Bug Fix | ||
Doc Text: |
In a previous version of JBoss EAP 6, after a managed server's connection to it's Host Controller failed, it would only make a single re-connection attempt.
This could cause the product to shut down unexpectedly if the re-connection failed.
In this release, connections to the Host Controller are re-tried indefinitely. Server instances no longer shut down due to loss of connection to the Host Controller.
|
Story Points: | --- | |
Clone Of: | ||||
: | 1140453 1153383 1186949 (view as bug list) | Environment: | ||
Last Closed: | Type: | Bug | ||
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1140453, 1153383, 1186949 |
Description
Takayoshi Kimura
2014-06-09 08:52:49 UTC
I have a customer hit this issue again, I extract some log as [1], this may helpful for resolving the issue. The log [1] contain the following info: 1. Server almost exhausted, from 15:04:15 to 15:11:30, more than 7 minutes no log output, this may caused by gc, OS level issue, VM hypervisor issues(customer use VMware virtual platform) 2. Server no shut down log output after 15:14:12, but the Server exit time should be 15:18:12, we can find evidence from PC log 3. HC hit Read timed out at 15:10:01, at the same time Server keep stuck as step 1 4. PC monitor Server exit, receive server exit at 15:18:12, this hints Server exit at 15:18:12 [1] https://github.com/kylinsoong/wildfly-samples/blob/master/domain/bug-1106393-log.md Which JVM version are you using? Apparently using JDK 7 was helping with this issues, we are going to fix this though in a future release. Merged upstream PR: https://github.com/wildfly/wildfly-core/pull/150 Upstream commit: https://github.com/wildfly/wildfly-core/commit/70e2286fa6e737df2c4daa5b7f2330a8bd6d43fb Adding a new case 01183081 to the list of impacted customers. Thanks Server process now waits until the host-controller is available again and reconnects. Verified on EAP 6.4.0.DR3 Good morning, we analyzed similar problem on our EAP 6 installation and we found a correlation bewteen high swap usage and HC-DC Disconnection. Full GC on a low memory machine cause this problems. Solution, upgrade ram of machine. |