Bug 1015098
Summary: | Management operations on slave host will corrupt preceding commands (CLI - batch) | ||||||
---|---|---|---|---|---|---|---|
Product: | [JBoss] JBoss Enterprise Application Platform 6 | Reporter: | Petr Kremensky <pkremens> | ||||
Component: | Domain Management | Assignee: | Emanuel Muckenhuber <emuckenh> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Petr Kremensky <pkremens> | ||||
Severity: | high | Docs Contact: | Russell Dickenson <rdickens> | ||||
Priority: | unspecified | ||||||
Version: | 6.2.0 | CC: | brian.stansberry, dandread, emuckenh, myarboro | ||||
Target Milestone: | CR1 | ||||||
Target Release: | EAP 6.2.0 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2013-12-15 16:17:18 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Petr Kremensky
2013-10-03 12:55:55 UTC
I can't reproduce this with the code that will become EAP 6.2 ER5. Following the steps indicated, there is some problem with the servers on the slave starting. Lots of messages like this, and the servers never complete start. [Server:server-two] 19:55:28,414 WARN [org.hornetq.core.server] (Thread-1 (HornetQ-server-HornetQServerImpl::serverUUID=0742d2b4-2d45-11e3-97e7-5ded94e6bdd7-1382700530)) HQ222137: Unable to announce backup, retrying I suspect this is something to do with a conflict between the server in host-slave.xml vs those in the default host.xml. I don't see the problem when master uses host-master.xml. In any case it's a separate issue from this BZ. Here's what I get when I execute the CLI commands: [domain@localhost:9999 /] batch [domain@localhost:9999 / #] /profile=test:add #1 /profile=test:add [domain@localhost:9999 / #] /host=taozi.local/server-config=server-one:restart #2 /host=taozi.local/server-config=server-one:restart [domain@localhost:9999 / #] r read-attribute read-operation reload remove-batch-line rollout-plan run-batch [domain@localhost:9999 / #] run-batch {"host-failure-descriptions" => {"taozi.local" => {"JBAS014653: Composite operation failed and was rolled back. Steps that failed:" => {"Operation step-2" => "JBAS010946: Cannot restart server server-one as it is not currently started; it is STARTING"}}}} Proper error there, and when I check for the 'test' profile on both hosts, it does not exist and can be added. The slave servers can also be stopped via the CLI. It's just "restart" that doesn't work, which is valid. When I use host-master.xml on the master, avoiding the server start completion issue, the batch completes successfully. In a unit test I wrote using the configs used in the testsuite/domain tests, a composite operation that matches what the batch produces succeeds. In that test the master has servers on it as well, but the conflict with the slave servers mentioned above does not occur. I'm sure there are some differences in the master's host config or in domain.xml that account for that. When ER5 comes out, I'm interested whether you get equivalent results. Created attachment 809326 [details]
Issue reproduced on RHEL 6.4 with 6.2.0.ER5
I am getting same results also with ER5 see attachment 809326 [details]. Also, I get the same result if I use host-master.xml for DC.
Emanuel Muckenhuber <emuckenh> updated the status of jira WFLY-2410 to Resolved This issue was verified using the 6.2.0.CR1 preview bits. |