Bug 1026428 - RFE: Ship with RHQ server tuned for higher capacity
Product: JBoss Operations Network
Classification: JBoss
Component: Documentation
JON 3.2
Hardware: Unspecified, OS: Unspecified
Priority: unspecified, Severity: high
: GA
: JON 3.2.0
Assigned To: Deon Ballard
Mike Foley
: Documentation
Depends On: 1025844
Blocks: 1012435
Reported: 2013-11-04 10:52 EST by Mike Foley
Modified: 2014-09-05 11:40 EDT

Doc Type: Bug Fix
Clone Of: 1025844
Last Closed: 2014-09-05 11:40:24 EDT
Type: Bug

Description Mike Foley 2013-11-04 10:52:04 EST
+++ This bug was initially created as a clone of Bug #1025844 +++

Description of problem:

When used with more than a hundred (or a thousand) agents, the RHQ server cannot handle enough load out of the box to complete an upgrade cleanly.

This scale may not be typical for customers, but adjusting the defaults seems unlikely to do much harm even on smaller installations.

Although I cannot identify what is most important, the following need to be tuned:

1) Increase the default memory allocated to the storage node. For about 1000 nodes, I would say around 5GB of heap memory for Cassandra is good, though I think the installer should simply pick a good number based on the amount of free memory on the local machine.

Example error:

01:44:14,249 ERROR [org.jboss.as.ejb3.invocation] (http-/ JBAS014134: EJB Invocation failed on component ResourceManagerBean for method public abstract void org.rhq.enterprise.server.resource.ResourceManagerLocal.addResourceError(org.rhq.core.domain.resource.ResourceError): javax.ejb.EJBException: JBAS014516: Failed to acquire a permit within 5 MINUTES
        at org.jboss.as.ejb3.pool.strictmax.StrictMaxPool.get(StrictMaxPool.java:109) [jboss-as-ejb3-7.2.0.Alpha1-redhat-4.jar:7.2.0.Alpha1-redhat-4]
        at org.jboss.as.ejb3.component.pool.PooledInstanceInterceptor.processInvocation(PooledInstanceInterceptor.java:47) [jboss-as-ejb3-7.2.0.Alpha1-redhat-4.jar:7.2.0.Alpha1-redhat-4]

2) Increase the size of the EJB pool. During the 4.5.1 -> 4.9 upgrade, the number of inventory requests rose substantially in a short time, which caused a large number of timeouts.
    <strict-max-pool name="slsb-strict-max-pool" max-pool-size="2000" instance-acquisition-timeout="1" instance-acquisition-timeout-unit="MINUTES"/>

3) Increase the out-of-box communication limits:


Version-Release number of selected component (if applicable): 4.9 (from 4.5.1)
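
For illustration only, the first two tuning items above could be sketched roughly as follows. This is not what the RHQ installer does: the function names, the 50% fraction of free memory, and the 512 MB floor are all assumptions made up for this sketch. Only the 5GB cap and the slsb-strict-max-pool attributes come from the description.

```python
import xml.etree.ElementTree as ET

MB_PER_GB = 1024


def suggest_heap_mb(free_mb, floor_mb=512, cap_mb=5 * MB_PER_GB, fraction=0.5):
    """Suggest a storage node heap size (MB) from local free memory.

    Hypothetical rule: take a fraction of free memory, never below
    floor_mb and never above cap_mb (about 5GB, the estimate given in
    the description for roughly 1000 nodes).
    """
    if free_mb <= 0:
        raise ValueError("free_mb must be positive")
    return int(max(floor_mb, min(cap_mb, free_mb * fraction)))


def set_pool_size(xml_text, pool_name, max_pool_size):
    """Return xml_text with max-pool-size updated on the named strict-max-pool.

    Tags are matched by local name so the sketch works with or without
    an XML namespace. Note that ElementTree may rewrite namespace
    prefixes when it serializes a namespaced document.
    """
    root = ET.fromstring(xml_text)
    changed = 0
    for elem in root.iter():
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name == "strict-max-pool" and elem.get("name") == pool_name:
            elem.set("max-pool-size", str(max_pool_size))
            changed += 1
    if changed == 0:
        raise LookupError("no strict-max-pool named %r" % pool_name)
    return ET.tostring(root, encoding="unicode")
```

A caller would pass the free memory reported by the operating system to suggest_heap_mb, and the text of the server's EJB3 subsystem configuration to set_pool_size. Where those values come from, and which file holds the pool definition, is left open here.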

--- Additional comment from Mike Foley on 2013-11-01 14:00:14 EDT ---

At a minimum, this should be considered for documentation in JON 3.2.
Comment 1 Deon Ballard 2014-01-24 12:42:23 EST
This is covered in a section for tuning the server for a large number of agents:

A soft limit of 100+ agents as "a large number" is mentioned in the inventory baselines:
