Description of problem:
There are many established connections from the RHQ server to Cassandra (port 9142), which results in many open files. The user running the server then hits its ulimit for open files, so, for example, logging in to the machine over ssh as that user no longer works.
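One rough way to confirm the connection count is to parse the kernel's TCP table. The sketch below is only an illustration, assuming a Linux host where /proc/net/tcp is readable. The function name count_established is made up for this example. It counts connections system-wide rather than per process, so it is only a proxy for what the RHQ server itself holds open.

```python
import os

ESTABLISHED = "01"  # TCP_ESTABLISHED state code as printed in /proc/net/tcp


def count_established(proc_net_tcp_text, remote_port):
    """Count ESTABLISHED entries whose remote port equals remote_port.

    proc_net_tcp_text is the full text of /proc/net/tcp (or tcp6).
    Addresses are printed as HEXIP:HEXPORT, so the port is parsed as base 16.
    """
    count = 0
    for line in proc_net_tcp_text.splitlines()[1:]:  # first line is the header
        fields = line.split()
        if len(fields) < 4:
            continue
        remote, state = fields[2], fields[3]
        port = int(remote.rsplit(":", 1)[1], 16)
        if state == ESTABLISHED and port == remote_port:
            count += 1
    return count


if __name__ == "__main__":
    total = 0
    for path in ("/proc/net/tcp", "/proc/net/tcp6"):
        if os.path.exists(path):
            with open(path) as f:
                total += count_established(f.read(), 9142)
    print("established connections to port 9142:", total)
```

Running this every few minutes and comparing the totals should show whether the number of client connections to port 9142 keeps growing.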
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Install and start RHQ (rhqctl install, rhqctl start)
2. Import resources and keep the server running
3. Periodically check the number of open files for the RHQ server process (lsof -p <rhqServerPID> | wc -l)
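The periodic check in step 3 can be automated. The following is a minimal sketch, assuming a Linux host and enough permissions to read /proc/<pid>/fd. All function names and the threshold are hypothetical. Note that it counts file descriptors only, so the numbers will be lower than the lsof line count, which also lists memory-mapped files and similar entries.

```python
import os
import sys
import time


def count_open_fds(pid):
    """Number of open file descriptors of a process (Linux /proc only)."""
    return len(os.listdir(f"/proc/{pid}/fd"))


def fd_growth(samples):
    """Net change between the first and the last sample (0 if fewer than two)."""
    if len(samples) < 2:
        return 0
    return samples[-1] - samples[0]


def looks_like_leak(samples, threshold=100):
    """Heuristic: net growth of at least `threshold` descriptors (arbitrary value)."""
    return fd_growth(samples) >= threshold


def monitor(pid, interval_s=60, rounds=10):
    """Sample the descriptor count `rounds` times, `interval_s` seconds apart."""
    samples = []
    for i in range(rounds):
        samples.append(count_open_fds(pid))
        print(time.strftime("%H:%M:%S"), samples[-1])
        if i < rounds - 1:
            time.sleep(interval_s)
    return samples


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python fdwatch.py <rhqServerPID>
    collected = monitor(int(sys.argv[1]))
    print("net growth:", fd_growth(collected),
          "possible leak:", looks_like_leak(collected))
```

A steadily positive net growth across runs on an otherwise idle server would match the behaviour described in this report.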
Actual results:
The number of open files for the RHQ server process keeps increasing until it hits the ulimit for the given user.
Expected results:
The number of open files does not increase over time.
Additional info:
After a clean installation there are 1202 open files (lsof -p <serverPID> | wc -l).
This number keeps increasing, so within a few hours the ulimit for open files is hit.
No exceptions in server.log
This issue was introduced in the following RHQ build: http://hudson.qa.jboss.com/hudson/view/RHQ/job/rhq-master-gwt-locales/1570/
time: 2015-10-05 15:41:16 +0200
author: Libor Zoubek - email@example.com
message: Bug 1234912 - Do not authenticate against new storage node when
replication_factor of system_auth keyspace is wrong
Correctly close storage cluster session and fix scheduling
interval of job