Bug 1002210
| Summary: | Too many exceptions in server.log | ||
|---|---|---|---|
| Product: | [Other] RHQ Project | Reporter: | Armine Hovsepyan <ahovsepy> |
| Component: | Core Server | Assignee: | Nobody <nobody> |
| Status: | ON_QA --- | QA Contact: | |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 4.9 | CC: | hrupp |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | Bug Fix |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | | Type: | Bug |
| Bug Depends On: | 1019807, 1019841 | ||
| Bug Blocks: | 951619 | ||
Description
Armine Hovsepyan
2013-08-28 15:47:09 UTC

Some of the exceptions in the server log are due to bug 1002238. Others, such as com.datastax.driver.core.exceptions.UnavailableException, can occur when reading or writing metrics while a node is being added to or removed from the cluster and the cluster is being rebalanced. com.datastax.driver.core.exceptions.NoHostAvailableException is thrown when we try to read or write metrics while the storage cluster is down. Both are RuntimeExceptions, and they are getting wrapped in an EJBException, which results in a much larger stack trace than necessary.

The following will help clean things up a bit. I will add a new StorageException class that wraps those C* exceptions and make it an application exception. Then we will get a stack trace that does not include all of the internal container calls. This will help a lot with debugging.

I have made some changes to reduce the noise in server.log. From my commit message:

There were some methods in MeasurementDataManagerBean with default transaction support, but they should be NOT_SUPPORTED since they read from and write to Cassandra. This will help reduce stack traces because when exceptions bubble up from those methods they will no longer get wrapped in EJBExceptions.

When an error occurs while inserting raw data, we are no longer logging the full exception. There is a better than likely chance that if an exception occurs for one write, it will occur for several. Logging each of the exceptions resulted in a lot of noise in the logs. Now only the error message is logged. The full exception will be logged with DEBUG logging.

master commit hash: 98c76cebf

These changes should be in build 2596 of the rhq-master job.

update: new time-out exceptions in server.log (http://pastebin.test.redhat.com/161393); will update the bug as soon as reproduced.

The description lists a few different exceptions. As I mentioned in comment 1, one of the exceptions is related to bug 1002238. The other exceptions are addressed by commit 98c76cebf, cited in comment 2. The error cited in comment 3 is unrelated, and I would rather call that out in a separate BZ if necessary. I do not want this BZ to become a catch-all bucket for errors that appear in the server log.

I have opened bug 1003191 to track the issue cited in comment 3.
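
For readers following along, here is a minimal sketch of the two ideas discussed in this bug: wrapping driver failures in a StorageException, and logging only the error message unless DEBUG logging is on. It is not the code from commit 98c76cebf. The class name StorageExceptionSketch, the methods storeOne, insertRawData and writeRawMetric, and the simulated failure are all hypothetical, and java.util.logging (with FINE standing in for DEBUG) is used only to keep the example free of dependencies. In the server the wrapper would presumably also carry the javax.ejb.ApplicationException annotation, and the bean methods would be marked with TransactionAttributeType.NOT_SUPPORTED; both are shown only as comments here.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

public class StorageExceptionSketch {

    private static final Logger LOG =
        Logger.getLogger(StorageExceptionSketch.class.getName());

    // Hypothetical wrapper for storage (C*) failures. Assumption: inside an EJB
    // container it would also be annotated with @javax.ejb.ApplicationException
    // so the container passes it through instead of wrapping it in an EJBException.
    public static class StorageException extends RuntimeException {
        public StorageException(String message, Throwable cause) {
            super(message, cause);
        }
    }

    // Stand-in for a driver call. A negative id simulates a failure such as
    // NoHostAvailableException; the real driver is not used in this sketch.
    static void writeRawMetric(int scheduleId, double value) {
        if (scheduleId < 0) {
            throw new IllegalStateException("no storage node available");
        }
    }

    // Assumption: in the bean this method would be marked
    // @TransactionAttribute(TransactionAttributeType.NOT_SUPPORTED).
    public static void storeOne(int scheduleId, double value) {
        try {
            writeRawMetric(scheduleId, value);
        } catch (RuntimeException driverError) {
            // Wrap the low-level failure, keeping the original as the cause.
            throw new StorageException("Failed to write raw data for schedule "
                + scheduleId + ": " + driverError.getMessage(), driverError);
        }
    }

    // Writes every value and returns the number of failed writes.
    public static int insertRawData(int[] scheduleIds, double value) {
        int failures = 0;
        for (int id : scheduleIds) {
            try {
                storeOne(id, value);
            } catch (StorageException e) {
                failures++;
                // Only the message at the normal level, to keep the log quiet.
                LOG.warning(e.getMessage());
                // Full stack trace only when fine-grained (debug) logging is on.
                LOG.log(Level.FINE, "Full exception for failed raw data write", e);
            }
        }
        return failures;
    }

    public static void main(String[] args) {
        int failures = insertRawData(new int[] {1, -1, 2, -2}, 42.0);
        System.out.println("failed writes: " + failures);
    }
}
```

With the default java.util.logging configuration, each failed write produces a single warning line and no stack trace; raising the logger and handler level to FINE brings the full trace back.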