Description of problem: The Tomcat instances of Candlepin seem to intermittently hang and cause all requests to fail. This seems to likely be cause by c3p0. Version-Release number of selected component (if applicable): c3p0-0.9.0 How reproducible: Happens in QA and Stage daily. Steps to Reproduce: 1. 2. 3. Actual results: Expected results: The Ruby tier triggers a nagios alert and the following is found in the logs: "Exception PartialOutageException: Could not find shard for XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX" In the Candlepin logs, among other errors, we see: "Caused by: com.mysql.jdbc.exceptions.jdbc4.CommunicationsException: The last packet successfully received from the server was 34,864,978 milliseconds ago. The last packet sent successfully to the server was 34,864,979 milliseconds ago. is longer than the server configured value of 'wait_timeout'. You should consider either expiring and/or testing connection validity before use in your application, increasing the server configured values for client timeouts, or using the Connector/J connection property 'autoReconnect=true' to avoid this problem." Additional info: Our current version of c3p0 is from 2005
Sam, Is 0.9.1.2 ok? That seems to be the latest version. Also, how difficult is it to reproduce this issue?
0.9.1.2 works. The issue crops up daily in QA and Stage. I don't think we can reproduce it on demand.
(In reply to comment #1) > Sam, > > Is 0.9.1.2 ok? That seems to be the latest version. > > Also, how difficult is it to reproduce this issue?
Fixed in branch 0.5 with 19dac38c215768ba3f7e244f5d7f8f62da396f6e Released with candlepin-0.5.33-1