826602 – Connections hang daily in hosted Candlepin

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	Candlepin
Classification:	Community
Component:	candlepin
Sub Component:
Version:	0.9
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Target Release:	---
Assignee:	Chris Duryee
QA Contact:	Eric Sammons
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	826663

Reported:	2012-05-30 15:11 UTC by Samuel Munilla
Modified:	2015-05-14 14:53 UTC
CC List:	2 users
Fixed In Version:
Clone Of:
Clones:	826663
Environment:
Last Closed:	2012-05-30 19:47:34 UTC
Embargoed:

Description Samuel Munilla 2012-05-30 15:11:26 UTC

Description of problem: The Tomcat instances of Candlepin intermittently hang, causing all requests to fail. This is likely caused by c3p0.


Version-Release number of selected component (if applicable): c3p0-0.9.0


How reproducible: Happens in QA and Stage daily.


Steps to Reproduce: No known steps; the issue does not appear to be reproducible on demand.

Actual results:

The Ruby tier triggers a Nagios alert, and the following is found in its logs:

"Exception PartialOutageException: Could not find shard for XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX"

In the Candlepin logs, among other errors, we see:

"Caused by: com.mysql.jdbc.exceptions.jdbc4.CommunicationsException: The last packet successfully received from the server was 34,864,978 milliseconds ago.  The last packet sent successfully to the server was 34,864,979 milliseconds ago. is longer than the server configured value of 'wait_timeout'. You should consider either expiring and/or testing connection validity before use in your application, increasing the server configured values for client timeouts, or using the Connector/J connection property 'autoReconnect=true' to avoid this problem."
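For scale, the idle time in that message can be compared against a server timeout. The sketch below is illustrative only: it assumes MySQL's documented default wait_timeout of 28800 seconds (8 hours), which may not match how the hosted servers are actually configured, and the class and method names are hypothetical.

```java
import java.time.Duration;

public class IdleTimeCheck {
    // Assumption: MySQL's default wait_timeout of 28800 seconds (8 hours).
    // The hosted database may be configured with a different value.
    static final Duration ASSUMED_WAIT_TIMEOUT = Duration.ofSeconds(28_800);

    /** Returns true if a connection idle for idleMillis has been idle longer than waitTimeout. */
    static boolean exceedsWaitTimeout(long idleMillis, Duration waitTimeout) {
        return Duration.ofMillis(idleMillis).compareTo(waitTimeout) > 0;
    }

    public static void main(String[] args) {
        // Idle time reported in the CommunicationsException above.
        long idleMillis = 34_864_978L;
        Duration idle = Duration.ofMillis(idleMillis);

        // Break the idle time into whole hours and leftover minutes.
        System.out.printf("idle for %d h %d min%n", idle.toHours(), idle.toMinutes() % 60);
        System.out.println("exceeds assumed wait_timeout: "
                + exceedsWaitTimeout(idleMillis, ASSUMED_WAIT_TIMEOUT));
    }
}
```

Under that assumption, the pooled connection sat unused for roughly 9 hours 41 minutes, well past an 8 hour timeout, so the server would have closed it before the pool handed it out again.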

Additional info: Our current version of c3p0 dates from 2005.

Comment 1 Chris Duryee 2012-05-30 15:23:03 UTC

Sam,

Is 0.9.1.2 ok? That seems to be the latest version.

Also, how difficult is it to reproduce this issue?

Comment 2 Samuel Munilla 2012-05-30 15:36:02 UTC

0.9.1.2 works. 

The issue crops up daily in QA and Stage. I don't think we can reproduce it on demand.

Comment 3 Samuel Munilla 2012-05-30 15:38:14 UTC

(In reply to comment #1)
> Sam,
> 
> Is 0.9.1.2 ok? That seems to be the latest version.
> 
> Also, how difficult is it to reproduce this issue?

Comment 4 Alex Wood 2012-05-30 19:47:34 UTC

Fixed in branch 0.5 with 19dac38c215768ba3f7e244f5d7f8f62da396f6e

Released with candlepin-0.5.33-1
