Bug 1285341

Summary: After a certain amount of time the EventCatcher worker (thread) is stopped and deleted
Product: Red Hat CloudForms Management Engine
Reporter: Daniel Korn <dkorn>
Component: Providers
Assignee: Federico Simoncelli <fsimonce>
Status: CLOSED ERRATA
QA Contact: Dafna Ron <dron>
Severity: urgent
Docs Contact:
Priority: urgent
Version: 5.5.0
CC: atal, cpelland, dron, fsimonce, gblomqui, jfrey, jhardy, jvlcek, obarenbo, simaishi
Target Milestone: GA
Keywords: Reopened
Target Release: 5.5.0
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version: 5.5.0.13
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2015-12-08 13:50:05 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description Daniel Korn 2015-11-25 12:27:52 UTC
Description of problem:
After a certain amount of time, the EventCatcher thread is stopped and deleted.
From evm.log it is clear that the Event Monitor thread is gone and that a restart is attempted every second ("Event Monitor Thread gone. Restarting...").

Version-Release number of selected component (if applicable):
cfme-5.5.0

How reproducible:
100%

Steps to Reproduce:
1. Add an OpenShift provider and refresh it.
2. Wait for the EventCatcher worker to start, verify that its status is "started" ($ bundle exec rake evm:status), and confirm that it begins collecting events.
3. Wait several hours (that is how long it took for me) and check that the worker is dead: this is easy to spot in evm.log, and the worker no longer appears in the workers table ($ bundle exec rake evm:status).
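
The evm.log check in step 3 can be scripted. The following is only a minimal sketch: it matches the restart message quoted in the description as a plain substring, because the rest of the log line format is not shown in this report. The function names and the log path passed to scan_log are hypothetical and should be adjusted to the appliance.

```python
from typing import Iterable

# Message quoted in the bug description. Nothing else about the log line
# format is known here, so matching is done on this substring only.
RESTART_MARKER = "Event Monitor Thread gone. Restarting..."


def count_restart_attempts(lines: Iterable[str]) -> int:
    """Return how many log lines contain the restart marker."""
    return sum(1 for line in lines if RESTART_MARKER in line)


def scan_log(path: str) -> int:
    """Count restart attempts in a log file.

    The path is an assumption supplied by the caller (for example, the
    location of evm.log on the appliance).
    """
    with open(path, encoding="utf-8", errors="replace") as handle:
        return count_restart_attempts(handle)
```

A non-zero count suggests the Event Monitor thread has died at least once; cross-check against the workers table from rake evm:status.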

Actual results:
Event collection stops.

Expected results:
Events should be collected at all times and the worker status should remain "started".

Additional info:

Comment 1 Federico Simoncelli 2015-11-27 14:42:44 UTC
Proposed fix:

https://github.com/ManageIQ/manageiq/pull/5626

Comment 2 Federico Simoncelli 2015-11-30 11:02:04 UTC
*** Bug 1286618 has been marked as a duplicate of this bug. ***

Comment 3 Joe Vlcek 2015-11-30 15:33:18 UTC

*** This bug has been marked as a duplicate of bug 1281746 ***

Comment 4 Federico Simoncelli 2015-11-30 15:39:18 UTC
This is not a duplicate of #1281746; it happens on 5.5.0.11 as well.

Comment 6 Dafna Ron 2015-12-03 16:15:37 UTC
Verified on cfme-5.5.0.13-1.el7cf.x86_64.

Comment 8 errata-xmlrpc 2015-12-08 13:50:05 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2015:2551