Bug 772916

Summary: RFE - ovirt-etl should have a WatchDog
Product: [Retired] oVirt Reporter: Yaniv Lavi <ylavi>
Component: ovirt-engine-dwhAssignee: Yaniv Lavi <ylavi>
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: high Docs Contact:
Priority: high    
Version: unspecifiedCC: acathrow, iheim, ykaul
Target Milestone: ---   
Target Release: 3.1   
Hardware: x86_64   
OS: Linux   
Whiteboard: infra integration
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-08-09 08:02:08 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Yaniv Lavi 2012-01-10 10:35:18 UTC
Description of problem:
ovirt-etl currently doesn't stop on the case of connection refuse or any other error, it just logs the error and tries again. In order to prevent ovirt-etl from stopping and starting only by a user intervention, we need to implement a WatchDog.

How reproducible:
always

Steps to Reproduce:
1. Casue a error to ovirt-etl (shutdown potgres for instance).
2. Check the ovirt-etl status & tail /var/log/ovirt/ovirt-dwh/ovirt-etl.log 

Actual results:
ovirt-etl is not stopped, only error is logged.

Expected results:
ovirt-etl should die on error and start automatically when the error state is resolved (tested with the watchdog). Watchdog should log it as started ovirt-etl from error state.

Comment 2 Itamar Heim 2012-08-09 08:02:08 UTC
closing ON_QA bugs as oVirt 3.1 was released:
http://www.ovirt.org/get-ovirt/

Comment 3 Itamar Heim 2012-08-09 08:03:29 UTC
closing ON_QA bugs as oVirt 3.1 was released:
http://www.ovirt.org/get-ovirt/