Bug 971324

Summary: PRD33 - notification service - should send an email if db is down
Product: Red Hat Enterprise Virtualization Manager Reporter: Itamar Heim <iheim>
Component: ovirt-engine-notification-serviceAssignee: Mooli Tayer <mtayer>
Status: CLOSED CURRENTRELEASE QA Contact: Ilanit Stein <istein>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.3.0CC: acathrow, bazulay, bdagan, iheim, istein, jkt, mtayer, Rhev-m-bugs, talayan, yzaslavs, zdover
Target Milestone: ---Keywords: Improvement
Target Release: 3.3.0Flags: mtayer: needinfo-
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: infra
Fixed In Version: is3 Doc Type: Enhancement
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Itamar Heim 2013-06-06 09:29:28 UTC
today if the db is down (local or remote), the notification service can't send an alert.
a general alert, similar to "engine is down" scenario should be sent.
difference is we don't have the db, so need in the config a new parameter for list of email addresses to send notification on "db is down".
probably after X retries, every Y minutes.

Comment 1 Ilanit Stein 2013-07-17 07:37:49 UTC
Verified on is5,

As long as DB is down, endless "db down" notification are sent.
Is this intentionally, or doed it need to be fixed to be only a single notification?

Comment 2 Mooli Tayer 2013-07-17 09:03:04 UTC
It is intentional in the default configuration.

along with FAILED_QUERIES_NOTIFICATION_RECIPIENTS which defines where emails should be sent, another configuration parameter was defined:
FAILED_QUERIES_NOTIFICATION_THRESHOLD. from the conf file documentation: 

# Send a notification email after first failure to fetch notifications,
# and then once every failedQueriesNotificationThreshold times.
# 0 or 1 means notify on each failure.

The default value is 30. together with the default value for INTERVAL_IN_SECONDS (120) we will get one message every hour in the default configuration(!).

Ilanit, I see we have the docs_scoped flag as '?' does it mean the appropriate person will check if this feature (specifically the parameters) should be documented?

Comment 3 Itamar Heim 2014-01-21 22:32:38 UTC
Closing - RHEV 3.3 Released

Comment 4 Itamar Heim 2014-01-21 22:32:41 UTC
Closing - RHEV 3.3 Released