Bug 1162553

Summary: [RFE]RHSC / Nagios alerts e-mails related to Cluster should have Cluster label instead of Host
Product: [Red Hat Storage] Red Hat Gluster Storage
Reporter: Dusmant <dpati>
Component: nagios-server-addons
Assignee: Sahina Bose <sabose>
Status: CLOSED CANTFIX
QA Contact: RHS-C QE <rhsc-qe-bugs>
Severity: high
Docs Contact:
Priority: urgent
Version: rhgs-3.0
CC: rhsc-qe-bugs, sabose, sankarshan, ssampat
Target Milestone: ---
Keywords: FutureFeature, ZStream
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: Enhancement
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2018-01-30 07:55:23 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description Dusmant 2014-11-11 10:04:48 UTC
Description of problem: Currently, when a customer gets e-mails for cluster-level alerts (primarily all the services running under the cluster in Nagios), the message reads as shown below, which confuses customers.


***************current e-mails********************
e.g.1

***** Nagios *****

Notification Type: PROBLEM

Service: Cluster Utilization
Host: Eskan
Address: Eskan
State: CRITICAL

Date/Time: Tue Sept 30 21:27:04 AST 2014

Additional Info:

(null)
 
-----------------------------------------------------
e.g.2

***** Nagios *****

Notification Type: PROBLEM

Service: Volume Self-Heal - data
Host: Eskan
Address: Eskan
State: WARNING

Date/Time: Tue Sept 30 21:29:34 AST 2014

Additional Info:

(null)

**********************************************

How reproducible: Every time


Steps to Reproduce:
1. This can happen for any of the services running under the cluster (volume status, volume utilization, cluster utilization, geo-rep, self-heal, etc.). Take volume status as an example: if you stop one volume in the cluster, the service status changes to CRITICAL.
2. If e-mail notification is configured, an alert is sent to the configured e-mail address.
3. The content of the e-mail is similar to the examples shown above.

Actual results: 
***** Nagios *****

Notification Type: PROBLEM

Service: Volume Self-Heal - data
Host: Eskan
Address: Eskan
State: WARNING

Date/Time: Tue Sept 30 21:29:34 AST 2014

Additional Info:

(null)
*******************


Expected results:
***** Nagios *****

Notification Type: PROBLEM

Service: Volume Self-Heal - data
Cluster Name: Eskan
State: WARNING

Date/Time: Tue Sept 30 21:29:34 AST 2014

Additional Info: 
(null)
********************


Additional info: In the above example, the Additional Info field should not be (null). That is being tracked in a separate bug.

The fix is to replace the "Host:" label with "Cluster Name:" and to remove the "Address: <address>" line completely from the e-mail.
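
As a rough illustration of the requested change (not the nagios-server-addons implementation), the following Python sketch renders a notification body that uses "Cluster Name:" and omits the "Address:" line for cluster-level services. The function name, the `cluster_level` flag, and the dictionary keys are all hypothetical; in an actual Nagios setup the body is typically produced by a notification command built from Nagios macros rather than by code like this.

```python
def render_service_notification(fields, cluster_level):
    """Build the plain-text body of a Nagios-style service notification.

    `fields` is a hypothetical dict; its key names are assumptions made
    for this sketch and do not come from nagios-server-addons.
    """
    lines = [
        "***** Nagios *****",
        "",
        f"Notification Type: {fields['notification_type']}",
        "",
        f"Service: {fields['service']}",
    ]
    if cluster_level:
        # Cluster-level service: label the logical host as a cluster
        # and leave out the Address line entirely.
        lines.append(f"Cluster Name: {fields['host']}")
    else:
        # Ordinary host-level service: keep the existing layout.
        lines.append(f"Host: {fields['host']}")
        lines.append(f"Address: {fields['address']}")
    lines += [
        f"State: {fields['state']}",
        "",
        f"Date/Time: {fields['date_time']}",
        "",
        "Additional Info:",
        "",
        fields["additional_info"],
    ]
    return "\n".join(lines)


# Values taken from the example e-mail in this report.
example = {
    "notification_type": "PROBLEM",
    "service": "Volume Self-Heal - data",
    "host": "Eskan",
    "address": "Eskan",
    "state": "WARNING",
    "date_time": "Tue Sept 30 21:29:34 AST 2014",
    "additional_info": "(null)",
}

cluster_body = render_service_notification(example, cluster_level=True)
host_body = render_service_notification(example, cluster_level=False)
print(cluster_body)
```

Keeping the host-level branch unchanged means only services attached to the logical cluster host would see the new label; how a service is recognised as cluster-level is left open here.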

Comment 2 Sahina Bose 2015-02-09 07:15:41 UTC
As per the triage meeting, this is not a must-fix. This bz is related to the feature regarding notification suppression.

Comment 3 Sahina Bose 2015-03-26 05:45:09 UTC
Moving this to 3.1.1, as custom templates require some amount of work. Please retarget if necessary.

Comment 4 Sahina Bose 2015-06-02 07:23:13 UTC
Setting appropriate flags based on comment 3

Comment 7 Sahina Bose 2018-01-30 07:55:23 UTC
Thank you for the bug report. However, closing this, as the bug is filed against gluster nagios monitoring, for which no further development is being undertaken.