Bug 1559426 - Sometimes monitoring-integration is not creating panels for a particular resource in alert dashbaord
Summary: Sometimes monitoring-integration is not creating panels for a particular reso...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: web-admin-tendrl-monitoring-integration
Version: rhgs-3.4
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: RHGS 3.4.0
Assignee: gowtham
QA Contact: Filip Balák
URL:
Whiteboard:
Depends On:
Blocks: 1503137
TreeView+ depends on / blocked
 
Reported: 2018-03-22 14:21 UTC by gowtham
Modified: 2018-09-04 07:03 UTC (History)
5 users (show)

Fixed In Version: tendrl-monitoring-integration-1.6.1-3.el7rhgs
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-09-04 07:02:28 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github Tendrl monitoring-integration issues 381 0 None None None 2018-03-28 15:59:43 UTC
Red Hat Product Errata RHSA-2018:2616 0 None None None 2018-09-04 07:03:21 UTC

Description gowtham 2018-03-22 14:21:07 UTC
Description of problem:
If panel for a particular resource is already present in alerting dashboard then monitoring-integration won't create a duplicate panel for that resource. While checking duplication monitoring-integration is matching substring also, Because of this any resource_name which match with some other resource_name's substring then the panel is not created for that new resource.

(e.g) if panel name is v11 then if new panel name is v1 then v1 is matching with a substring of v11, So it considered as duplicate and alert panel is not created.

Version-Release number of selected component (if applicable):


How reproducible:
Create volume with name v11 first and create volume with name v1

Steps to Reproduce:
1. 
2.
3.

Actual results:
Volume_name if matches with existing volume_name substring then alert panel for a new volume is not created. 

Expected results:
Alert dashbaord should have panels for all resources

Additional info:

Comment 2 Martin Bukatovic 2018-03-28 08:25:27 UTC
Could you provide more details about:

* Is it possible to reproduce this on previously released RHGS WA?
* Version where the bug is present.
* Where should we look for the panes mentioned in BZ description in
  grafana exactly?

Comment 3 gowtham 2018-03-28 15:58:52 UTC
It can be reproducible in the previous release like 1.6.1-1, Actually, alert panels are hidden for the users, to check the alert dashbaord please sign in with credentials in grafana. Then you can see the two organization:
   1. main org
   2. Alert dashbaord org

So we are using the second organization for raising alerts when threshold breach crosses 75%.

For each resource, we are creating a separate alert panel for each resource.
So while creating an alert panel we are comparing alert panel resource name with an already present alert panel resource name, problem is if new resource name match with already existing resource names substring then alert panel for new resource is not created. So no utilization alert will be raised for that resource.

(e.g) if volume_alert_dashboard have panels for volume V11 then if new volume is created with name V1 then alert panel for V1 is not created because V1 is matching with substring V11. So no utilization alerts raised for V1.

To see alert dashbaord:
 1. Do sign-in with grafana
 2. Then you can see new see a new organization with name Alert dashboard
 3. Switch the control to Alert dashboard
 4. press Home to see list of alert dashbaords

Comment 4 gowtham 2018-03-28 16:42:51 UTC
In version 3.3.1 it can be reproducible

Comment 5 Martin Bukatovic 2018-03-29 11:55:03 UTC
I'm assuming that alert dashboard in grafana is mentioned
only as a hint for qe team and this
feature is still not supported (as this is not documented, there is no feature
BZ for it and my understanding was that this is internal implementation details
to support alerts[1])

[1] see eg. this note from Mrugesh:

> Alerts org will contain the panels created for alert callbacks and will be
> hidden from the end users.

from https://github.com/Tendrl/specifications/issues/191#issuecomment-326197800

Is this correct?

Comment 6 Nishanth Thomas 2018-03-29 11:59:32 UTC
That's correct. But if alert dashboard is not created properly, grafana will not send any alerts to tendrl for any thing related to cpu, memory etc threshold chnages and will not reflect in the alert panel

Comment 9 Filip Balák 2018-05-31 14:24:09 UTC
Seems ok. --> VERIFIED

Tested with:
tendrl-ansible-1.6.3-4.el7rhgs.noarch
tendrl-api-1.6.3-3.el7rhgs.noarch
tendrl-api-httpd-1.6.3-3.el7rhgs.noarch
tendrl-commons-1.6.3-5.el7rhgs.noarch
tendrl-grafana-plugins-1.6.3-3.el7rhgs.noarch
tendrl-grafana-selinux-1.5.4-2.el7rhgs.noarch
tendrl-monitoring-integration-1.6.3-3.el7rhgs.noarch
tendrl-node-agent-1.6.3-5.el7rhgs.noarch
tendrl-notifier-1.6.3-3.el7rhgs.noarch
tendrl-selinux-1.5.4-2.el7rhgs.noarch
tendrl-ui-1.6.3-2.el7rhgs.noarch

Comment 11 errata-xmlrpc 2018-09-04 07:02:28 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2018:2616


Note You need to log in before you can comment on or make changes to this bug.