Bug 1616208 - glustershd alerts should mention affected node
Summary: glustershd alerts should mention affected node
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: web-admin-tendrl-notifier
Version: rhgs-3.4
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Target Release: RHGS 3.4.0
Assignee: gowtham
QA Contact: Filip Balák
URL:
Whiteboard:
Depends On:
Blocks: 1503137
Reported: 2018-08-15 09:23 UTC by Filip Balák
Modified: 2018-09-04 07:09 UTC

Fixed In Version: tendrl-gluster-integration-1.6.3-10.el7rhgs
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-09-04 07:08:56 UTC
Embargoed:


Attachments
events page (122.99 KB, image/png)
2018-08-15 09:23 UTC, Filip Balák


Links
System ID Private Priority Status Summary Last Updated
Github Tendrl gluster-integration issues 694 0 None None None 2018-08-16 14:50:25 UTC
Red Hat Bugzilla 1611601 0 unspecified CLOSED Alert Service: glustershd is disconnected in cluster is not cleared 2021-02-22 00:41:40 UTC
Red Hat Bugzilla 1616215 0 unspecified CLOSED All alerts Service: glustershd is disconnected in cluster are cleared when service starts on one node 2021-02-22 00:41:40 UTC
Red Hat Product Errata RHSA-2018:2616 0 None None None 2018-09-04 07:09:57 UTC

Internal Links: 1611601 1616215

Description Filip Balák 2018-08-15 09:23:43 UTC
Created attachment 1476098
events page

Description of problem:
Currently the following alert and event messages are generated:
```
Service: glustershd is connected in cluster <cluster>
Service: glustershd is disconnected in cluster <cluster>
```
These messages do not make clear which nodes are affected.

Version-Release number of selected component (if applicable):
tendrl-ansible-1.6.3-6.el7rhgs.noarch
tendrl-api-1.6.3-5.el7rhgs.noarch
tendrl-api-httpd-1.6.3-5.el7rhgs.noarch
tendrl-commons-1.6.3-12.el7rhgs.noarch
tendrl-grafana-plugins-1.6.3-10.el7rhgs.noarch
tendrl-grafana-selinux-1.5.4-2.el7rhgs.noarch
tendrl-monitoring-integration-1.6.3-10.el7rhgs.noarch
tendrl-node-agent-1.6.3-10.el7rhgs.noarch
tendrl-notifier-1.6.3-4.el7rhgs.noarch
tendrl-selinux-1.5.4-2.el7rhgs.noarch
tendrl-ui-1.6.3-10.el7rhgs.noarch

How reproducible:
100%

Steps to Reproduce:
1. Import a cluster with a distributed replicated volume.
2. Connect to one of the volume nodes and get the PID of the glustershd process:
$ cat /var/run/gluster/glustershd/glustershd.pid
<glustershd-pid>
3. Kill the glustershd process:
$ kill <glustershd-pid>
4. Wait for the alert in the UI.
5. Restart the glusterd service on the node where glustershd was killed. This should start glustershd again.
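
Steps 2 and 3 could be scripted roughly as follows. This is only a sketch for reproducing the bug: the pid file path comes from step 2, while the function names `read_pid` and `kill_glustershd` are hypothetical helpers and not part of Tendrl or Gluster.

```python
import os
import signal
from pathlib import Path

# Path taken from step 2 of this report.
PID_FILE = Path("/var/run/gluster/glustershd/glustershd.pid")


def read_pid(pid_file: Path = PID_FILE) -> int:
    """Return the PID stored in the given pid file (hypothetical helper)."""
    return int(pid_file.read_text().strip())


def kill_glustershd(pid_file: Path = PID_FILE) -> int:
    """Send SIGTERM, the default signal of plain `kill`, to the recorded PID."""
    pid = read_pid(pid_file)
    os.kill(pid, signal.SIGTERM)
    return pid
```

Calling `kill_glustershd()` as root on a volume node should have the same effect as steps 2 and 3; steps 4 and 5 still have to be done by hand.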

Actual results:
The following events and alerts are generated, with no mention of the affected node:
Service: glustershd is disconnected in cluster <cluster>
Service: glustershd is connected in cluster <cluster>

Expected results:
The alert should mention the node, because the user currently cannot tell which nodes, or how many, are affected.

Additional info:

Comment 2 gowtham 2018-08-16 14:52:04 UTC
PR is under review: https://github.com/Tendrl/gluster-integration/pull/695

Comment 3 gowtham 2018-08-17 10:16:36 UTC
I have modified the alert message format for SVC-related alerts to:

Service: {service_name} is Connected/Disconnected on node {peer_hostname/ip} of cluster {cluster_short_name}
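
For illustration, a message in this format could be built as in the sketch below. It is not the code from the pull request: the function `format_svc_alert` and its parameter names are assumptions chosen to mirror the placeholders above.

```python
def format_svc_alert(service_name, connected, peer, cluster_short_name):
    """Build an SVC alert message in the format described in this comment.

    Hypothetical helper: `peer` stands for the peer hostname or IP, and
    `connected` selects between the Connected and Disconnected wording.
    """
    state = "Connected" if connected else "Disconnected"
    return "Service: {0} is {1} on node {2} of cluster {3}".format(
        service_name, state, peer, cluster_short_name
    )


# Example with made-up node and cluster names.
message = format_svc_alert("glustershd", False, "node1.example.com", "cluster1")
```

With the made-up values above, `message` reads "Service: glustershd is Disconnected on node node1.example.com of cluster cluster1", so each alert names the node it applies to.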

Comment 4 Martin Bukatovic 2018-08-20 11:13:15 UTC
QE team will test this bug as noted in the description.

Comment 5 Nishanth Thomas 2018-08-20 11:15:15 UTC
https://github.com/Tendrl/gluster-integration/pull/695

Comment 6 gowtham 2018-08-21 08:35:50 UTC
Fixed in tendrl-gluster-integration-1.6.3-10.el7rhgs

Comment 7 Martin Bukatovic 2018-08-21 08:39:38 UTC
All acks provided, attaching to the tracker.

Comment 9 Filip Balák 2018-08-21 12:50:32 UTC
The messages look as gowtham described in comment 3. --> VERIFIED

Tested with:
tendrl-ansible-1.6.3-7.el7rhgs.noarch
tendrl-api-1.6.3-5.el7rhgs.noarch
tendrl-api-httpd-1.6.3-5.el7rhgs.noarch
tendrl-commons-1.6.3-12.el7rhgs.noarch
tendrl-gluster-integration-1.6.3-10.el7rhgs.noarch
tendrl-grafana-plugins-1.6.3-10.el7rhgs.noarch
tendrl-grafana-selinux-1.5.4-2.el7rhgs.noarch
tendrl-monitoring-integration-1.6.3-10.el7rhgs.noarch
tendrl-node-agent-1.6.3-10.el7rhgs.noarch
tendrl-notifier-1.6.3-4.el7rhgs.noarch
tendrl-selinux-1.5.4-2.el7rhgs.noarch
tendrl-ui-1.6.3-11.el7rhgs.noarch

Comment 11 errata-xmlrpc 2018-09-04 07:08:56 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2018:2616

