Bug 1107998 - [Nagios] Auto-config removes all the configuration if the host used for discovery is detached from the cluster
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: nagios-server-addons
Version: rhgs-3.0
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ---
Target Release: RHGS 3.0.0
Assignee: Ramesh N
QA Contact: Shruti Sampat
URL:
Whiteboard:
Depends On:
Blocks: 1139228
 
Reported: 2014-06-11 09:23 UTC by Kanagaraj
Modified: 2016-04-18 10:06 UTC

Fixed In Version: nagios-server-addons-0.1.3-2.el6rhs.x86_64
Doc Type: Bug Fix
Doc Text:
Clone Of:
Clones: 1139228
Environment:
Last Closed: 2014-09-22 19:11:13 UTC
Embargoed:


Links
System: Red Hat Product Errata
ID: RHEA-2014:1277
Private: 0
Priority: normal
Status: SHIPPED_LIVE
Summary: Red Hat Storage Console 3.0 enhancement and bug fix update
Last Updated: 2014-09-22 23:06:30 UTC

Description Kanagaraj 2014-06-11 09:23:25 UTC
Description of problem:
Auto-config removes all the configuration (hosts, volumes, bricks) from Nagios if the host used for discovery is no longer part of the cluster.

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Create a cluster with 3 nodes (HostA, HostB, HostC).
2. Create some volumes and start them.
3. Run the discovery script, providing the name of the cluster and the IP address of HostA.
4. Make sure all the volumes/hosts show up in the Nagios UI.
5. Detach HostA from the cluster using the "gluster peer detach" command.
6. Re-schedule the auto-config service in the Nagios UI.

Actual results:
All hosts/volumes except HostA are removed from the Nagios configuration.

Expected results:
Hosts/volumes should not be removed from the Nagios configuration. The user should instead be asked to run discovery again, providing the IP address of a host that is still in the cluster (for example HostB).
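
To illustrate why the configuration gets wiped, here is a minimal sketch of a sync that trusts whatever the discovery host reports. It is not taken from discovery.py: `plan_sync` and its arguments are hypothetical names, and the sketch assumes that a detached host reports only itself as a cluster member.

```python
# Hypothetical sketch; names are illustrative and not taken from discovery.py.

def plan_sync(configured_hosts, discovered_hosts):
    """Return (to_add, to_remove) for a naive sync that trusts discovery."""
    configured = set(configured_hosts)
    discovered = set(discovered_hosts)
    to_add = sorted(discovered - configured)
    to_remove = sorted(configured - discovered)
    return to_add, to_remove


# Nagios was configured while HostA was still a peer of HostB and HostC.
configured = ["HostA", "HostB", "HostC"]

# Assumption: after "gluster peer detach", HostA only knows about itself.
discovered_from_host_a = ["HostA"]

to_add, to_remove = plan_sync(configured, discovered_from_host_a)
print(to_add)     # []
print(to_remove)  # ['HostB', 'HostC']
```

Under these assumptions every host other than the discovery host ends up on the removal list, which matches the actual results reported above.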

Additional info:

Comment 1 Ramesh N 2014-06-11 10:39:22 UTC
Patch sent upstream: http://review.gluster.org/#/c/8024/

Comment 2 Shruti Sampat 2014-06-17 11:50:44 UTC
Verified as fixed in nagios-server-addons-0.1.4-1.el6rhs.x86_64

Performed the following steps -

1. Created a cluster of four nodes, host1, host2, host3 and host4.
2. Created a couple of volumes, and started them.
3. Configured this cluster to be monitored by the Nagios server, which is also one of the RHS nodes.
4. Removed host4 from the cluster using the "gluster peer detach" command.
5. Attempted to run the cluster auto-config service from the Nagios UI. Saw the status of the service change to critical with the following status information -

Can't remove all hosts except sync host in 'auto' mode. Run auto discovery manually

6. Ran auto-discovery manually from the command line on the Nagios server -

# /usr/lib64/nagios/plugins/gluster/discovery.py -c cluster_auto -H host1

Cluster configurations changed

Changes :
Hostgroup cluster_auto - UPDATE
Host cluster_auto - UPDATE
         Service - Cluster Auto Config -UPDATE 
Host host4 - REMOVE
Are you sure, you want to commit the changes? (Yes, No) [Yes]: 
Cluster configurations synced successfully from host host1
Do you want to restart Nagios to start monitoring newly discovered entities? (Yes, No) [Yes]: 
Nagios re-started successfully

In the Nagios UI, the host host4 was removed, and the status of the cluster auto-config service changed to OK.
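
The status message seen in step 5 suggests a guard along the following lines. This is only a sketch of the behaviour observed here, not the code from the upstream patch: `check_removal`, `SyncRefused` and the `mode` values are hypothetical names, and the host lists in the usage lines are made up for illustration.

```python
# Hypothetical sketch of the observed behaviour; not the actual patch.

class SyncRefused(Exception):
    """Raised when an automatic sync would wipe the cluster configuration."""


def check_removal(configured_hosts, discovered_hosts, sync_host, mode):
    """Return the hosts to remove, or refuse in 'auto' mode if only the
    sync host would be left in the configuration."""
    configured = set(configured_hosts)
    to_remove = configured - set(discovered_hosts)
    remaining = configured - to_remove
    # True when something is removed and nothing but the sync host survives.
    wipes_cluster = bool(to_remove) and remaining <= {sync_host}
    if mode == "auto" and wipes_cluster:
        raise SyncRefused(
            "Can't remove all hosts except sync host in 'auto' mode. "
            "Run auto discovery manually"
        )
    return sorted(to_remove)


hosts = ["host1", "host2", "host3", "host4"]

# A sync host that still sees its peers: removing one host is allowed.
print(check_removal(hosts, ["host1", "host2", "host3"], "host1", "auto"))

# A sync host that sees only itself: refused in 'auto' mode.
try:
    check_removal(hosts, ["host4"], "host4", "auto")
except SyncRefused as err:
    print(err)
```

Keeping the refusal limited to the automatic mode leaves the manual command-line run free to apply the same change after the user has confirmed it, as in step 6.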

Comment 3 errata-xmlrpc 2014-09-22 19:11:13 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHEA-2014-1277.html

