Bugzilla will be upgraded to version 5.0. The upgrade date is tentatively scheduled for 2 December 2018, pending final testing and feedback.
Bug 1177129 - [New] - glusterd service in nagios is not marked critical when glusterd is hung on the node
[New] - glusterd service in nagios is not marked critical when glusterd is h...
Status: CLOSED ERRATA
Product: Red Hat Gluster Storage
Classification: Red Hat
Component: nagios-server-addons (Show other bugs)
unspecified
Unspecified Unspecified
medium Severity high
: ---
: RHGS 3.1.0
Assigned To: Sahina Bose
RamaKasturi
:
Depends On:
Blocks: 1202842
  Show dependency treegraph
 
Reported: 2014-12-24 05:22 EST by RamaKasturi
Modified: 2015-07-29 01:27 EDT (History)
4 users (show)

See Also:
Fixed In Version: nagios-server-addons-0.2.1-3.el7rhgs, nagios-server-addons-0.2.1-2.el6rhs
Doc Type: Bug Fix
Doc Text:
Previously, the Nagios plugin monitored if glusterd process is present. As a consequence, the Plugin returned OK status even if the glusterd process is dead but the pid file existed. With this fix, the plugin is updated to monitor glusterd service state and the glusterd service status is now reflected correctly.
Story Points: ---
Clone Of:
Environment:
Last Closed: 2015-07-29 01:27:18 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)
Screenshot when glusterd is stopped (191.48 KB, image/png)
2015-06-01 06:28 EDT, RamaKasturi
no flags Details
Screenshot when glusterd dead but pid file exists (191.55 KB, image/png)
2015-06-01 06:29 EDT, RamaKasturi
no flags Details


External Trackers
Tracker ID Priority Status Summary Last Updated
Red Hat Product Errata RHEA-2015:1494 normal SHIPPED_LIVE Red Hat Gluster Storage Console 3.1 Enhancement and bug fixes 2015-07-29 05:24:02 EDT

  None (edit)
Description RamaKasturi 2014-12-24 05:22:34 EST
Description of problem:
glusterd service in nagios is not marked critical when glusterd is hung i.e when "service glusterd status" gives the output as glusterd dead but pid file exists.

Version-Release number of selected component (if applicable):
nagios-server-addons-0.1.11-1.el6rhs.noarch

How reproducible:
Always

Steps to Reproduce:
1. perform any operation where glusterd gets hung/ service glusterd status says "glusterd dead but pid file exists."
2.
3.

Actual results:
glusterd is not marked as critical, status and status information for glusterd shows as "ok" with "process glusterd is running"

Expected results:
glusterd service should be marked critical with status information as "glusterd not running".

Additional info:
Comment 2 RamaKasturi 2014-12-25 00:48:56 EST
Attaching the sos reports link

http://rhsqe-repo.lab.eng.blr.redhat.com/sosreports/rhsc/1177129/
Comment 4 Timothy Asir 2015-04-15 03:14:42 EDT
Patch sent to master: http://review.gluster.org/10246
Comment 6 RamaKasturi 2015-05-28 08:52:28 EDT
Can you please put FIV for this bug?
Comment 7 RamaKasturi 2015-06-01 06:12:40 EDT
Moving this bug back because when glusterd is stopped on the node or when glusterd is dead put pid file exists, then the status of glusterd is moving to UNKNOWN, but glusterd should be marked critical.
Comment 8 RamaKasturi 2015-06-01 06:28:01 EDT
Created attachment 1033250 [details]
Screenshot when glusterd is stopped
Comment 9 RamaKasturi 2015-06-01 06:29:17 EDT
Created attachment 1033251 [details]
Screenshot when glusterd dead but pid file exists
Comment 10 RamaKasturi 2015-06-01 06:31:58 EDT
Ignore comment 7.

Moving this bug back because when glusterd is stopped on the node service status moves to UNKNOWN with status information "glusterd is stopped".

When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as WARNING with status information as "gluster dead but pid file exists".

In both the above cases glusterd should be marked critical.
Comment 11 Sahina Bose 2015-06-10 09:38:47 EDT
Posted patch http://review.gluster.org/11161 to correct this
Comment 12 RamaKasturi 2015-06-18 02:54:11 EDT
Verified and works fine with build nagios-server-addons-0.2.1-2.el6rhs.noarch.

When glusterd is stopped on the node, status of gluster Management is marked as critical with status information "Process glusterd is not running".

When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as WARNING with status information as "gluster dead but pid file exists".
Comment 13 RamaKasturi 2015-06-18 03:21:11 EDT
Ignore comment 12.

Verified and works fine with build nagios-server-addons-0.2.1-2.el6rhs.noarch.

When glusterd is stopped on the node, status of gluster Management is marked as CRITICAL with status information "Process glusterd is not running".

When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as CRITICAL with status information as "gluster dead but pid file exists".
Comment 14 Divya 2015-07-26 01:40:12 EDT
Sahina,

Please review and sign-off the edited doc text.
Comment 15 Sahina Bose 2015-07-27 01:02:51 EDT
acked
Comment 17 errata-xmlrpc 2015-07-29 01:27:18 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHEA-2015-1494.html

Note You need to log in before you can comment on or make changes to this bug.