Description of problem: glusterd service in nagios is not marked critical when glusterd is hung i.e when "service glusterd status" gives the output as glusterd dead but pid file exists. Version-Release number of selected component (if applicable): nagios-server-addons-0.1.11-1.el6rhs.noarch How reproducible: Always Steps to Reproduce: 1. perform any operation where glusterd gets hung/ service glusterd status says "glusterd dead but pid file exists." 2. 3. Actual results: glusterd is not marked as critical, status and status information for glusterd shows as "ok" with "process glusterd is running" Expected results: glusterd service should be marked critical with status information as "glusterd not running". Additional info:
Attaching the sos reports link http://rhsqe-repo.lab.eng.blr.redhat.com/sosreports/rhsc/1177129/
Patch sent to master: http://review.gluster.org/10246
Can you please put FIV for this bug?
Moving this bug back because when glusterd is stopped on the node or when glusterd is dead put pid file exists, then the status of glusterd is moving to UNKNOWN, but glusterd should be marked critical.
Created attachment 1033250 [details] Screenshot when glusterd is stopped
Created attachment 1033251 [details] Screenshot when glusterd dead but pid file exists
Ignore comment 7. Moving this bug back because when glusterd is stopped on the node service status moves to UNKNOWN with status information "glusterd is stopped". When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as WARNING with status information as "gluster dead but pid file exists". In both the above cases glusterd should be marked critical.
Posted patch http://review.gluster.org/11161 to correct this
Verified and works fine with build nagios-server-addons-0.2.1-2.el6rhs.noarch. When glusterd is stopped on the node, status of gluster Management is marked as critical with status information "Process glusterd is not running". When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as WARNING with status information as "gluster dead but pid file exists".
Ignore comment 12. Verified and works fine with build nagios-server-addons-0.2.1-2.el6rhs.noarch. When glusterd is stopped on the node, status of gluster Management is marked as CRITICAL with status information "Process glusterd is not running". When glusterd is killed on the node using kill -9 <glusterpid> service status is shown as CRITICAL with status information as "gluster dead but pid file exists".
Sahina, Please review and sign-off the edited doc text.
acked
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHEA-2015-1494.html