Description of problem: Cluster-Quorum status remains as "ok" with status information "Server quorum turned on for vol4,vol1 " when one of the node in the cluster is powered off. Version-Release number of selected component (if applicable): nagios-server-addons-0.2.1-2.el6rhs.noarch How reproducible: Always Steps to Reproduce: 1. Add two nodes in the cluster. 2. set quorum on any one of the volume by running the command "gluster volume set <vol-name> cluster.server-quorum-type server 3. Now run cluster auto-config. 4. Now power off one of the node. Actual results: status remains "OK" with status information "Server quorum turned on for <vol_names>" Expected results: Cluster-Quorum status should change the status to CRITICAL with status information " QUORUM: Cluster server-side quorum lost." Additional info:
Seeing the following in nagios.log. [1435403539] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;cluster1;Cluster - Quorum;2;QUORUM: Cluster server-side quorum lost. [1435403539] Warning: Passive check result was received for service 'Cluster - Quorum' on host 'cluster1', but the service could not be found!
Issue due to change in service name in Nagios, and ncsa was sending alert to older service. The service name was changed to ensure that the command definition was modified on update, as new freshness check was introduced. http://review.gluster.org/#/c/11465 - posted to fix this
Doc text is edited. Please sign off to be included in Known Issues.
minor edit.
Hi Sahina, After enabling server quorum on a volume, i powered off one node in the cluster and now my quorum status goes to UNKNOWN with status information "Server quorum not turned on for any volume". Can you please check this? Thanks kasturi
Verified on RHS+Nagios deployment and works fine with build gluster-nagios-addons-0.2.5-1.el7rhgs.x86_64. When one of the nodes in the cluster is powered off, Cluster - Quorum Status is marked as CRITICAL with status information "QUORUM: Cluster server-side quorum lost".
Hi Sahina, The doc text is updated. Please review it and share your technical review comments. If it looks ok, then sign-off on the same.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2015-1848.html