Description of problem: ------------------------- Consider a cluster having two volume, say vol1 and vol2, both with server-side quorum enabled. When quorum is lost for vol1, the status of the cluster-quorum service changes to critical, and the status information reads - QUORUM: Server quorum lost for volume vol1. Stopping local bricks. A notification is sent via e-mail and SNMP traps. If quorum is lost for vol2, later, (before it was regained for vol1), the status of the service would remain critical. The status information would read - QUORUM: Server quorum lost for volume vol2. Stopping local bricks. Since the status of the quorum service did not change, notifications will not be sent. Version-Release number of selected component (if applicable): gluster-nagios-addons-0.1.2-1.el6rhs.x86_64 How reproducible: Always Steps to Reproduce: 1. Create two distributed-replicate volumes, vol1 and vol2. 2. Cause quorum to be lost for vol1, observe that the status of the service changes to critical and that notifications are sent. 3. Cause quorum to be lost for vol2. Actual results: The status of the service remains critical, hence status change is not involved and notifications are not sent. Expected results: Notifications should be sent whenever quorum is lost/regained for any volume in the cluster. Additional info:
Please add doc text for the known issue
Please review and signoff edited doc text.
Looks good to me
As per redesign, notification is sent only once as Quorum is cluster level service
Verified and works fine with build gluster-nagios-addons-0.2.0-1. As per comment 7, notification is sent only once as Quorum is cluster level service. Below is the way notification comes. ** PROBLEM Service Alert: cluster1/Cluster - Quorum is CRITICAL ** ***** Nagios ***** Notification Type: PROBLEM Service: Cluster - Quorum Host: cluster1 Address: cluster1 State: CRITICAL Date/Time: Mon Jun 1 16:46:20 IST 2015 Additional Info: QUORUM: Cluster server-side quorum lost. When it is regained, email notification is sent as : ** RECOVERY Service Alert: cluster1/Cluster - Quorum is OK ** ***** Nagios ***** Notification Type: RECOVERY Service: Cluster - Quorum Host: cluster1 Address: cluster1 State: OK Date/Time: Mon Jun 1 16:56:00 IST 2015 Additional Info: QUORUM: Cluster server-side quorum regained.
Sahina, Could you review the edited doc text and sign-off.
Acked
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHEA-2015-1494.html