Description of problem: ----------------------- When glusterd was stopped on a couple of nodes in the cluster, the cluster auto-config service was seen to be in warning status with "(null)" as the status information. Version-Release number of selected component (if applicable): gluster-nagios-addons-0.1.2-1.el6rhs.x86_64 How reproducible: Always Steps to Reproduce: 1. Setup a cluster of RHS nodes (I had 7 nodes in the cluster) 2. Monitor the cluster using Nagios. 3. Stop glusterd on a couple of nodes. Observer the cluster auto-config service. Actual results: The status of the service is warning and the status information reads "(null)". Expected results: glusterd being down on the nodes should not affect the auto-config service. Additional info:
Similar behavior was seen when some nodes in the cluster were powered off. See BZ #1109025.
Fixed in Patch : http://review.gluster.org/#/c/8074/
Downstream patch https://code.engineering.redhat.com/gerrit/#/c/27038/
Review and signoff the edited doc text.
Doc text looks good to me.
Verified as fixed in nagios-server-addons-0.1.8-1.el6rhs.noarch When glusterd is down on some of the nodes in the cluster, the cluster auto-config service remains OK and does run successfully to sync the cluster configurations. If glusterd is down on the node that is used to sync the cluster configurations via the discovery script, then trying to run the auto-config service will cause it to be in CRITICAL state with the following in the status information - Failed to execute NRPE command 'discover_volume_list' in host <hostname> This is expected as the discovery script fails to run the required commands owing to glusterd being down.
Hi Ramesh, Can you please review the edited doc text for technical accuracy and sign off?
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2015-0039.html