Bug 1258718
Summary: | [New] - Passive checks for services does not work as expected due to version mismatch of nsca server and client | ||||||
---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | RamaKasturi <knarra> | ||||
Component: | nagios-server-addons | Assignee: | Ramesh N <rnachimu> | ||||
Status: | CLOSED ERRATA | QA Contact: | Triveni Rao <trao> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | high | ||||||
Version: | rhgs-3.1 | CC: | asrivast, divya, rnachimu, sabose, sashinde, trao | ||||
Target Milestone: | --- | Keywords: | ZStream | ||||
Target Release: | RHGS 3.1.2 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | nsca-2.9.1-4.1.el6rhs | Doc Type: | Bug Fix | ||||
Doc Text: |
Previously, passive checks for services did not work as expected because the nsca server and client versions did not match. As a consequence, nsca-client running on the Red Hat Enterprise Linux 7 Red Hat Gluster Storage nodes was unable to send the passive service status to the Nagios server running on Red Hat Enterprise Linux 6 nodes. With this fix, the nsca package shipped in Red Hat Enterprise Linux 6 Red Hat Gluster Storage is upgraded to version nsca-2.9.1. Now, passive checks sent to the Nagios server (Red Hat Enterprise Linux 6) from Red Hat Enterprise Linux 7 Red Hat Gluster Storage nodes are received, and Nagios monitoring works as expected.
|
Story Points: | --- | ||||
Clone Of: | Environment: | ||||||
Last Closed: | 2016-03-01 06:11:09 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | 1277082 | ||||||
Bug Blocks: | 1260783 | ||||||
Attachments: |
|
Description
RamaKasturi
2015-09-01 05:58:00 UTC
The Gluster Management, Gluster NFS, Quota, SMB, Brick Status, Self-Heal, and Quorum checks do not work because the nsca server and client versions do not match.

The RHEL 7 RHGS nodes have a newer version of nsca-client (nsca-client-2.9.1-11.2.el7rhgs.x86_64.rpm). The Nagios server running on RHEL 6 (console) has nsca version nsca-2.7.2-8.2.el6rhs.x86_64.rpm.

According to the Nagios mailing lists, version 2.9 of nsca is not compatible with older servers because of a change in packet size introduced in 2.9 (http://permalink.gmane.org/gmane.network.nagios.user/73111).

However, since the newer server (2.9) is compatible with older clients, one way to solve this is to upgrade the nsca version that we ship for RHEL 6 in RHSC. Ramesh, what do you think?

The latest NSCA package, nsca-2.9.1-4, will fix this issue.

This bug is verified and no issues were found.

Steps followed:
1. Installed the latest RHSC+Nagios 3.1.2.
2. Added RHGS 3.1.2 nodes based on RHEL 7.2.
3. Ran configure-gluster-nagios -c <clustername> -H <hostname>.
4. Powered off one of the nodes in the cluster.
5. Cluster quorum showed the proper messages.

nsca version on the Gluster node:
[root@dhcp37-127 ~]# rpm -qa | grep nsca
nsca-client-2.9.1-11.2.el7rhgs.x86_64
[root@dhcp37-127 ~]#

nsca version on the RHSC+Nagios node:
[root@dhcp37-142 yum.repos.d]# rpm -qa | grep nsca
nsca-2.9.1-4.1.el6rhs.x86_64
[root@dhcp37-142 yum.repos.d

Attached the screenshot taken while one of the nodes was powered off.

Created attachment 1097100 [details]
screen shot 1
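
The compatibility rule from the description (a 2.9 client needs a 2.9 server, while a 2.9 server still accepts older clients) can be sketched as a small check over the package names reported by rpm. This is only an illustration: the function names and the regular expression are hypothetical and not part of nsca or any Red Hat tooling, and the only rule encoded is the one stated in this bug.

```python
import re

# Assumption: per the description, nsca 2.9 changed the packet size, so this
# is the first version whose clients cannot talk to older servers.
PACKET_CHANGE_VERSION = (2, 9)


def parse_version(package):
    """Extract the upstream version tuple from an RPM name such as
    'nsca-client-2.9.1-11.2.el7rhgs.x86_64'."""
    match = re.match(r"nsca(?:-client)?-(\d+(?:\.\d+)*)-", package)
    if match is None:
        raise ValueError("not an nsca package name: %r" % package)
    return tuple(int(part) for part in match.group(1).split("."))


def passive_checks_compatible(client_pkg, server_pkg):
    """Return True if the client can send passive checks to the server
    under the rule described in this bug report."""
    client = parse_version(client_pkg)
    server = parse_version(server_pkg)
    if client[:2] >= PACKET_CHANGE_VERSION:
        # A 2.9+ client needs a server that understands the new packet size.
        return server[:2] >= PACKET_CHANGE_VERSION
    # Older clients are accepted by both older and newer servers.
    return True


if __name__ == "__main__":
    client = "nsca-client-2.9.1-11.2.el7rhgs.x86_64"
    for server in ("nsca-2.7.2-8.2.el6rhs.x86_64", "nsca-2.9.1-4.1.el6rhs.x86_64"):
        print(server, passive_checks_compatible(client, server))
```

Run against the package versions quoted in this report, the check rejects the original RHEL 6 server package and accepts the upgraded one.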
Ramesh, please review and sign off on the edited doc text.

The doc text is technically correct, but it is hard to follow because of repeated phrases such as "Red Hat Enterprise Linux 6 Red Hat Gluster Storage". Can we simplify it?

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2016-0310.html

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days