Bug 504158
| Summary: | fenced doesn't work with IP addresses for node names | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | David Teigland <teigland> | ||||
| Component: | cman | Assignee: | David Teigland <teigland> | ||||
| Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | low | ||||||
| Version: | 5.3 | CC: | ccaulfie, cluster-maint, cward, edamato, jkortus, lhh, m.nietz-redhat, tao | ||||
| Target Milestone: | rc | ||||||
| Target Release: | --- | ||||||
| Hardware: | All | ||||||
| OS: | Linux | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | cman-2_0_115-8_el5 | Doc Type: | Bug Fix | ||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2010-03-30 08:42:09 UTC | Type: | --- | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
David Teigland
2009-06-04 14:19:38 UTC
Left out the end of the story,
> Hm, the only other way for "averting" to happen is if it thinks the failed
> node has become a cluster member again. I wonder if those member checks are
> failing because you're using IP addresses for names (I thought IP's worked).
> Could you configure names in place of those IP's and try it?
I reinstalled the whole cluster, and uses names instead of ip's as
adviced by you.
The first Tests where all succesful !!!
I've verified the fenced bug on my own cluster. The bug may not be obvious to people using IP addresses, because fenced just silently skips fencing a failed node. It sounds like rgmanager is effected, though, probably because cman does not report the node as having been fenced. When I kill the third node I see,
# group_tool -v
type level name id state node id local_done
fence 0 default 00000000 none
[1 2]
# cman_tool nodes -f
Node Sts Inc Joined Name
1 M 472 2009-06-04 09:56:32 10.15.84.91
2 M 476 2009-06-04 09:56:47 10.15.84.92
3 X 480 10.15.84.93
Node has not been fenced since it went down
Created attachment 346556 [details]
Patch to fix
The attached patch causes the node check routine to exit if the node is an IP address and there hasn't been an exact string match.
This should stop (eg)
10.2.1.1 matching with 10.2.1.2
Committed to RHEL5 branch: Commit: f27717f5ec074b5567734d09ac04746c21fcff01 fence: Allow IP addresses as node names Also on STABLE2 & STABLE3 This is now in RHEL5.5 distcvs ~~ Attention Customers and Partners - RHEL 5.5 Beta is now available on RHN ~~ RHEL 5.5 Beta has been released! There should be a fix present in this release that addresses your request. Please test and report back results here, by March 3rd 2010 (2010-03-03) or sooner. Upon successful verification of this request, post your results and update the Verified field in Bugzilla with the appropriate value. If you encounter any issues while testing, please describe them and set this bug into NEED_INFO. If you encounter new defects or have additional patch(es) to request for inclusion, please clone this bug per each request and escalate through your support representative. Fencing + automatic service relocation are successful with node names as IP addresses and cman-2.0.115-33.el5. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2010-0266.html The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days |