Bug 456403
| Summary: | cluster will recover even if a fence device failed | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | David Juran <djuran> | ||||||
| Component: | cman | Assignee: | David Teigland <teigland> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | |||||||
| Severity: | high | Docs Contact: | |||||||
| Priority: | low | ||||||||
| Version: | 5.0 | CC: | agk, ccaulfie, clasohm, cluster-maint, fdinitto, lhh, mbroz, swhiteho | ||||||
| Target Milestone: | --- | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | All | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2009-01-20 21:53:12 UTC | Type: | --- | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Attachments: |
|
||||||||
|
Description
David Juran
2008-07-23 12:02:05 UTC
Created attachment 312467 [details]
example cluster configuration
This sounds similar to something we fixed a long time ago, will check. update_cman() is called after the first device succeeds. Ouch. Created attachment 312490 [details]
Pass 1 fixing order bug. Untested.
This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. commit in RHEL5 branch 9567fe17bf33eb0008831551b76c7f46c55ba40b I've not tested the fix yet since I don't have either a RHEL5 or STABLE2 cluster readily available. If no one else can do a quick test to verify the patch, I'll get a cluster set up. I've tested Lon's patch from #4 on my F-9 cluster (cman-2.03.05-1) and it seems to solve the issue. Also, I can confirm that the fencing agents are executed in the order they are mentioned in cluster.conf. Which is good. Since doing power-on followed by power-off is not quite the same as off followed by on (-: An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2009-0189.html |