Bug 472782
Summary: | Master in qdisk does not win and both nodes are fenced off in race condition | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | [Retired] Red Hat Cluster Suite | Reporter: | Shane Bradley <sbradley> | ||||||
Component: | cman | Assignee: | Lon Hohberger <lhh> | ||||||
Status: | CLOSED WONTFIX | QA Contact: | Cluster QE <mspqa-list> | ||||||
Severity: | medium | Docs Contact: | |||||||
Priority: | medium | ||||||||
Version: | 4 | CC: | cluster-maint, dash, iannis, jko, rbinkhor, tao | ||||||
Target Milestone: | --- | ||||||||
Target Release: | --- | ||||||||
Hardware: | All | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2010-05-11 17:05:21 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Shane Bradley
2008-11-24 16:06:38 UTC
Created attachment 324493 [details]
sosreport for node1
Created attachment 324494 [details]
sosreport for node2
First of all this is a feature request. While I believe this is a reasonable course of action, there is no current master-wins behavior in the feature set of qdiskd if no heuristics are present. The only way to do this cleanly is to interrupt the fencing operation in the non-master node. Since CMAN decides on a new membership view prior to fencing operation taking place, the only method to ensure this works is to notify qdiskd that CMAN has decided to fence and to have qdiskd do something based on: - whether or not a master exists - whether or not the other node exists, and - if a master exists, which node is master Some possible solutions as well as a workaround are here: https://bugzilla.redhat.com/show_bug.cgi?id=372901#c7 Since administrators cannot control which node is the qdiskd master (nor will this be an option), a workaround causing a node to hang will provide predictable behavior in a network partition - moreso than implementation of master-wins. https://bugzilla.redhat.com/show_bug.cgi?id=372901#c9 ^^ simple design |