Bug 183383
Summary: | mount deadlock after recovery during regression tests (2) | ||
---|---|---|---|
Product: | [Retired] Red Hat Cluster Suite | Reporter: | Chris Feist <cfeist> |
Component: | gulm | Assignee: | Chris Feist <cfeist> |
Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 4 | CC: | cluster-maint, cmarthal, nstraz |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | RHBA-2007-0145 | Doc Type: | Bug Fix |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2007-05-10 21:27:52 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Chris Feist
2006-02-28 19:18:53 UTC
Taking off blocker list, some of the issues have been fixed, but there still might be problems outstanding. I hit this today during RHEL4-U3 errata testing. I was running gulm-1.0.6-0. 2 of 3 server nodes were shot. It doesn't appear that the server that rejoined to form quorum expired the locks it had prior to being shot. I'm still hitting this in RHEL4-U4 testing. Problem occurs if you kill enough masters for the remaining gulm server to lose quorum. It then may not fence all of the killed gulm servers resulting in an inconsistent lock state. The problem can be easily fixed by fencing the lock servers that were killed but not fence previously. I'm working on a solution. I'm still hitting this in RHEL4-U4 testing. x86 cluster. Hit this over the weekend on x86_64 during the "GULM kill Master and all but one Slave" revolver senario. Devel ACK. Ok, so it appears that gulm was not properly propagating all of the slaves/clients to the slaves. This should fix one type of lockup, and hopefully the lockup that was occurring in this bug. The fix is built in gulm-1.0.9-2. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2007-0145.html |