Bug 234589
Summary: | rgmanager not working when using a quorum disk | |
---|---|---|---
Product: | Red Hat Enterprise Linux 5 | Reporter: | Robert Hell <robert.hell>
Component: | rgmanager | Assignee: | Lon Hohberger <lhh>
Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list>
Severity: | medium | Priority: | medium
Version: | 5.0 | CC: | bachmann, cluster-maint, jobot, lpleiman
Hardware: | x86_64 | OS: | Linux
Fixed In Version: | RHBA-2007-0580 | Doc Type: | Bug Fix
Last Closed: | 2007-11-07 16:45:54 UTC | |

Description
Robert Hell
2007-03-30 12:16:52 UTC
Created attachment 151269 [details]
Cluster Configuration File
Additional info:

clurgmgrd appears to be suffering the same fate as ccs_tool in bug #223519, treating the quorum disk as an actual node. When clurgmgrd first starts, it attempts to contact the quorum disk "node" to determine the status of the services it's running. This times out, causing an "abort":

[12453] info: State change: Local UP
[12453] info: State change: sys-b UP
[12453] info: State change: /dev/dm-3 UP #Note: Quorum Disk
... aight, need responses from 3 guys
VF: Push 2.12453 #1 (X#00020001)
VF: Checking for consensus...
...
VF: YES
VF: YES
VF: Timed out waiting for 1 responses
VF: Broadcasting ABORT (X#00020002)
VF: Aborted!

I was able to construct a proof of concept by adding code to rgmanager/src/daemons/main.c:membership_update() that sets cn_member to 0 for the cml_members element which has a cn_nodeid of 0. Afterwards, the resource manager appears to function as expected. Additionally, clustat no longer hangs with a "Timed out waiting for a response from Resource Group Manager" message.

I hope that this information assists in leading to a proper patch, as mine was a rather brute-force solution.

Created attachment 152699 [details]
Fix fix
Hi, this should fix it.
Actually, it sounds like exactly what you did, but in a different location. ;)

Thanks for that! Will there be an official errata for this problem?

I can't confirm one way or the other at this point, but it looks like it will be in update 1 for certain.

Fixing Product Name. Cluster Suite was integrated into Enterprise Linux for version 5.0.

This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release.

Hi! Do you have any news on whether this fix will be in an upcoming errata or in the next Update for RHEL5?
Regards, Robert

Update 1 for RHEL5 :)

lpleiman

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2007-0580.html
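
For readers following the thread, the proof-of-concept workaround described in the report (clearing cn_member for the membership entry whose cn_nodeid is 0) might look roughly like the sketch below. This is not the attached patch and not the actual rgmanager code. The struct layouts, the cml_count and cn_name fields, and the helper names mask_quorum_disk() and count_members() are assumptions made for illustration; only cn_nodeid, cn_member and cml_members come from the report.

```c
#include <stddef.h>

/* Simplified, hypothetical stand-ins for the real membership types.
 * Only cn_nodeid, cn_member and cml_members are named in the report;
 * cn_name, cml_count and the fixed array size are assumptions. */
struct node {
    int  cn_nodeid;    /* 0 is the quorum disk pseudo-node in the report */
    int  cn_member;    /* nonzero if the entry counts as a cluster member */
    char cn_name[64];
};

struct member_list {
    int         cml_count;
    struct node cml_members[8];
};

/* Clear the member flag on every entry with node ID 0 so that later
 * code does not wait for a reply from the quorum disk.
 * Returns the number of entries that were changed. */
static int mask_quorum_disk(struct member_list *ml)
{
    int i, changed = 0;

    if (ml == NULL)
        return 0;

    for (i = 0; i < ml->cml_count; i++) {
        if (ml->cml_members[i].cn_nodeid == 0 &&
            ml->cml_members[i].cn_member != 0) {
            ml->cml_members[i].cn_member = 0;
            changed++;
        }
    }
    return changed;
}

/* Count the entries still flagged as members, i.e. the number of
 * responses a consensus round would have to wait for. */
static int count_members(const struct member_list *ml)
{
    int i, n = 0;

    if (ml == NULL)
        return 0;

    for (i = 0; i < ml->cml_count; i++) {
        if (ml->cml_members[i].cn_member != 0)
            n++;
    }
    return n;
}
```

With a list holding two real nodes plus a /dev/dm-3 entry at node ID 0, all three flagged as members, calling mask_quorum_disk() would leave two members, which matches the two "VF: YES" replies in the log above.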