Bug 314641
Summary: | CMAN dies after qdiskd calls cman_poll_quorum_device() | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Lon Hohberger <lhh> | ||||
Component: | openais | Assignee: | Steven Dake <sdake> | ||||
Status: | CLOSED ERRATA | QA Contact: | GFS Bugs <gfs-bugs> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 5.1 | CC: | cluster-maint, gustavo.prada, h.plankl, pkennedy, rkenna, sdake | ||||
Target Milestone: | --- | Keywords: | Regression | ||||
Target Release: | --- | ||||||
Hardware: | All | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | RHBA-2007-0599 | Doc Type: | Bug Fix | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2007-11-07 17:00:18 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 253836 | ||||||
Attachments: |
|
Description
Lon Hohberger
2007-10-01 19:56:00 UTC
Created attachment 212761 [details]
Basic debugging (thread backtrace from aisexec on all nodes, qdisk backtrace on all nodes but 1, cman_tool backtrace)
It looks to me, like aisexec is deadlocking because it doesn't like me calling openais_timer_add_duration() whilst in a timer callback function. Is there another way of doing a repeating timer ? ahhh, there seems to be a bug :-) According to the man page: The pthread_equal() function shall return a non-zero value if t1 and t2 are equal; otherwise, zero shall be returned. So I think we need to do this in openais_timer_add_absolute() and openais_timer_add_duration() - if (pthread_equal (pthread_self(), expiry_thread) == 0) { + if (pthread_equal (pthread_self(), expiry_thread) != 0) { [root@tng3-5 ~]# clustat Member Status: Quorate Member Name ID Status ------ ---- ---- ------ tng3-1 1 Online tng3-2 2 Online tng3-3 3 Online tng3-5 5 Online, Local /dev/sdd1 0 Online, Quorum Disk [root@tng3-5 ~]# rpm -q openais openais-0.80.3-6.el5 Preliminary tests pass. [root@tng3-5 ~]# cman_tool status Version: 6.0.1 Config Version: 9 Cluster Name: tng3-cluster Cluster Id: 41908 Cluster Member: Yes Cluster Generation: 2448 Membership state: Cluster-Member Nodes: 4 Expected votes: 4 Total votes: 7 Quorum: 4 Active subsystems: 9 Flags: Ports Bound: 0 11 177 Node name: tng3-5 Node ID: 5 Multicast addresses: 239.192.163.88 Node addresses: 10.15.89.178 [root@tng3-5 ~]# cman_tool members cman_tool: unknown option members [root@tng3-5 ~]# cman_tool nodes Node Sts Inc Joined Name 0 M 0 2007-10-02 12:15:25 /dev/sdd1 1 M 2448 2007-10-02 12:15:11 tng3-1 2 M 2448 2007-10-02 12:15:11 tng3-2 3 M 2448 2007-10-02 12:15:11 tng3-3 5 M 2444 2007-10-02 12:15:11 tng3-5 going to do some functional tests. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2007-0599.html *** Bug 253836 has been marked as a duplicate of this bug. *** *** Bug 431382 has been marked as a duplicate of this bug. *** |