Bug 129445 - BUG() on joining
Summary: BUG() on joining
Keywords:
Status: CLOSED WORKSFORME
Alias: None
Product: Red Hat Cluster Suite
Classification: Retired
Component: gfs
Version: 4
Hardware: i686
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Christine Caulfield
QA Contact: GFS Bugs
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2004-08-09 09:40 UTC by Christine Caulfield
Modified: 2010-01-12 02:56 UTC (History)
0 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2004-09-09 10:03:53 UTC
Embargoed:


Attachments (Terms of Use)

Description Christine Caulfield 2004-08-09 09:40:20 UTC
Reported by Lazar Obradovic <laza>

CMAN: Waiting to join or form a Linux-cluster
CMAN: sending membership request
CMAN: got node new-noc
Got ENDTRANS from a node not the master: master: 6, sender: 1
CMAN: node new-noc is not responding - removing from the cluster
------------[ cut here ]------------
kernel BUG at /usr/src/cvs/cluster/cman-kernel/src/membership.c:2892!
invalid operand: 0000 [#1]
PREEMPT SMP
Modules linked in: ipv6 qla2300 qla2xxx ohci_hcd gfs lock_dlm
lock_harness dlm cman
CPU:    2
EIP:    0060:[<f885c669>]    Tainted: GF
EFLAGS: 00010246   (2.6.7-gentoo-r11)
EIP is at elect_master+0x2a/0x41 [cman]
eax: 00000080   ebx: 00000080   ecx: f88a4000   edx: 00000000
esi: f8870c08   edi: f8870c00   ebp: f7139fc0   esp: f7139f90
ds: 007b   es: 007b   ss: 0068
Process cman_memb (pid: 7327, threadinfo=f7138000 task=c22ed1e0)
Stack: f7afdb28 f8859d34 f7139fa4 00000001 f8858883 f7bf7494 fffffffb
00000000
       f7138000 0000001f 00000000 c0103fb6 00000000 c22ed1e0 c01176e2
00100100
       00200200 00000000 00000000 00000000 f88584d8 00000000 00000000
00000000
Call Trace:
 [<f8859d34>] a_node_just_died+0x130/0x181 [cman]
 [<f8858883>] membership_kthread+0x3ab/0x3e4 [cman]
 [<c0103fb6>] ret_from_fork+0x6/0x14
 [<c01176e2>] default_wake_function+0x0/0x12
 [<f88584d8>] membership_kthread+0x0/0x3e4 [cman]
 [<c01022a1>] kernel_thread_helper+0x5/0xb

Code: 0f 0b 4c 0b a0 51 86 f8 31 c0 5b c3 8b 44 24 08 89 10 8b 42


Version-Release number of selected component (if applicable):


How reproducible:
Didn't try

This maybe specific to his multicast setup, I'm not sure.

Comment 1 Christine Caulfield 2004-09-09 10:03:53 UTC
email from  Lazar:

"Well, I haven't been able to reproduce that any longer. It seems it's
two-node related, since it stopped happening when I added more nodes. As
far as I'm concerned, you may close the bug, and someone else might
reopen it if this is still happening..."


Comment 2 Kiersten (Kerri) Anderson 2004-11-16 19:13:46 UTC
Updating version to the right level in the defects.  Sorry for the storm.


Note You need to log in before you can comment on or make changes to this bug.