Bug 473953

Summary: error message from kernel from dlm
Product: Red Hat Enterprise Linux 5
Reporter: Baptiste Mille-Mathias <baptiste.millemathias>
Component: kernel
Assignee: David Teigland <teigland>
Status: CLOSED NEXTRELEASE
QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: medium
Priority: low
Version: 5.2
CC: cluster-maint, edamato
Target Milestone: rc
Hardware: All
OS: Linux
Doc Type: Bug Fix
Last Closed: 2008-12-02 19:38:19 UTC

Description Baptiste Mille-Mathias 2008-12-01 16:08:40 UTC
Description of problem:
I saw this error in the system log while testing multipath failover to my SAN by removing some paths (fiber, fiber switches, ...).

Dec  1 11:53:17 sa-nas001-wb3 kernel: 
Dec  1 11:53:17 sa-nas001-wb3 kernel: DLM:  Assertion failed on line 3487 of file fs/dlm/lock.c
Dec  1 11:53:17 sa-nas001-wb3 kernel: DLM:  assertion:  "is_process_copy(lkb)"
Dec  1 11:53:17 sa-nas001-wb3 kernel: DLM:  time = 4298675660
Dec  1 11:53:17 sa-nas001-wb3 kernel: lkb: nodeid 0 id 1d50001 remid 17300c1 exflags 4 flags 0
Dec  1 11:53:17 sa-nas001-wb3 kernel:      status 2 rqmode -1 grmode 5 wait_type 0 ast_type 0
Dec  1 11:53:17 sa-nas001-wb3 kernel: 
Dec  1 11:53:17 sa-nas001-wb3 kernel: ----------- [cut here ] --------- [please bite here ] ---------
Dec  1 11:53:17 sa-nas001-wb3 kernel: Kernel BUG at fs/dlm/lock.c:3487
Dec  1 11:53:17 sa-nas001-wb3 kernel: invalid opcode: 0000 [1] SMP 
Dec  1 11:53:17 sa-nas001-wb3 kernel: last sysfs file: /kernel/dlm/WEB_data/control

Version-Release number of selected component (if applicable):
The kernel is the latest one available for RHEL 5.2: 2.6.18-92.1.18.el5

How reproducible:
I don't know unfortunately :/

Additional info:
Installation architecture is x86_64

Comment 1 David Teigland 2008-12-02 19:38:19 UTC
The multipath events do not have a direct relationship with the dlm,
but gfs and cluster events do.  I expect that a larger section of
/var/log/messages will show a gfs or cluster event at about this same
time.
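
One way to look for such an event is to filter the log for cluster-related lines close to the assertion timestamp. The following is only a sketch: the keyword list, the 60-second window, and the helper names parse_syslog_time and events_near are assumptions made for illustration, not part of any Red Hat tool.

```python
from datetime import datetime, timedelta

# Assumed keywords; adjust to the daemons and modules used on the cluster.
KEYWORDS = ("gfs", "dlm", "fenced", "openais", "cman")


def parse_syslog_time(line, year):
    """Parse a traditional syslog timestamp such as 'Dec  1 11:53:17'.

    Syslog omits the year, so the caller supplies it.
    Returns None for lines that do not start with a timestamp.
    """
    parts = line.split(None, 3)
    if len(parts) < 3:
        return None
    try:
        return datetime.strptime(
            f"{year} {parts[0]} {parts[1]} {parts[2]}", "%Y %b %d %H:%M:%S"
        )
    except ValueError:
        return None


def events_near(lines, when, window_seconds=60, keywords=KEYWORDS):
    """Return log lines within window_seconds of `when` that mention a keyword."""
    window = timedelta(seconds=window_seconds)
    hits = []
    for line in lines:
        stamp = parse_syslog_time(line, when.year)
        if stamp is None or abs(stamp - when) > window:
            continue
        if any(word in line.lower() for word in keywords):
            hits.append(line.rstrip("\n"))
    return hits
```

To use it, pass the lines of /var/log/messages (reading that file usually requires root) together with datetime(2008, 12, 1, 11, 53, 17), the time of the assertion in the description.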

This will be fixed in RHEL5.3 by one of the patches added for bug 450132;
in particular the patches with subjects:

dlm: reject messages from non-members
dlm: validate messages before processing
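
As a rough illustration of the idea behind those two subjects, the toy model below checks a reply before acting on it instead of asserting afterwards. It is not the kernel code: the Lkb class, the IFL_MSTCPY value, the accept_reply function, and the definition of is_process_copy are all assumptions modelled on the names that appear in the log above.

```python
from dataclasses import dataclass

IFL_MSTCPY = 0x1  # hypothetical flag value marking a master copy


@dataclass
class Lkb:
    nodeid: int      # 0 means the lock is mastered locally
    flags: int = 0


def is_process_copy(lkb):
    # Assumed meaning: the lock is mastered on another node and this
    # structure is not that master's copy.
    return lkb.nodeid != 0 and not (lkb.flags & IFL_MSTCPY)


def accept_reply(lkb, sender, members):
    """Decide whether a reply from `sender` may be applied to `lkb`."""
    if sender not in members:
        return False  # reject messages from non-members
    if not is_process_copy(lkb):
        return False  # a reply only makes sense for a process copy
    if lkb.nodeid != sender:
        return False  # the reply must come from the lock's master
    return True
```

With the values from the log (nodeid 0, flags 0), accept_reply returns False, so in this model the unexpected message is dropped rather than triggering the is_process_copy assertion.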