Bug 672128

Summary: [Kdump] NMI Watchdog detected LOCKUP on CPU 5
Product: Red Hat Enterprise Linux 5 Reporter: Chao Ye <cye>
Component: kernel Assignee: Cong Wang <amwang>
Status: CLOSED WORKSFORME QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: medium Docs Contact:
Priority: medium    
Version: 5.8 CC: amwang, rkhan, tgraf
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-08-07 02:37:12 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---

Description Chao Ye 2011-01-24 03:55:04 UTC
Description of problem:
dell-pe2900-02.rhts.eng.bos.redhat.com login: NMI Watchdog detected LOCKUP on5
CPU 5 
Modules linked in: lkdtm(U) autofs4 hidp rfcomm l2cap bluetooth lockd sunrpc d
Pid: 4077, comm: beah-fwd-backen Tainted: G      2.6.18-238.el5 #1
RIP: 0010:[<ffffffff886cd12a>]  [<ffffffff886cd12a>] :lkdtm:lkdtm_handler+0xbd
RSP: 0018:ffff81006cc41b28  EFLAGS: 00000082
RAX: 0000000000000010 RBX: 0000000000000000 RCX: 0000000000000086
RDX: 00000000ffffffff RSI: 0000000000000000 RDI: ffffffff80319f5c
RBP: 0000000000000000 R08: ffffffff80319f28 R09: 0000000000000001
R10: 0000000000000000 R11: 0000000000000080 R12: 0000000000000000
R13: 0000000000000000 R14: 0000000000000000 R15: ffff810064f4ccf0
FS:  00002ab78373a1e0(0000) GS:ffff810037c1d540(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00002ab78711e000 CR3: 000000006d650000 CR4: 00000000000006e0
Process beah-fwd-backen (pid: 4077, threadinfo ffff81006cc40000, task ffff810)
Stack:  0000000000000000 ffffffff886cd1a2 ffff810064f4ccf0 ffffffff880530ef
 ffff81006cc41cb0 ffff81006d733300 0000000900e5000f ffff81007ee9bc00
 000000007b86a858 0000000100000000 ffff810000000001 ffff81007b86a858
Call Trace:
 [<ffffffff886cd1a2>] :lkdtm:jp_ll_rw_block+0x9/0x10
 [<ffffffff880530ef>] :ext3:ext3_find_entry+0x362/0x575
 [<ffffffff80012ef9>] __do_page_cache_readahead+0x5d/0x179
 [<ffffffff8012d28b>] avc_has_perm+0x46/0x58
 [<ffffffff880549ba>] :ext3:ext3_lookup+0x33/0x162
 [<ffffffff8000d008>] do_lookup+0xe5/0x1e6
 [<ffffffff8000a2c5>] __link_path_walk+0xa2a/0xfb9
 [<ffffffff8000ea74>] link_path_walk+0x42/0xb2
 [<ffffffff8000cda3>] do_path_lookup+0x275/0x2f1
 [<ffffffff80012898>] getname+0x15b/0x1c2
 [<ffffffff800239b0>] __user_walk_fd+0x37/0x4c
 [<ffffffff800288aa>] vfs_stat_fd+0x1b/0x4a
 [<ffffffff800236e2>] sys_newstat+0x19/0x31
 [<ffffffff8005d229>] tracesys+0x71/0xe0
 [<ffffffff8005d28d>] tracesys+0xd5/0xe0


Code: eb fe 48 c7 c7 87 d5 6c 88 31 c0 e8 3c 67 9c f7 31 ff e8 ed
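
The Code: bytes above can be disassembled offline to see what the CPU was executing. The following is a minimal sketch, assuming the dump begins at the RIP shown in the trace (not confirmed by this report) and that binutils objdump is installed; the helper names hex_to_bin and disassemble are hypothetical. The first two bytes, eb fe, encode a short jump with displacement -2, that is, a jump to its own address.

```shell
#!/bin/sh
# Bytes copied verbatim from the "Code:" line of the trace.
CODE_BYTES="eb fe 48 c7 c7 87 d5 6c 88 31 c0 e8 3c 67 9c f7 31 ff e8 ed"

# Hypothetical helper: write space-separated hex bytes to stdout as raw binary.
hex_to_bin() {
    for b in $1; do
        # printf accepts octal escapes portably, so convert each hex byte to octal first.
        printf "\\$(printf '%03o' "0x$b")"
    done
}

# Hypothetical helper: disassemble the bytes as 64-bit x86 code.
# Assumes objdump from binutils is on PATH.
disassemble() {
    tmp=$(mktemp) || return 1
    hex_to_bin "$1" > "$tmp"
    objdump -D -b binary -m i386:x86-64 "$tmp"
    rm -f "$tmp"
}
```

Calling disassemble "$CODE_BYTES" prints the listing. The last instruction may be shown incomplete because the dump is cut off after 20 bytes.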

Version-Release number of selected component (if applicable):
RHEL5-Server-U6
kernel-2.6.18-238.el5.x86_64
kexec-tools-1.102pre-126.el5.x86_64

How reproducible:


Steps to Reproduce:
1. wget http://porkchop.devel.redhat.com/qa/rhts/lookaside/ltp-kdump-20080228.tar.gz
2. tar zxvf ltp-kdump-20080228.tar.gz
3. cd kdump
4. export USE_SYMBOL_NAME=1
5. make
6. insmod lkdtm.ko cpoint_name=INT_TASKLET_ENTRY cpoint_type=BUG cpoint_count=10
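
The steps above can be collected into one script. This is a rough sketch only: the URL and module parameters are copied from the steps, while the helper names lkdtm_insmod_cmd and reproduce and the RUN_REPRO guard variable are hypothetical additions. It should be run as root on a disposable test machine, since loading the module is meant to crash the kernel.

```shell
#!/bin/sh
# Tarball location as given in the reproduction steps.
LTP_KDUMP_URL="http://porkchop.devel.redhat.com/qa/rhts/lookaside/ltp-kdump-20080228.tar.gz"

# Hypothetical helper: print the insmod command line for a crash point.
# Arguments: crash point name, crash point type, hit count.
lkdtm_insmod_cmd() {
    printf 'insmod lkdtm.ko cpoint_name=%s cpoint_type=%s cpoint_count=%s\n' "$1" "$2" "$3"
}

# Hypothetical helper: fetch, build, and load the module, stopping at the first failure.
reproduce() {
    wget "$LTP_KDUMP_URL" &&
    tar zxvf "$(basename "$LTP_KDUMP_URL")" &&
    cd kdump &&
    export USE_SYMBOL_NAME=1 &&
    make &&
    $(lkdtm_insmod_cmd INT_TASKLET_ENTRY BUG 10)
}

# The destructive steps run only when explicitly requested (RUN_REPRO is an assumed guard).
if [ "${RUN_REPRO:-0}" = 1 ]; then
    reproduce
fi
```

Running it with RUN_REPRO=1 executes all six steps; without that variable the script only defines the helpers.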
  
Actual results:
The machine got stuck.

Expected results:
The test passes.

Additional info:
This hang was found while I was trying to reproduce Bug 435698 on dell-pe2900-02.rhts.eng.bos.redhat.com.
The original reproducer is:
https://bugzilla.redhat.com/show_bug.cgi?id=435698#c9
The original bug is:
https://bugzilla.redhat.com/show_bug.cgi?id=435698

Comment 1 Neil Horman 2011-01-31 14:50:25 UTC
Triage assignment. If you feel this bug doesn't belong to you, or that it cannot be handled in a timely fashion, please contact me for reassignment.

Comment 2 Cong Wang 2011-02-15 06:43:29 UTC
I can't reproduce it on dell-pe2900-02.rhts.eng.bos.redhat.com; the second kernel starts successfully after inserting that module.

Comment 3 Cong Wang 2012-08-07 02:37:12 UTC
Closing this now, based on comment #2.