Bug 170893
Summary: | bug in checkpoint.c causes system panic - __journal_remove_checkpoint | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Sean Plaice <splaice> |
Component: | kernel | Assignee: | Eric Sandeen <esandeen> |
Status: | CLOSED NOTABUG | QA Contact: | Brian Brock <bbrock> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 4.0 | CC: | bugzilla, jbaron, jwest, rwheeler, sct |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i386 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2010-06-07 04:52:31 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Sean Plaice
2005-10-15 04:40:49 UTC
This looks like something for which you would be best off going through our support. In order to do this, please either contact Red Hat's Technical Support line at 888-GO-REDHAT or file a web ticket at http://www.redhat.com/apps/support/. Bugzilla is not an official support channel, has no response guarantees, and may not route your issue to the correct area to assist you. Using the official support channels above will guarantee that your issue is handled appropriately and routed to the individual or group which can best assist you with this issue and will also allow Red Hat to track the issue, ensuring that any applicable bug fix is included in all releases and is not dropped from a future update or major release. I have filed a support request to follow up on the problem via that channel. This problem appears to still be occuring with 2.6.9-22 kernel. Though it fails to provide the same details in the system messages log. The server is in a remote location, and will not have a serial console till next week to capture the complete panic log from the console. We _seem_ to be affected by the same problem on a different architecture (i386), the following error: Feb 3 04:53:51 ebsdb kernel: Debug: sleeping function called from invalid context at include/linux/rwsem.h:43 Feb 3 04:53:51 ebsdb kernel: in_atomic():0[expected: 0], irqs_disabled():1 Feb 3 04:53:51 ebsdb kernel: [<02120c1d>] __might_sleep+0x7d/0x88 Feb 3 04:53:51 ebsdb kernel: [<0215796c>] rw_vm+0xe4/0x29c Feb 3 04:53:51 ebsdb kernel: [<02131675>] find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: [<02131675>] find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: [<02157de3>] get_user_size+0x30/0x57 Feb 3 04:53:51 ebsdb kernel: [<02131675>] find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: [<0211b5c4>] __is_prefetch+0x1d5/0x2ba Feb 3 04:53:51 ebsdb kernel: [<02138ba8>] search_module_extables+0x5d/0x64 Feb 3 04:53:51 ebsdb kernel: [<02131675>] find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: [<0211b9f9>] do_page_fault+0x350/0x5f7 Feb 3 04:53:51 ebsdb kernel: [<022d43d9>] __cond_resched+0x14/0x39 Feb 3 04:53:51 ebsdb kernel: [<021442b9>] rmqueue_bulk+0x5b/0x65 Feb 3 04:53:51 ebsdb kernel: [<02144648>] buffered_rmqueue+0x17d/0x1a5 Feb 3 04:53:51 ebsdb kernel: [<0211b6a9>] do_page_fault+0x0/0x5f7 Feb 3 04:53:51 ebsdb kernel: [<02131675>] find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: [<02131803>] find_task_by_pid_type+0x8/0x1d Feb 3 04:53:51 ebsdb kernel: [<0211e04c>] sched_exit+0x1d/0xbc Feb 3 04:53:51 ebsdb kernel: [<021241ca>] release_task+0xb6/0xfa Feb 3 04:53:51 ebsdb kernel: [<02125d5c>] wait_task_zombie+0x475/0x48b Feb 3 04:53:51 ebsdb kernel: [<021262fd>] do_wait+0x183/0x3b8 Feb 3 04:53:51 ebsdb kernel: [<0211f28b>] default_wake_function+0x0/0xc Feb 3 04:53:51 ebsdb kernel: [<0212dfb9>] sys_rt_sigaction+0x73/0x88 Feb 3 04:53:51 ebsdb kernel: [<0211f28b>] default_wake_function+0x0/0xc Feb 3 04:53:51 ebsdb kernel: [<021265c5>] sys_wait4+0x27/0x2a Feb 3 04:53:51 ebsdb kernel: [<021265db>] sys_waitpid+0x13/0x17 Feb 3 04:53:51 ebsdb kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000000 Feb 3 04:53:51 ebsdb kernel: printing eip: Feb 3 04:53:51 ebsdb kernel: 02131675 Feb 3 04:53:51 ebsdb kernel: *pde = 00004001 Feb 3 04:53:51 ebsdb kernel: Oops: 0000 [#1] Feb 3 04:53:51 ebsdb kernel: SMP Feb 3 04:53:51 ebsdb kernel: Modules linked in: mptctl mptbase hpilo(U) nfsd exportfs autofs4 nfs lockd nfs_acl sunrpc 8021q dm_mirror dm_round_robin dm_multipath button battery ac ohci_hcd hw_random k8_edac edac_mc tg3 bonding(U) floppy sg ext3 jbd dm_mod cciss sd_mod qla2xxx(U) scsi_mod qla2xxx_conf(U) Feb 3 04:53:51 ebsdb kernel: CPU: 0 Feb 3 04:53:51 ebsdb kernel: EIP: 0060:[<02131675>] Not tainted VLI Feb 3 04:53:51 ebsdb kernel: EFLAGS: 00010086 (2.6.9-89.0.18.ELhugemem) Feb 3 04:53:51 ebsdb kernel: EIP is at find_pid+0x26/0x3a Feb 3 04:53:51 ebsdb kernel: eax: 0f1e1000 ebx: 00002fcf ecx: 00000000 edx: c1e7586c Feb 3 04:53:51 ebsdb kernel: esi: f3581430 edi: 00000000 ebp: c1259ed0 esp: c1259eac Feb 3 04:53:51 ebsdb kernel: ds: 007b es: 007b ss: 0068 Feb 3 04:53:51 ebsdb kernel: Process hpetfe (pid: 12239, threadinfo=c1259000 task=c1e757b0) Feb 3 04:53:51 ebsdb kernel: Stack: 00000000 02131803 f3581430 0211e04c f3581430 f3581430 f3581430 f3581430 Feb 3 04:53:51 ebsdb kernel: 00000000 00000000 021241ca f3581430 00002fd3 00000000 00000000 02125d5c Feb 3 04:53:51 ebsdb kernel: 03000000 00000000 00000003 00000000 a0ff8080 0011a6e2 39e805b0 c1e757b0 Feb 3 04:53:51 ebsdb kernel: Call Trace: Feb 3 04:53:51 ebsdb kernel: [<02131803>] find_task_by_pid_type+0x8/0x1d Feb 3 04:53:51 ebsdb kernel: [<0211e04c>] sched_exit+0x1d/0xbc Feb 3 04:53:51 ebsdb kernel: [<021241ca>] release_task+0xb6/0xfa Feb 3 04:53:51 ebsdb kernel: [<02125d5c>] wait_task_zombie+0x475/0x48b Feb 3 04:53:51 ebsdb kernel: [<021262fd>] do_wait+0x183/0x3b8 Feb 3 04:53:51 ebsdb kernel: [<0211f28b>] default_wake_function+0x0/0xc Feb 3 04:53:51 ebsdb kernel: [<0212dfb9>] sys_rt_sigaction+0x73/0x88 Feb 3 04:53:51 ebsdb kernel: [<0211f28b>] default_wake_function+0x0/0xc Feb 3 04:53:51 ebsdb kernel: [<021265c5>] sys_wait4+0x27/0x2a Feb 3 04:53:51 ebsdb kernel: [<021265db>] sys_waitpid+0x13/0x17 Feb 3 04:53:51 ebsdb kernel: Code: c8 ff 5b 5e c3 53 b9 20 00 00 00 8b 04 85 84 fe 43 02 2b 0d 94 fe 43 02 89 d3 69 d2 01 00 37 9e d3 ea 8b 14 90 85 d2 74 12 8b 0a <0f> 18 01 90 39 5a fc 8d 42 fc 74 06 89 ca eb ea 31 c0 5b c3 55 Feb 3 04:53:51 ebsdb kernel: <0>Fatal exception: panic in 5 seconds This issue has occurred several times in the past months (since the server was re-installed with RHEL4 instead of RHEL3). This is a HP Proliant DL585 G1 server (RHEL 4 update 9) with the following kernel: Linux ebsdb 2.6.9-89.0.18.ELhugemem #1 SMP Wed Nov 25 06:13:02 EST 2009 i686 athlon i386 GNU/Linux Further specs: 2 dual-core AMD Opteron 848 processors, 24GB memory (24 * 1GB dimms, ECC). Please ignore the comment above, I meant to attach the comment to https://bugzilla.redhat.com/show_bug.cgi?id=175189 . |