Bug 603620
Summary: | Unable to handle kernel paging request in __d_rehash | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Wade Mealing <wmealing> |
Component: | kernel | Assignee: | Larry Woodman <lwoodman> |
Status: | CLOSED NOTABUG | QA Contact: | Red Hat Kernel QE team <kernel-qe> |
Severity: | medium | Docs Contact: | |
Priority: | low | ||
Version: | 5.5 | CC: | charlieb-fedora-bugzilla, pasteur, sdodson, tao |
Target Milestone: | rc | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2011-08-12 01:04:13 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Wade Mealing
2010-06-14 07:49:09 UTC
Kernel: 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT 2010 x86_64 x86_64 x86_64 Red Hat Enterprise Linux version: Red Hat Enterprise Linux Server release 5.5 (Tikanga) CPU model: Intel(R) Xeon(R) CPU L5638 @ 2.00GHz Memory: 32882100 kB This request was evaluated by Red Hat Product Management for inclusion in Red Hat Enterprise Linux 5.6 and Red Hat does not plan to fix this issue the currently developed update. Contact your manager or support representative in case you need to escalate this bug. This request was evaluated by Red Hat Product Management for inclusion in Red Hat Enterprise Linux 5.7 and Red Hat does not plan to fix this issue the currently developed update. Contact your manager or support representative in case you need to escalate this bug. (In reply to comment #2) > Kernel: > 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT 2010 x86_64 x86_64 x86_64 I've had a report of what could be the same problem. I'll attach a screenshot (which is all I have). Call trace was: d_rehash+0x21/0x34 create_write_pipe+0x155/0x1dc do_sigaction+0x76/0x199 do_pipe+0x16/0xee sys_pipe+0x13/0x4e sysenter_do_call+0x1e/0x76 RIP was: __d_rehash+0x18/0x20 Kernel was 2.6.18-194.17.1.el5 (CentOS compiled). System has four six-core Xeon E7- 4820, with 64G. Let me know if there's anything I can do to help. (In reply to comment #9) > Kernel was 2.6.18-194.17.1.el5 (CentOS compiled). System has four six-core Xeon > E7- 4820, with 64G. Sorry, dual six-core, with HT, showing 32 CPUs. Here's more information from syslog: Aug 9 11:44:30 micd-01 kernel: Unable to handle kernel paging request at 000000007f873133 RIP: Aug 9 11:44:30 micd-01 kernel: [<ffffffff8003a0cf>] __d_rehash+0x18/0x20 Aug 9 11:44:30 micd-01 kernel: PGD 1063c28067 PUD 0 Aug 9 11:44:30 micd-01 kernel: Oops: 0002 [1] SMP Aug 9 11:44:30 micd-01 kernel: last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus Aug 9 11:44:30 micd-01 kernel: CPU 9 Aug 9 11:44:30 micd-01 kernel: Modules linked in: tun bonding ipv6 xfrm_nalgo crypto_api xt_multiport xt_connmark xt_CONNMARK ipt_REJECT ipt_M ASQUERADE xt_state ipt_TOS xt_tcpudp ip_nat_ftp ip_conntrack_ftp iptable_mangle iptable_nat ip_nat ip_conntrack nfnetlink iptable_filter ip_tab les ipt_ULOG x_tables loop dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec i2c_core dell_wmi wmi button battery asus_acpi acp i_memhotplug ac parport_pc lp parport sr_mod cdrom joydev sg bnx2 serio_raw pcspkr dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage ata_ piix libata shpchp megaraid_sas sd_mod scsi_mod raid1 ext3 jbd uhci_hcd ohci_hcd ehci_hcd Aug 9 11:44:30 micd-01 kernel: Pid: 19291, comm: calctimezoneoff Not tainted 2.6.18-194.17.1.el5 #1 Aug 9 11:44:30 micd-01 kernel: RIP: 0010:[<ffffffff8003a0cf>] [<ffffffff8003a0cf>] __d_rehash+0x18/0x20 Aug 9 11:44:30 micd-01 kernel: RSP: 0018:ffff8110650fbeb0 EFLAGS: 00010206 Aug 9 11:44:30 micd-01 kernel: RAX: 000000007f87312b RBX: ffff81106a14ab70 RCX: 0000000000000017 Aug 9 11:44:30 micd-01 kernel: RDX: ffff81106a14ab88 RSI: ffff810004013000 RDI: ffff81106a14ab70 Aug 9 11:44:30 micd-01 kernel: RBP: ffff81107d55a910 R08: 00000000ffffffff R09: 0000000000000020 Aug 9 11:44:30 micd-01 kernel: R10: 0000000000000000 R11: ffffffff80127cd8 R12: ffff81107ce38c80 Aug 9 11:44:30 micd-01 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: ffff8110650fbf58 Wade, did you do a memory test? Have you learnt any more since June? This crash on a gentoo system (EIP is at __d_rehash+0x1c/0x30) looks like it was a memory issue: http://forums.gentoo.org/viewtopic-t-438365-start-0.html Charlie: According to the customer -- After a BIOS upgrade including new Intel microcode, the crash does not occur anymore. Thanks for your assistance. -- Because this did not happen in the same make/model machine in the labs with any version, the information is inconclusive. Maybe a bios update may solve the problem for you, however the bios update prevented this corruption from happening and solved the problem for the customer. Thanks. |