475843 – kdump boot hangs in msleep on several HP XW systems

Bug 475843 - kdump boot hangs in msleep on several HP XW systems

Summary: kdump boot hangs in msleep on several HP XW systems

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 5
Classification:	Red Hat
Component:	kexec-tools
Sub Component:
Version:	5.3
Hardware:	x86_64
OS:	Linux
Priority:	urgent
Severity:	urgent
Target Milestone:	rc
Target Release:	---
Assignee:	Neil Horman
QA Contact:	Martin Jenner
Docs Contact:
URL:
Whiteboard:
Duplicates (2):	471065 475498 (view as bug list)
Depends On:
Blocks:	479811
TreeView+	depends on / blocked

Reported:	2008-12-10 18:57 UTC by Doug Chapman
Modified:	2009-09-02 09:12 UTC (History)
CC List:	8 users (show)
Fixed In Version:
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2009-09-02 09:12:17 UTC
Target Upstream Version:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
patch to pass all reserved regions to kdump kernel (2.29 KB, patch) 2008-12-12 23:18 UTC, Doug Chapman	no flags	Details \| Diff
View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHBA-2009:1258	0	normal	SHIPPED_LIVE	kexec-tools bug fix and enhancement update	2009-09-01 09:09:40 UTC

Description Doug Chapman 2008-12-10 18:57:05 UTC

Description of problem:

during a kdump the new kernel hangs at:
Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled

I see this on hp-xw8600-01 and several others.  I will try to get a more complete list and include that info here.

it appears to hang at this line in autoconfig_irq()

        /* forget possible initially masked and pending IRQ */
        probe_irq_off(probe_irq_on());



Version-Release number of selected component (if applicable):
kernel-2.6.18-125  (probably all RHEL5.X kernels)

How reproducible:
100%

Steps to Reproduce:
1. try kdump with serial console on hp-xw8600-01
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Doug Chapman 2008-12-10 20:37:04 UTC

I tracked this down a little deeper.  The hang happens here:

kernel/irq/autoprobe.c:64


     63         /* Wait for longstanding interrupts to trigger. */
     64         msleep(20);


Once I comment out this msleep and another msleep later at line 86 kdump then works just fine.

So, now to figure out why msleep hangs, I am guessing something is not initialized correctly with the timers.  Note that I am using a -125 kernel so this does not have the code that disables HPET on shutdown.  Using that patch does not appear to make any difference for this issue.

Comment 2 Neil Horman 2008-12-12 20:34:25 UTC

*** Bug 473404 has been marked as a duplicate of this bug. ***

Comment 3 Doug Chapman 2008-12-12 20:39:38 UTC

After a lot more digging I have found the root of the problem and have a fix.

The problem is it is unable to map the ACPI tables. This is because the BIOS does not flag the ACPI regions as ACPI but simply marks them as "reserved".

BIOS-e820: 0000000000000000 - 0000000000097000 (usable)
BIOS-e820: 0000000000097000 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000e8000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000007ffc2840 (usable)
BIOS-e820: 000000007ffc2840 - 0000000080000000 (reserved)
BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)

In the table above I found that the ACPI tables live in the 000000007ffc2840 - 0000000080000000 region. On most systems this would be marked as ACPI data however here it is simply "reserved".

The kdump kernel is told by kexec to ignore all of the data it finds on its own (via memmap=exactmap) and then passes in the memory info. Currently it only passes in what is known to be usable memory and ACPI memory, it does not tell it about reserved memory. This made the kdump kernel think that ACPI lived outside of legal memory so it was unable to map it. The fix is the pass in the reserved memory as well.

This should be considered a temporary workaround only since this leads to the potential of overflowing the command line length since we pass in many more memmap= arguments. However this has been tested and shown to fix kdump issues on several (potentially all) HP XW workstations.

Also, other testing has shown that this same fix also resolves the problem where kdump would hang when trying to enable HPET (bug 473404) which was seen on HP and other vendors hardware. That is a similar reason. In that case the HPET config register lived in a "reserved" region and could not be mapped by the kdump kernel.

I will attach the patch here however a prebuilt rpm is available inside Red Hat at:

http://hptestsvr.lab.bos.redhat.com/~dchapman/kexec/

Comment 5 Doug Chapman 2008-12-12 23:18:31 UTC

Created attachment 326793 [details]
patch to pass all reserved regions to kdump kernel

this patch tells kexec to pass all reserved e820 regions to the kdump kernel.  This should be considered a workaround only, there is the danger that since we are adding more arguments to the kernel command line we might cause overflow since the command line length is a finite length.

I have found that upstream kernels work OK even without this change to kexec-tools.  I am still investigating as to why that is.  One the kernel change that fixes this upstream is found that will be backported to the RHEL5.X kernel as the "real" fix.

Comment 6 Neil Horman 2008-12-15 11:44:23 UTC

*** Bug 475987 has been marked as a duplicate of this bug. ***

Comment 8 Neil Horman 2008-12-15 15:51:50 UTC

*** Bug 475498 has been marked as a duplicate of this bug. ***

Comment 9 Doug Chapman 2008-12-18 18:56:44 UTC

I did more digging in upstream kernel code.  As I mentioned earlier upstream works without this change to kdump.  I was hoping to find the fix upstream and backport it however from closer inspection that does not appear to be possible.

Upstream works in part because __acpi_map_table() has the ability to fall back to "fixed" mapping if it cannot directly map the table.  That bit of code was fairly easy to backport but it did not work due to the fact that it relies on the early memory reservation code (i.e. reserve_early() and related code in e820.c) which does not exist in RHEL5 and is too large to justify backporting when this userspace fix to kexec-tools will do the trick.

We had a concern that we might overflow the command line with all the additional memmap= arguments but since the command line length for RHEL5 is 4k that is not likely and worst case of overflowing is we would loose some of the memmap= args at the end of the command line.  In the rare case where a system needed one of those late reserved sections in a kdump boot we would be in the same situation we are now.

Comment 11 Neil Horman 2008-12-19 19:20:42 UTC

I'll incorporate this as soon as it has the appropriate pm acks to allow me to check it in.

Comment 13 Neil Horman 2008-12-20 00:53:22 UTC

Ok, so are we doing a hotfix or a z stream release here?  I see both in this bug (comment #10 or comment #12)

Comment 14 David Aquilina 2008-12-20 01:11:16 UTC

(In reply to comment #13)
> Ok, so are we doing a hotfix or a z stream release here?  I see both in this
> bug (comment #10 or comment #12)

Z-stream... we use the hotfix tracker to request all accelerated fixes (fastrack/hotfix/zstream/etc) and then support management makes the call on which is appropriate and sets the right flags.

Comment 15 Neil Horman 2008-12-22 12:21:31 UTC

*** Bug 471065 has been marked as a duplicate of this bug. ***

Comment 16 Neil Horman 2008-12-22 18:55:11 UTC

comitted to kexec=-tools-1.102-pre57.el5.  When the ztream/hotfix decision is made, I'll update cvs to reflect that appropriately.

Comment 18 Issue Tracker 2009-02-10 15:09:57 UTC

Hi Gary,

I got the system from our QA colleagues and was able to reproduce the
problem. I modified the kernel so that I could issue a NMI when the system
was hanging in the kdump kernel. This got me the following stack trace:

nmi....
IRQ0xa9_interrupt + 0x0/0xa
probe_irq_on + 0x6e/0x151
serial8250_config_port + 0x7c7/0x9c3
uart_add_one_port + 0xf7/0x278
platform_device_add + 0x111/0x148
serial8250_init + 0xdd/0x127
init + 0x1f9/0x2f7


It seems, the system is stuck just after the interrupt enable in
probe_irq_on. IRQ 0xa9 (169) belongs to the disk controller. Maybe, caused
by the high load, there was just a interrupt raised when the crash dump was
initiated.
Hopefully, this investigation can give your engineers some hints.

Kind regards,

Gerhard


This event sent from IssueTracker by streeter 
 issue 248647

Comment 22 errata-xmlrpc 2009-09-02 09:12:17 UTC

An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2009-1258.html

Note You need to log in before you can comment on or make changes to this bug.