Bug 677538

Summary: kdump failed to save vmcore on dell-pe6950-01.rhts.eng.bos.redhat.com
Product: Red Hat Enterprise Linux 6 Reporter: Chao Ye <cye>
Component: kernelAssignee: Cong Wang <amwang>
Status: CLOSED DUPLICATE QA Contact: Kernel Dump QE <kernel-dump-qe>
Severity: medium Docs Contact:
Priority: medium    
Version: 6.1CC: czhang, nhorman, peterm, rkhan
Target Milestone: rc   
Target Release: ---   
Hardware: i686   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-09-27 04:24:31 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Chao Ye 2011-02-15 07:21:02 UTC
Description of problem:
When run test /kernel/kdump/crash-crasher, system failed to dump vmcore.
===========================================
/mnt/tests/kernel/kdump/crash-crasher /mnt/tests/kernel/kdump/crash-crasher  
loaded crasher module 
Kernel panic - not syncing: KDUMP test panic 
 
Pid: 14043, comm: runtest.sh Not tainted 2.6.32-114.0.1.el6.i686 #1 
Call Trace: 
 [<c081e22e>] ? panic+0x42/0xf7 
 [<f7e1d0e0>] ? crasher_write+0x80/0x90 [crasher] 
 [<c0576d9d>] ? proc_file_write+0x5d/0x90 
 [<c0576d40>] ? proc_file_write+0x0/0x90 
 [<c0571fb4>] ? proc_reg_write+0x64/0xa0 
 [<c0571f50>] ? proc_reg_write+0x0/0xa0 
 [<c05259b0>] ? vfs_write+0xa0/0x190 
 [<c04ad35c>] ? audit_syscall_entry+0x21c/0x240 
 [<c0526431>] ? sys_write+0x41/0x70 
 [<c0409a7f>] ? sysenter_do_call+0x12/0x28 
Initializing cgroup subsys cpuset 
Initializing cgroup subsys cpu 
Linux version 2.6.32-114.0.1.el6.i686 (mockbuild.redhat.com) (gcc version 4.4.4 20100726 (Red Hat 4.4.4-13) (GCC) ) #1 SMP Thu Feb 10 15:58:19 EST 2011 
KERNEL supported cpus: 
  Intel GenuineIntel 
  AMD AuthenticAMD 
  NSC Geode by NSC 
  Cyrix CyrixInstead 
  Centaur CentaurHauls 
  Transmeta GenuineTMx86 
  Transmeta TransmetaCPU 
  UMC UMC UMC UMC 
BIOS-provided physical RAM map: 
 BIOS-e820: 0000000000000100 - 00000000000a0000 (usable) 
 BIOS-e820: 0000000000100000 - 00000000dfee8000 (usable) 
 BIOS-e820: 00000000dfee8000 - 00000000dff07c00 (ACPI data) 
 BIOS-e820: 00000000dff07c00 - 00000000e0000000 (reserved) 
 BIOS-e820: 00000000f0000000 - 00000000f8000000 (reserved) 
 BIOS-e820: 00000000fe000000 - 0000000100000000 (reserved) 
 BIOS-e820: 0000000100000000 - 0000000220000000 (usable) 
last_pfn = 0x220000 max_arch_pfn = 0x400000 
user-defined physical RAM map: 
 user: 0000000000000000 - 00000000000a0000 (usable) 
 user: 0000000002000000 - 0000000009f5a000 (usable) 
 user: 0000000009f5a800 - 0000000009f5f000 (usable) 
 user: 0000000009fff000 - 000000000a000000 (usable) 
 user: 00000000dfee8000 - 00000000dff07c00 (ACPI data) 
 user: 00000000dff07c00 - 00000000e0000000 (reserved) 
 user: 00000000f0000000 - 00000000f8000000 (reserved) 
 user: 00000000fe000000 - 0000000100000000 (reserved) 
DMI 2.4 present. 
last_pfn = 0xa000 max_arch_pfn = 0x400000 
x86 PAT enabled: cpu 0, old 0x7010600070106, new 0x7010600070106 
init_memory_mapping: 0000000000000000-000000000a000000 
NX (Execute Disable) protection: active 
RAMDISK: 09aa1000 - 09f4e0f4 
ACPI: RSDP 000f2340 00024 (v02 DELL  ) 
ACPI: XSDT 000f23a8 00064 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: FACP 000f2480 000F4 (v03 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: DSDT dfee8000 05551 (v01 DELL   PE_SC3   00000001 INTL 20050624) 
ACPI: FACS dff03400 00040 
ACPI: APIC 000f2574 000E0 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: SPCR 000f2655 00050 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: HPET 000f26a5 00038 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: MCFG 000f26dd 0003C (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: SLIC 000f2719 00176 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: SRAT 000fc0e4 001A0 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
ACPI: SSDT dff03800 00030 (v01 DELL   PE_SC3   00000001 DELL 00000001) 
0MB HIGHMEM available. 
160MB LOWMEM available. 
  mapped low ram: 0 - 0a000000 
  low ram: 0 - 0a000000 
  node 0 low ram: 00000000 - 0a000000 
  node 0 bootmap 00002000 - 00003400 
(9 early reservations) ==> bootmem [0000000000 - 000a000000] 
  #0 [0000000000 - 0000001000]   BIOS data page ==> [0000000000 - 0000001000] 
  #1 [0000001000 - 0000002000]    EX TRAMPOLINE ==> [0000001000 - 0000002000] 
  #2 [0000006000 - 0000007000]       TRAMPOLINE ==> [0000006000 - 0000007000] 
  #3 [0002400000 - 0002be8d90]    TEXT DATA BSS ==> [0002400000 - 0002be8d90] 
  #4 [0009aa1000 - 0009f4e0f4]          RAMDISK ==> [0009aa1000 - 0009f4e0f4] 
  #5 [000009f000 - 0000100000]    BIOS reserved ==> [000009f000 - 0000100000] 
  #6 [0002be9000 - 0002c011bc]              BRK ==> [0002be9000 - 0002c011bc] 
  #7 [0000007000 - 000000a000]          PGTABLE ==> [0000007000 - 000000a000] 
  #8 [0000002000 - 0000004000]          BOOTMAP ==> [0000002000 - 0000004000] 
found SMP MP-table at [c00fe710] fe710 
Zone PFN ranges: 
  DMA      0x00000001 -> 0x00001000 
  Normal   0x00001000 -> 0x0000a000 
  HighMem  0x0000a000 -> 0x0000a000 
Movable zone start PFN for each node 
early_node_map[4] active PFN ranges 
    0: 0x00000001 -> 0x000000a0 
    0: 0x00002000 -> 0x00009f5a 
    0: 0x00009f5b -> 0x00009f5f 
    0: 0x00009fff -> 0x0000a000 
Using APIC driver default 
Detected use of extended apic ids on hypertransport bus 
Detected use of extended apic ids on hypertransport bus 
Detected use of extended apic ids on hypertransport bus 
Detected use of extended apic ids on hypertransport bus 
ACPI: PM-Timer IO Port: 0x808 
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled) 
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled) 
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x04] enabled) 
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x06] enabled) 
ACPI: LAPIC (acpi_id[0x05] lapic_id[0x01] enabled) 
ACPI: LAPIC (acpi_id[0x06] lapic_id[0x03] enabled) 
ACPI: LAPIC (acpi_id[0x07] lapic_id[0x05] enabled) 
ACPI: LAPIC (acpi_id[0x08] lapic_id[0x07] enabled) 
ACPI: LAPIC (acpi_id[0x09] lapic_id[0x18] disabled) 
ACPI: LAPIC (acpi_id[0x0a] lapic_id[0x19] disabled) 
ACPI: LAPIC (acpi_id[0x0b] lapic_id[0x1a] disabled) 
ACPI: LAPIC (acpi_id[0x0c] lapic_id[0x1b] disabled) 
ACPI: LAPIC (acpi_id[0x0d] lapic_id[0x1c] disabled) 
ACPI: LAPIC (acpi_id[0x0e] lapic_id[0x1d] disabled) 
ACPI: LAPIC (acpi_id[0x0f] lapic_id[0x1e] disabled) 
ACPI: LAPIC (acpi_id[0x10] lapic_id[0x1f] disabled) 
ACPI: LAPIC_NMI (acpi_id[0xff] high edge lint[0x1]) 
ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0]) 
IOAPIC[0]: apic_id 8, version 17, address 0xfec00000, GSI 0-15 
ACPI: IOAPIC (id[0x09] address[0xfec01000] gsi_base[32]) 
IOAPIC[1]: apic_id 9, version 17, address 0xfec01000, GSI 32-47 
ACPI: IOAPIC (id[0x0a] address[0xfec02000] gsi_base[64]) 
IOAPIC[2]: apic_id 10, version 17, address 0xfec02000, GSI 64-79 
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) 
Using ACPI (MADT) for SMP configuration information 
ACPI: HPET id: 0x1166a201 base: 0xfed00000 
SMP: Allowing 16 CPUs, 8 hotplug CPUs 
PM: Registered nosave memory: 00000000000a0000 - 0000000002000000 
PM: Registered nosave memory: 0000000009f5a000 - 0000000009f5b000 
PM: Registered nosave memory: 0000000009f5f000 - 0000000009fff000 
Allocating PCI resources starting at a000000 (gap: a000000:d5ee8000) 
Booting paravirtualized kernel on bare hardware 
NR_CPUS:32 nr_cpumask_bits:32 nr_cpu_ids:16 nr_node_ids:1 
PERCPU: Embedded 14 pages/cpu @c2200000 s34584 r0 d22760 u131072 
pcpu-alloc: s34584 r0 d22760 u131072 alloc=1*2097152 
pcpu-alloc: [0] 00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15  
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 32446 
Kernel command line: ro root=/dev/mapper/vg_dellpe695001-lv_root rd_LVM_LV=vg_dellpe695001/lv_root rd_LVM_LV=vg_dellpe695001/lv_swap rd_NO_LUKS rd_NO_MD rd_NO_DM LANG=en_US.UTF-8 SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us console=ttyS1,57600 irqpoll maxcpus=1 reset_devices cgroup_disable=memory  memmap=exactmap memmap=640K@0K memmap=130408K@32768K memmap=18K@163178K memmap=4K@163836K memmap=127K#3668896K memmap=993K$3669023K memmap=131072K$3932160K memmap=32768K$4161536K elfcorehdr=163176K 
Misrouted IRQ fixup and polling support enabled 
This may significantly impact system performance 
Disabling memory control group subsystem 
PID hash table entries: 512 (order: -1, 2048 bytes) 
Dentry cache hash table entries: 16384 (order: 4, 65536 bytes) 
Inode-cache hash table entries: 8192 (order: 3, 32768 bytes) 
Enabling fast FPU save and restore... done. 
Enabling unmasked SIMD FPU exception support... done. 
Initializing CPU#0 
Initializing HighMem for node 0 (00000000:00000000) 
Memory: 112676k/163840k available (4249k kernel code, 18244k reserved, 2253k data, 524k init, 0k highmem) 
virtual kernel memory layout: 
    fixmap  : 0xffad5000 - 0xfffff000   (5288 kB) 
    pkmap   : 0xff600000 - 0xff800000   (2048 kB) 
    vmalloc : 0xca800000 - 0xff5fe000   ( 845 MB) 
    lowmem  : 0xc0000000 - 0xca000000   ( 160 MB) 
      .init : 0xc2a5a000 - 0xc2add000   ( 524 kB) 
      .data : 0xc28264ba - 0xc2a59c48   (2253 kB) 
      .text : 0xc2400000 - 0xc28264ba   (4249 kB) 
Checking if this processor honours the WP bit even in supervisor mode...Ok. 
Hierarchical RCU implementation. 
NR_IRQS:2304 nr_irqs:1488 
Extended CMOS year: 2000 
Spurious LAPIC timer interrupt on cpu 0 
Console: colour VGA+ 80x25 
console [ttyS1] enabled 
HPET: 3 timers in total, 0 timers will be used for per-cpu timer 
Fast TSC calibration using PIT 
Detected 1995.080 MHz processor. 
Calibrating delay loop (skipped), value calculated using timer frequency.. 3990.16 BogoMIPS (lpj=1995080) 
pid_max: default: 32768 minimum: 301 
Security Framework initialized 
SELinux:  Initializing. 
Mount-cache hash table entries: 512 
Initializing cgroup subsys ns 
Initializing cgroup subsys cpuacct 
Initializing cgroup subsys memory 
Initializing cgroup subsys devices 
Initializing cgroup subsys freezer 
Initializing cgroup subsys net_cls 
Initializing cgroup subsys blkio 
CPU: Physical Processor ID: 0 
CPU: Processor Core ID: 1 
mce: CPU supports 5 MCE banks 
using C1E aware idle routine 
Checking 'hlt' instruction... OK. 
SMP alternatives: switching to UP code 
ACPI: Core revision 20090903 
Overriding APIC driver with bigsmp 
Enabling APIC mode:  Physflat.  Using 3 I/O APICs 
Leaving ESR disabled. 
..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1 
CPU0: Dual-Core AMD Opteron(tm) Processor 8212 stepping 02 
Performance Events: AMD PMU driver. 
... version:                0 
... bit width:              48 
... generic registers:      4 
... value mask:             0000ffffffffffff 
... max period:             00007fffffffffff 
... fixed-purpose events:   0 
... event mask:             000000000000000f 
Brought up 1 CPUs 
Total of 1 processors activated (3990.16 BogoMIPS). 
devtmpfs: initialized 
regulator: core version 0.5 
NET: Registered protocol family 16 
ACPI FADT declares the system doesn't support PCIe ASPM, so disable it 
ACPI: bus type pci registered 
PCI: MCFG configuration 0: base f0000000 segment 0 buses 0 - 63 
PCI: MCFG area at f0000000 reserved in E820 
PCI: Using MMCONFIG for extended config space 
PCI: Using configuration type 1 for base access 
bio: create slab <bio-0> at 0 
ACPI: BIOS _OSI(Linux) query ignored 
ACPI: Interpreter enabled 
ACPI: (supports S0 S4 S5) 
ACPI: Using IOAPIC for interrupt routing 
ACPI: No dock devices found. 
<==============================================Hang here

Version-Release number of selected component (if applicable):
kernel-2.6.32-114.0.1.el6.i686
kernel-firmware-2.6.32-114.0.1.el6.noarch
kexec-tools-2.0.0-165.el6.i686

How reproducible:
Issue found on dell-pe6950-01.rhts.eng.bos.redhat.com

Steps to Reproduce:
1.Setup kdump
2.Run test /kernel/kdump/crash-crasher as TESTARGS=0
3.
  
Actual results:
Kdump failed and system hang

Expected results:
Kdump executed as expected

Additional info:
https://beaker.engineering.redhat.com/recipes/108173
http://lab2.rhts.eng.bos.redhat.com/beaker/logs/recipes/108173///console.log

Comment 3 Brock Organ 2011-03-01 14:43:05 UTC
Reporter,

Could I please ask you to provide a priority assessment (set the priority field to one of urgent/high/medium/low) for the impact of this issue?  This will help us prioritize this issue with our other outstanding bugs for the current release cycle ...

Regards,

Brock

Comment 4 Neil Horman 2011-03-02 13:22:53 UTC
Not that it should matter, but I find it odd that the last usable e820 section got left out of the user map:
BIOS-e820: 0000000100000000 - 0000000220000000 (usable) 
Amerigo, it might be worth looking into why that got dropped in kdump

Comment 5 RHEL Program Management 2011-04-04 02:03:20 UTC
Since RHEL 6.1 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 6 Cong Wang 2011-05-27 05:47:18 UTC
This might be a dup of Bug 601120.

Comment 7 Caspar Zhang 2011-09-27 04:24:31 UTC
closing per comment 6.

Feel free to re-open if you think they're two different bugs.

*** This bug has been marked as a duplicate of bug 601120 ***