Bug 861288

Summary: panic panic
Product: Red Hat Enterprise Linux 6 Reporter: gaoqiang <gaoqiangscut>
Component: kernel Assignee: Prarit Bhargava <prarit>
Status: CLOSED NOTABUG QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 6.2   
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-09-28 12:57:06 UTC Type: Bug

Description gaoqiang 2012-09-28 05:03:12 UTC
Description of problem:

kernel panic

Version-Release number of selected component (if applicable):

2.6.32-220.7.1.el6.x86_64 on CentOS 6.2. Actually, it is a self-built kernel without CONFIG_COMPACTION.

How reproducible:

It happens randomly. Here is the panic information coming from netconsole:


 BUG: unable to handle kernel NULL pointer dereference at 0000000000000060 
 IP: [<ffffffff81051d92>] update_cfs_shares+0x32/0xf0 
 PGD 59de95067 PUD 51f46c067 PMD 0 
 Oops: 0000 [#1] SMP 
  
 last sysfs file: /sys/devices/system/cpu/cpu15/cache/index2/shared_cpu_map 
 CPU 4 
  
 Modules linked in: netconsole configfs autofs4 lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf xfs exportfs dm_multipath scsi_dh video output sbs sbshc power_meter hwmon acpi_pad parport_pc lp parport sg e1000e snd_hda_intel snd_hda_codec snd_hwdep serio_raw snd_pcsp i7core_edac ioatdma snd_pcm dca pata_acpi snd_timer snd i2c_i801 edac_core i2c_core soundcore ata_generic iTCO_wdt iTCO_vendor_support snd_page_alloc dm_raid45 dm_memcache xor dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod ata_piix
  
 RAX: 0000000000000040 RBX: ffff8806db0542c0 RCX: ffff8802dbd906c0 
 R13: 0000000000000001 R14: 0000000000000001 R15: ffff88044e414840 
 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 
  
  [<ffffffff8104d863>] deactivate_task+0x23/0x30 
  [<ffffffff81062391>] do_group_exit+0x41/0xb0 
 00 
  RSP <ffff8803b30c5cd8> 
 ---[ end trace 88a8aacb3849a04a ]--- 
 Call Trace: 
  [<ffffffff8105ff80>] ? kmsg_dump+0x130/0x180 
  [<ffffffff8103f1fc>] ? __bad_area_nosemaphore+0xec/0x1d0 
  [<ffffffff81244063>] ? cpumask_next_and+0x23/0x40 
  [<ffffffff8104ddb2>] ? select_idle_sibling+0x102/0x110 
  [<ffffffff81051fcf>] ? dequeue_task_fair+0x9f/0x140 
  [<ffffffff814a0d6b>] ? thread_return+0xcd/0x772 
  [<ffffffff810894d4>] ? switch_task_namespaces+0x24/0x70 
  [<ffffffff8100afb2>] ? system_call_fastpath+0x16/0x1b 
 BUG: NMI Watchdog detected LOCKUP on CPU0, ip ffffffff814a346e, registers: 
 CPU 0 
  
 Modules linked in: netconsole configfs autofs4 lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf xfs exportfs dm_multipath scsi_dh video output sbs sbshc power_meter hwmon acpi_pad parport_pc lp parport sg e1000e snd_hda_intel snd_hda_codec snd_hwdep serio_raw snd_pcsp i7core_edac ioatdma snd_pcm dca pata_acpi snd_timer snd i2c_i801 edac_core i2c_core soundcore ata_generic iTCO_wdt iTCO_vendor_support snd_page_alloc dm_raid45 dm_memcache xor dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod ata_piix libata shpchp mptsas mptscsih mptbase scsi_transport_sas sd_mod crc_t10dif scsi_mod ext3 jbd mbcache [last unloaded: microcode]
  
  
 Pid: 7160, comm: perl Tainted: G      D    ----------------   2.6.32-2.0.0.1 #4 Supermicro X8DTL/X8DTL
  
 RIP: 0010:[<ffffffff814a346e>] 
 RAX: 0000000000001851 RBX: ffff88044e414990 RCX: 0000000000000004 
 R10: 0000000000000000 R11: 0000000000000000 R12: ffff88044e414840 
 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
 Stack: 
  ffff88002820fda8
  <IRQ> 
  [<ffffffff81088700>] __run_hrtimer+0x80/0x170 
  <EOI> 
 74 55 00 c1 10 eb f4 00 
 Kernel panic - not syncing: Non maskable interrupt 
  [<ffffffff8105ff80>] ? kmsg_dump+0x130/0x180 
  [<ffffffff814a47d2>] ? die_nmi+0xb2/0x100 
  [<ffffffff814a346e>] ? _spin_lock+0x1e/0x30 
  [<ffffffff81088a16>] ? hrtimer_interrupt+0xd6/0x220 
  
 BUG: NMI Watchdog detected LOCKUP on CPU11, ip ffffffff81250009, registers: 
 CPU 11 
  
 Modules linked in: netconsole configfs autofs4 lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf xfs exportfs dm_multipath scsi_dh video output sbs sbshc power_meter hwmon acpi_pad parport_pc lp parport sg e1000e snd_hda_intel snd_hda_codec snd_hwdep serio_raw snd_pcsp i7core_edac ioatdma snd_pcm dca pata_acpi snd_timer snd i2c_i801 edac_core i2c_core soundcore ata_generic iTCO_wdt iTCO_vendor_support snd_page_alloc dm_raid45 dm_memcache xor dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod ata_piix libata shpchp mptsas mptscsih mptbase scsi_transport_sas sd_mod crc_t10dif scsi_mod ext3 jbd mbcache [last unloaded: microcode]
  
  
 Pid: 490, comm: cglimit_rm_dir Tainted: G      D    ----------------   2.6.32-2.0.0.1 #4 Supermicro X8DTL/X8DTL
  
 RIP: 0010:[<ffffffff81250009>] 
  [<ffffffff81250009>] __write_lock_failed+0x9/0x20 
 RSP: 0018:ffff8802b6afddc8  EFLAGS: 00000087 
 RAX: 0000000000000000 RBX: 0000000001200011 RCX: ffffffff817764c0 
 RDX: 0000000000000011 RSI: ffff88038e679850 RDI: ffffffff81759000 
 RBP: ffff8802b6afddd0 R08: a000000000000000 R09: ffff88043556c850 
 R10: 00000000ffffffff R11: 0000000000000040 R12: ffff88029732e7d0 
 R13: ffff88038e679850 R14: ffff88038e679b00 R15: ffff88038e679850 
 FS:  00007f447d93a6e0(0000) GS:ffff8800282e0000(0000) knlGS:0000000000000000 
 CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b 
 CR2: 0000000000438ee0 CR3: 00000002aed88000 CR4: 00000000000006e0 
 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 
 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 
 Stack: 
  ffff8803c9e326c0
  0000000000000000
  0000000000000000
  [<ffffffff814a344e>] ? _write_lock_irq+0x1e/0x20 
  [<ffffffff8105d435>] do_fork+0xb5/0x440 
  [<ffffffff8100b2d3>] stub_clone+0x13/0x20 
 Code: 83 d9 c0 ff 90 07 90 f6 01 ff 
 Kernel panic - not syncing: Non maskable interrupt 
 Pid: 490, comm: cglimit_rm_dir Tainted: G      D    ----------------   2.6.32-2.0.0.1 #4 
  [<ffffffff8105e3c5>] ? panic+0xa5/0x190 
  [<ffffffff814a45e7>] ? oops_end+0x87/0x100 
  [<ffffffff814a4e61>] ? nmi_watchdog_tick+0x161/0x1e0 
  [<ffffffff81250009>] ? __write_lock_failed+0x9/0x20 
  [<ffffffff8105d435>] ? do_fork+0xb5/0x440 
  [<ffffffff8100afb2>] ? system_call_fastpath+0x16/0x1b 
 BUG: NMI Watchdog detected LOCKUP on CPU2, ip ffffffff8104763a, registers: 
 CPU 2 
  
 Modules linked in: netconsole configfs autofs4 lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf xfs exportfs dm_multipath scsi_dh video output sbs sbshc power_meter hwmon acpi_pad parport_pc lp parport sg e1000e snd_hda_intel snd_hda_codec snd_hwdep serio_raw snd_pcsp i7core_edac ioatdma snd_pcm dca pata_acpi snd_timer snd i2c_i801 edac_core i2c_core soundcore ata_generic iTCO_wdt iTCO_vendor_support snd_page_alloc dm_raid45 dm_memcache xor dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod ata_piix libata shpchp mptsas mptscsih mptbase scsi_transport_sas sd_mod crc_t10dif scsi_mod ext3 jbd mbcache [last unloaded: microcode]
  
  
 Pid: 18644, comm: java Tainted: G      D    ----------------   2.6.32-2.0.0.1 #4 Supermicro X8DTL/X8DTL
  
 RIP: 0010:[<ffffffff8104763a>] 
 RSP: 0018:ffff8800bde13df8  EFLAGS: 00000006 
 RBP: ffff8800bde13df8 R08: ffff8801ca20a8f0 R09: ffff8801ca20a900 
 CR2: 0000000002176e10 CR3: 000000072e040000 CR4: 00000000000006e0 
 Stack: 
  ffff8803d8dfb080
  ffff8800bde13e98
  [<ffffffff81062e3e>] wait_consider_task+0x72e/0xa50 
  [<ffffffff8100afb2>] system_call_fastpath+0x16/0x1b 
 00 c1 18 82 11 39 1f 00 
 Pid: 18644, comm: java Tainted: G      D    ----------------   2.6.32-2.0.0.1 #4 
  [<ffffffff8105e02a>] ? oops_exit+0x1a/0x20 
  [<ffffffff814a402b>] ? do_nmi+0xcb/0x2d0 
  [<ffffffff81060ea8>] ? release_task+0x1c8/0x4a0 
  [<ffffffff81063422>] ? sys_wait4+0xa2/0xf0
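
For anyone reading the trace above: each frame has the form [<address>] symbol+0xoffset/0xsize, and a leading "?" marks an address the unwinder found on the stack but could not confirm as part of the call chain. A minimal Python sketch for pulling those fields apart follows. The names FRAME_RE and parse_frame are made up for illustration and are not part of any kernel tooling.

```python
import re

# Matches frames such as "[<ffffffff81051d92>] ? update_cfs_shares+0x32/0xf0".
# FRAME_RE and parse_frame are hypothetical helper names used only in this sketch.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]{16})>\]\s+(?P<unreliable>\?\s+)?"
    r"(?P<symbol>[A-Za-z_][\w.]*)\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
)


def parse_frame(line):
    """Return a dict describing one trace frame, or None if the line is not a frame."""
    m = FRAME_RE.search(line)
    if m is None:
        return None
    addr = int(m.group("addr"), 16)
    offset = int(m.group("offset"), 16)
    return {
        "symbol": m.group("symbol"),
        "address": addr,
        "offset": offset,                 # bytes into the function
        "size": int(m.group("size"), 16), # total function size in bytes
        "function_start": addr - offset,  # address where the function begins
        "reliable": m.group("unreliable") is None,
    }


frame = parse_frame(" [<ffffffff81051d92>] update_cfs_shares+0x32/0xf0 ")
print(frame["symbol"], hex(frame["function_start"]))
# prints: update_cfs_shares 0xffffffff81051d60
```

The offset alone only says how far into update_cfs_shares the fault occurred. Mapping it to a source line needs the vmlinux with debug info for this exact self-built kernel.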

Comment 1 gaoqiang 2012-09-28 05:05:09 UTC
Normally it runs well. This bug happens when the cgroup memory subsystem is used.

Comment 3 Prarit Bhargava 2012-09-28 12:57:06 UTC
> 2.6.32-220.7.1.el6.x86_64 on centos6.2,actually,it's a self-built kernel without CONFIG_COMPACTION

1.  We do not support custom compiling of the kernel.

2.  We do not support CentOS 6.2, which may have different config options enabled.

P.
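
Since the comments above turn on which config options the self-built kernel has, here is a small Python sketch for checking an option such as CONFIG_COMPACTION in a kernel config file. It relies only on the standard config format, where an enabled option appears as CONFIG_X=y (or =m) and a disabled one as "# CONFIG_X is not set". The function name kernel_option_state and the sample text are hypothetical and are not taken from the reporter's system.

```python
import re


def kernel_option_state(config_text, option):
    """Return the option's value ('y', 'm', ...) or None if it is unset or absent.

    kernel_option_state is a hypothetical helper written for this sketch.
    """
    pattern = re.compile(r"^%s=(.*)$" % re.escape(option), re.MULTILINE)
    m = pattern.search(config_text)
    return m.group(1) if m else None


# Made-up sample, not the reporter's actual config.
sample = """\
CONFIG_CGROUPS=y
# CONFIG_COMPACTION is not set
CONFIG_FAIR_GROUP_SCHED=y
"""

print(kernel_option_state(sample, "CONFIG_COMPACTION"))        # prints: None
print(kernel_option_state(sample, "CONFIG_FAIR_GROUP_SCHED"))  # prints: y
```

On RHEL-like systems the config for an installed kernel is usually found at /boot/config-<release>, so the same function could be fed the contents of that file. That location is an assumption and may not hold for a self-built kernel.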