Bug 781356

Summary: rmmod amd64_edac_mod causes kernel soft lockup
Product: Red Hat Enterprise Linux 6 Reporter: adm.fkt.physik
Component: kernelAssignee: Aristeu Rozanski <arozansk>
Status: CLOSED DUPLICATE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high Docs Contact:
Priority: unspecified    
Version: 6.4CC: arozansk
Target Milestone: rc   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-10-15 14:58:23 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
/var/log/messaes excerpt for rmmod amd64_edac_mod
none
/var/log/messages excerpt for rmmod amd64_edac_mod after kernel update
none
/var/log/messages excerpt for rmmod amd64_edac_mod after kernel and distro update none

Description adm.fkt.physik 2012-01-13 11:12:42 UTC
Created attachment 555034 [details]
/var/log/messaes excerpt for rmmod amd64_edac_mod

Description of problem:
Boot kernel 2.6.32-220.2.1.el6.x86_64 and unload EDAC modules by
  rmmod amd64_edac_mod edac_mce_amd edac_core
Kernel will inhibit soft lockup BUG and will become unresponsive.

Version-Release number of selected component (if applicable):
kernel-2.6.32-220.2.1.el6.x86_64
kernel-firmware-2.6.32-220.2.1.el6.noarch

How reproducible: always

Steps to Reproduce:
1. Boot kernel 2.6.32-220.2.1.el6.x86_64 on 
   Supermicro H8DGT-HF BIOS Date: 11/09/2011 Rev: 2.0a
   (also tested: BIOS Rev: 1.0c)
   and 2 x Opteron 6128
      vendor_id : AuthenticAMD
      cpu family: 16
      model     : 9
      model name: AMD Opteron(tm) Processor 6128
      stepping  : 1
   and 64 GB RAM.
2. modprobe amd64_edac_mod edac_mce_amd edac_core
   (if modules are not auto-loaded)
3. rmmod amd64_edac_mod
  
Actual results: see attachment with excerpt of /var/log/messsages.

Expected results: unload the module, no crash

Additional info: works fine with 2.6.32-131.21.1.el6.x86_64

Comment 2 adm.fkt.physik 2012-01-27 10:55:23 UTC
Created attachment 557840 [details]
/var/log/messages excerpt for rmmod amd64_edac_mod after kernel update

update from 2.6.32-220.2.1.el6.x86_64 to 2.6.32-220.4.1.el6.x86_64 does not fix the bug

Comment 4 RHEL Program Management 2012-05-03 05:21:01 UTC
Since RHEL 6.3 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 5 adm.fkt.physik 2013-08-14 09:00:17 UTC
update to 6.4 and to kernel-2.6.32-358.14.1.el6.x86_64 does not fix the bug

rmmod amd64_edac_mod
-> shell freezes
-> edac_core depdendy -1
-> module amd64_edac_mod not removed
-> after ~1 minute node freezes completely

Comment 6 adm.fkt.physik 2013-08-14 09:02:03 UTC
Created attachment 786463 [details]
/var/log/messages excerpt for rmmod amd64_edac_mod after kernel and distro update

Comment 7 Aristeu Rozanski 2013-10-15 14:58:23 UTC
This is fixed by BZ#831127 which should be in 6.5.

*** This bug has been marked as a duplicate of bug 831127 ***