Bug 252443

Summary: loading acpi-cpufreq results in general protection fault
Product: [Fedora] Fedora Reporter: FanWei <wei.fan>
Component: kernelAssignee: Jarod Wilson <jarod>
Status: CLOSED RAWHIDE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: high    
Version: 8CC: acpi-bugzilla, arumugam1175, austin.zhang, bibo.mao, cebbert, charlesr.harris, davej, grgustaf, urban
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-10-30 14:29:13 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 165150    

Description FanWei 2007-08-16 02:38:10 UTC
Description of problem:

After conplete  the installation, the reboot the system.
Enter the gui start process and stop at cpuspeed.
Starting process stop but the system have not hung.

The testing platform:intel Weybridge.
The testing OS: fedora f8test1 ia32.

the error message:
/etc/rc5.d/S06cpuspeed:line 112:1615 segmentation fault /sbin/modprobe acpi-
cpufreq 2 > /dev/null


Version-Release number of selected component (if applicable):


How reproducible:

Steps to Reproduce:
1.Install fedora f8test1 ia32 on intel weybridga and choice all 3 options to 
transaction set
2.Complete the installation,then reboot.
3.The system would stop at "cpuspeed" step.
  
Actual results:
The system should enter into configuration interface.

Expected results:
The system stoped.

Additional info:

Comment 1 Jarod Wilson 2007-08-17 14:59:14 UTC
Please chkconfig cpuspeed off, boot the system with 'cpufreq.debug=7' added to
the kernel boot params, and then try manually modprobing acpi-cpufreq. The
feedback from the modprobe, plus the tail end of dmesg output (dmesg | tail -20)
will hopefully give us some additional clues as to what's going on.

Also, if we have a system in the building that I can reproduce this on, I could
spend a bit of time poking at this myself... (Geoff, I don't suppose you know of
one I could use?)

Comment 2 Geoff Gustafson 2007-08-17 15:13:44 UTC
Brian Stein's team controls about 17 of them... I think some were supposed to go
to others but didn't. I do have one in my cube, but I'd go to the gold mine
first. :)

Comment 3 Jarod Wilson 2007-08-17 20:02:13 UTC
I've been able to reproduce the bug on a weybridge here in Westford:

[root@avocado ~]# uname -r
2.6.23-0.110.rc3.git1.fc8
[root@avocado ~]# modprobe acpi-cpufreq
Segmentation fault
[root@avocado ~]# 
Message from syslogd@ at Fri Aug 17 07:54:05 2007 ...
avocado kernel: general protection fault: 0000 [1] SMP 

set status page addr 0x00033000
acpi-cpufreq: acpi_cpufreq_init
acpi-cpufreq: acpi_cpufreq_early_init
general protection fault: 0000 [1] SMP 
CPU 1 
Modules linked in: acpi_cpufreq i915 drm ipt_MASQUERADE iptable_nat nf_nat
bridge autofs4 hidp rfcomm l2cap bluetoo
th sunrpc ipv6 nf_conntrack_netbios_ns nf_conntrack_ipv4 xt_state nf_conntrack
nfnetlink xt_tcpudp ipt_REJECT iptab
le_filter ip_tables x_tables video output sbs dock battery ac lp loop kvm_intel
kvm snd_hda_intel snd_seq_dummy snd
_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss
snd_pcm snd_timer 3c59x snd parport_pc
 sr_mod button soundcore mii e1000e parport floppy rtc_cmos cdrom i2c_i801
snd_page_alloc serio_raw i2c_core sg dm_
snapshot dm_zero dm_mirror dm_mod ata_generic ata_piix libata sd_mod scsi_mod
ext3 jbd mbcache ehci_hcd ohci_hcd uh
ci_hcd
Pid: 3574, comm: modprobe Not tainted 2.6.23-0.110.rc3.git1.fc8 #1
RIP: 0010:[<ffffffff8115c56f>]  [<ffffffff8115c56f>]
acpi_ns_get_parent_node+0x7/0x15
RSP: 0018:ffff810059acbcd0  EFLAGS: 00010282
RAX: 0000000000000000 RBX: 0000000000000005 RCX: ffff810059acbd58
RDX: ffff81007d1a5ee0 RSI: ffff810059acbd00 RDI: a56b6b6b6b6b6b6b
RBP: 0000000000000000 R08: ffff81007d1a5ee0 R09: ffff81007d0a3098
R10: 0000000000000287 R11: ffff81007d0a3098 R12: ffff81007d1a5ee0
R13: ffff810059acbd58 R14: ffffffff81163b22 R15: ffff810059acbdf8
FS:  00002aaaab0166f0(0000) GS:ffff810037c4d578(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000062600f CR3: 00000000795b9000 CR4: 00000000000026e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process modprobe (pid: 3574, threadinfo ffff810059aca000, task ffff8100598b6000)
Stack:  ffffffff8115c2d2 ffff810059acbd00 ffffffff81163a5a ffff810059acbd58
 ffffffff81163b4d ffff81007d0a3000 0000000000000018 ffff81007d0a3098
 ffffffff8116271c 0000000000000000 ffff81007d0a3000 ffff810059acbd98
Call Trace:
 [<ffffffff8115c2d2>] acpi_ns_get_pathname_length+0xa/0x31
 [<ffffffff81163a5a>] acpi_ut_get_simple_object_size+0x77/0xd9
 [<ffffffff81163b4d>] acpi_ut_get_element_length+0x2b/0x4f
 [<ffffffff8116271c>] acpi_ut_walk_package_tree+0x5b/0xd5
 [<ffffffff81163af6>] acpi_ut_get_object_size+0x3a/0x66
 [<ffffffff8115bea9>] acpi_evaluate_object+0x17a/0x1e5
 [<ffffffff8116d102>] acpi_processor_preregister_performance+0xce/0x41c
 [<ffffffff88118062>] :acpi_cpufreq:acpi_cpufreq_init+0x62/0x92
 [<ffffffff8105cfa8>] sys_init_module+0x15d5/0x173a
 [<ffffffff8100be55>] tracesys+0xdc/0xe1


Code: f6 47 0a 01 48 8b 7f 18 74 f6 48 89 f8 c3 31 c0 f6 47 0a 01 
RIP  [<ffffffff8115c56f>] acpi_ns_get_parent_node+0x7/0x15
 RSP <ffff810059acbcd0>


For the record, on the same system, running kernel-2.6.21-1.3228.fc7,
acpi-cpufreq loads just fine, and cpu frequency scaling works as expected.

Comment 4 Chuck Ebbert 2007-08-20 22:35:23 UTC
The latest rawhide kernels have ACPI debugging enabled; acpi.debug_level and
acpi.debug_layer kernel options should give some additional information.

Comment 5 Chuck Ebbert 2007-08-21 22:42:48 UTC
ffffffff8115c568 <acpi_ns_get_parent_node>:
/usr/src/debug/kernel-2.6.22/linux-2.6.22.x86_64/drivers/acpi/namespace/nsutils.c:905
ffffffff8115c56f:       f6 47 0a 01             testb  $0x1,0xa(%rdi)
ffffffff8115c573:       48 8b 7f 18             mov    0x18(%rdi),%rdi
ffffffff8115c577:       74 f6                   je     ffffffff8115c56f 

But RDI == a56b6b6b6b6b6b6b

So it is using freed memory as a pointer to something.


Comment 6 FanWei 2007-09-18 08:15:16 UTC
The testing platform:intel Weybridge.
The testing OS: fedora f8test2 i686 live CD.

This bug was recurring on Fedora f8test2 i686 live CD.

Comment 7 Chuck Ebbert 2007-09-18 17:08:59 UTC
Does this happen on vanilla kernels as well? Test kernel:

http://people.redhat.com/cebbert/kernels/F8/x86_64/kernel-vanilla-2.6.23-0.185.rc6.git7.V.x86_64.rpm


Comment 8 FanWei 2007-09-19 09:45:38 UTC
The bug also recurred on kernel-vanilla-2.6.23-0.185.rc6.git7.V.x86_64.rpm



Comment 9 Jarod Wilson 2007-10-24 13:07:05 UTC
*** Bug 349941 has been marked as a duplicate of this bug. ***

Comment 10 charles harris 2007-10-26 07:57:00 UTC
(In reply to comment #7)
> Does this happen on vanilla kernels as well? Test kernel:
> 
>
http://people.redhat.com/cebbert/kernels/F8/x86_64/kernel-vanilla-2.6.23-0.185.rc6.git7.V.x86_64.rpm
> 

I didn't have any problems with vanilla 2.6.23 from kernel.org. At the moment
I'm running 2.6.23.1-cfs-v22.1 and it works fine also. So did 2.6.23.rc{2,4,8}.

Comment 11 charles harris 2007-10-28 17:15:54 UTC
Kernel 2.6.23.1-35.fc8 boots on my machine.

Comment 12 Chuck Ebbert 2007-10-29 21:13:56 UTC
(In reply to comment #8)
> The bug also recurred on kernel-vanilla-2.6.23-0.185.rc6.git7.V.x86_64.rpm
> 
> 

Was this fixed for you in the latest Fedora 8 kernels?


Comment 13 bibo,mao 2007-10-30 02:22:38 UTC
kernel 2.6.23-0.214.rc8-git2 boots well on my machine also.

Comment 14 austin 2007-10-30 03:12:35 UTC
This bug still can be reproduced in "weybridge" platform under 2.6.23-
0.214.rc8-git2.fc8 (Fedora test 3 for x86-64.) The above comments from bibo is 
for "Montevina" platform. 

Comment 15 Jarod Wilson 2007-10-30 03:16:02 UTC
On weybridge, please give 2.6.23.1-37.fc8 or later a try -- I believe that's what Chuck was asking about.

Comment 16 austin 2007-10-30 03:32:40 UTC
I just try it in F8 test 3. Can you give me a hint on where can I get 2.6.23.1-
37.fc8 or later version? I didn't find it in 
http://people.redhat.com/cebbert/kernels/F8 and fedoraproject.org.
BTW, the F8 will use 2.6.23.1-xxx for final release?


Comment 17 austin 2007-10-30 03:37:03 UTC
I just try it in F8 test 3. Can you give me a hint on where can I get 2.6.23.1-
37.fc8 or later version? I didn't find it in 
http://people.redhat.com/cebbert/kernels/F8 and fedoraproject.org.
BTW, the F8 will use 2.6.23.1-xxx for final release?


Comment 18 charles harris 2007-10-30 03:44:25 UTC
Latest working is kernel-2.6.23.1-41.fc8:
http://koji.fedoraproject.org/koji/buildinfo?buildID=22818

Comment 19 austin 2007-10-30 04:20:21 UTC
Yes, I just tried the kernel-2.6.23.1-41.fc8 from charlesr, it was fixed. 

Comment 20 Arumugam 2009-03-10 17:52:35 UTC
Hi guys,
    I am new to this forum. I have a greencity board and just tried booting Redhat Enterprise Linux 2.6.28.6 on this board and i see the same error reported in this bug.

/etc/rc5.d/S06cpuspeed: line 111: 7089 Segmentation Fault /sbin/modprobe acpi-cpufreq 2 > /dev/NULL

   The system stops booting after getting this error.

   Could you please let me know what the fix is for this error?

   Appreciate your response.

Thanks,
Aru

Comment 21 Jarod Wilson 2009-03-10 18:52:47 UTC
(In reply to comment #20)
> Hi guys,
>     I am new to this forum. I have a greencity board and just tried booting
> Redhat Enterprise Linux 2.6.28.6 on this board and i see the same error
> reported in this bug.
> 
> /etc/rc5.d/S06cpuspeed: line 111: 7089 Segmentation Fault /sbin/modprobe
> acpi-cpufreq 2 > /dev/NULL
> 
>    The system stops booting after getting this error.
> 
>    Could you please let me know what the fix is for this error?


That's a different board and a different kernel. Could be a similar bug, but the one in this report is already fixed, so please open a new bug. When you do, please provide the console log from booting with the kernel parameters 'rhgb quiet' removed, and the additional kernel paramaters 'debug cpufreq.debug=7'.