Bug 664145

Summary: kernel 2.6.35.10-68 doesn't complete booting
Product: [Fedora] Fedora
Reporter: antonio montagnani <antonio.montagnani>
Component: kernel
Assignee: Jarod Wilson <jarodwilson>
Status: CLOSED ERRATA
QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium
Priority: low
Version: 14
CC: albiii, bugzilla, eb30750, gansalmon, itamar, jarod, jonathan, josdekloe, kernel-maint, kmcmartin, madhu.chinakonda, martin, maximlevitsky, me, req1348, vitor.dominor, wgianopoulos
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: kernel-2.6.35.10-74.fc14
Doc Type: Bug Fix
Last Closed: 2010-12-26 19:54:39 UTC
Attachments:
Description Flags
Dmesg file... none

Description antonio montagnani 2010-12-18 18:10:15 UTC
Description of problem:
Boot stops when setting hostname 

Version-Release number of selected component (if applicable):
2.6.35.10-68

How reproducible:
always during booting

Steps to Reproduce:
1. Just start the machine

Actual results:
I get a message about udevd-work [519]: sbin/modprobe -bv acpi:ENE0100: unexpected exit with status 0x0009

Expected results:
normal boot

Additional info:

Comment 1 Jarod Wilson 2010-12-20 15:23:04 UTC
*** Bug 664191 has been marked as a duplicate of this bug. ***

Comment 2 Maxim Levitsky 2010-12-20 16:27:29 UTC
As an author of the ENE driver, I would be more than happy to track that bug down.

Comment 3 Maxim Levitsky 2010-12-20 16:29:07 UTC
What system do you use?

Comment 4 antonio montagnani 2010-12-20 21:06:44 UTC
My smolt profile is:

http://www.smolts.org/client/show/pub_c38b5c49-9c83-4457-8db8-03d91072afa6

Comment 5 antonio montagnani 2010-12-20 21:07:51 UTC
Created attachment 469852 [details]
Dmesg file...

If you need additional information, I will try to supply it.

Comment 6 antonio montagnani 2010-12-21 06:09:23 UTC
same trouble with kernel 2.6.35.10-69

Comment 7 Dario Castellarin 2010-12-21 09:50:44 UTC
Despite its changelog, I'm still seeing this issue with kernel -72

Comment 8 Jarod Wilson 2010-12-21 15:37:10 UTC
(In reply to comment #7)
> Despite its changelog, I'm still seeing this issue with kernel -72

Okay. I'd hoped we might have gotten lucky and discovered the cause merely by code inspection, but I think Maxim is going to have to work through this one with you.

Comment 9 Kyle McMartin 2010-12-21 17:28:47 UTC
*** Bug 664769 has been marked as a duplicate of this bug. ***

Comment 10 Paul Lambert 2010-12-22 14:32:47 UTC
The 10-72 kernel does not boot on an HP Pavilion dv7 laptop. Boot progress halts before the last line or two of the Fedora icon is completely filled. Possibly the desktop is not launching properly.

Comment 11 antonio montagnani 2010-12-22 14:59:01 UTC
If you hit Esc during boot you will get additional information, and you can report the error shown when the boot procedure stops.

Comment 12 Paul Lambert 2010-12-22 17:16:54 UTC
Additional boot information for HP dv7 laptop 10-72 kernel.

Welcome to Fedora

Starting udev: udevd-work[495] `/sbin/modprobe -bv acpi: ENE0100:` unexpected exit with status 0x0009       [OK]

setting hostname <my name>    [OK]

Comment 13 Kyle McMartin 2010-12-22 17:26:30 UTC
Please confirm whether the -70 build here works or not: http://koji.fedoraproject.org/koji/buildinfo?buildID=210654

Comment 14 Paul Lambert 2010-12-22 18:17:57 UTC
The -70 build boots successfully on HP Pavilion dv7 laptop.  

What's the YUM command that will resync the package version to eliminate the RPMDB altered outside of YUM warning?

Comment 16 antonio montagnani 2010-12-22 20:03:53 UTC
kernel-2.6.35.10-72.fc14 has same problem

Comment 17 Dario Castellarin 2010-12-22 20:14:38 UTC
-70 works fine here too (Compal JFL92); -72, however, does not.

Comment 18 Jarod Wilson 2010-12-22 20:23:15 UTC
Yeah, 70 had all the updated ir bits reverted, 72 restored them, along with a prospective fix, which obviously wasn't the fix for this issue.

To get a better idea of what's actually going wrong, please try sliding ene_ir.ko out of the way (rename it ene_ir.ko.disabled or similar) for the 72 kernel, then boot. After that kernel has successfully booted (which it should), then go and name the kernel module back to its original name, and first, 'modprobe rc-core debug=1', then try a 'modprobe ene_ir debug=1'. Please provide any and all on-screen output, as well as the last several lines from dmesg, and hopefully we'll see something telling there.

Comment 19 Paul Lambert 2010-12-23 02:52:23 UTC
modprobe ene_ir debug=1

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285393] Oops: 0002 [#1] SMP 

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285399] last sysfs file: /sys/module/rc_core/initstate

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285638] Stack:

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285670] Call Trace:

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285826] Code: c7 83 f0 00 00 00 20 00 00 00 48 89 df c7 83 f4 00 00 00 32 00 00 00 c7 83 f8 00 00 00 03 00 00 00 88 83 fc 00 00 00 48 8b 43 08 <c7> 80 34 03 00 00 80 d1 f0 08 e8 11 f0 ff ff 41 80 8e 9c 00 00 

Message from syslogd@BRSINC-01 at Dec 22 21:43:34 ...
 kernel:[  394.285907] CR2: 0000000000000334


dmesg

[  394.285286] ene_ir: chip is 0x3926 - kbver = 0x00, rev = 0xc0
[  394.285295] ene_ir: PLL freq = 1406
[  394.285299] ene_ir: KB3926C detected
[  394.285315] ene_ir: Firmware regs: c1 01
[  394.285319] ene_ir: Hardware features:
[  394.285323] ene_ir: * Uses GPIO 40 for IR demodulated input
[  394.285356] BUG: unable to handle kernel NULL pointer dereference at 0000000000000334
[  394.285365] IP: [<ffffffffa04e47d6>] ene_probe+0x76f/0x814 [ene_ir]
[  394.285383] PGD 12860f067 PUD 12756c067 PMD 0 
[  394.285393] Oops: 0002 [#1] SMP 
[  394.285399] last sysfs file: /sys/module/rc_core/initstate
[  394.285405] CPU 0 
[  394.285409] Modules linked in: ene_ir(+) cryptd aes_x86_64 aes_generic fuse ppdev parport_pc parport sunrpc cpufreq_ondemand powernow_k8 freq_table mperf xt_physdev ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 kvm_amd kvm uinput snd_hda_codec_atihdmi arc4 snd_hda_codec_idt snd_hda_intel ecb snd_hda_codec snd_hwdep ath9k ath9k_common ir_lirc_codec lirc_dev snd_seq snd_seq_device ath9k_hw hp_wmi ir_sony_decoder uvcvideo videodev v4l1_compat ath mac80211 snd_pcm cfg80211 snd_timer ir_jvc_decoder v4l2_compat_ioctl32 ir_rc6_decoder snd soundcore rfkill microcode jmb38x_ms r8169 memstick ir_rc5_decoder edac_core joydev snd_page_alloc mii i2c_piix4 edac_mce_amd wmi k10temp ir_nec_decoder shpchp hp_accel lis3lv02d rc_core input_polldev ata_generic pata_acpi sdhci_pci sdhci pata_atiixp mmc_core video output radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
[  394.285534] 
[  394.285542] Pid: 2972, comm: modprobe Not tainted 2.6.35.10-72.fc14.x86_64 #1 30FC/HP Pavilion dv7 Notebook PC
[  394.285549] RIP: 0010:[<ffffffffa04e47d6>]  [<ffffffffa04e47d6>] ene_probe+0x76f/0x814 [ene_ir]
[  394.285564] RSP: 0018:ffff880138851d28  EFLAGS: 00010246
[  394.285570] RAX: 0000000000000000 RBX: ffff88013a50e200 RCX: 0000000000001473
[  394.285576] RDX: 000000000000fd63 RSI: 000000000000f8f9 RDI: ffff88013a50e200
[  394.285582] RBP: ffff880138851d78 R08: 00000000c000003f R09: 00000000fffffffe
[  394.285588] R10: ffff8801b8851c47 R11: 0000000000000000 R12: ffff880127508400
[  394.285594] R13: ffff88013a50e210 R14: ffff88013b57d400 R15: 0000000000000001
[  394.285601] FS:  00007fb6bb806720(0000) GS:ffff880002000000(0000) knlGS:00000000f7369b70
[  394.285608] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  394.285614] CR2: 0000000000000334 CR3: 0000000127566000 CR4: 00000000000006f0
[  394.285620] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  394.285626] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  394.285633] Process modprobe (pid: 2972, threadinfo ffff880138850000, task ffff8801360645c0)
[  394.285638] Stack:
[  394.285641]  ffff880138851d48 ffffffff000000c1 ffffffffa04e4d01 c1260057a04e4d30
[  394.285650] <0> ffff880138851d78 ffff88013b57d400 ffffffffa04e4067 ffffffffa04e5b50
[  394.285659] <0> ffffffffa04e5b90 00000000014f84b0 ffff880138851db8 ffffffff81297b87
[  394.285670] Call Trace:
[  394.285682]  [<ffffffffa04e4067>] ? ene_probe+0x0/0x814 [ene_ir]
[  394.285695]  [<ffffffff81297b87>] pnp_device_probe+0x81/0xab
[  394.285705]  [<ffffffff812d7c6a>] ? driver_sysfs_add+0x4c/0x71
[  394.285714]  [<ffffffff812d804b>] driver_probe_device+0x14b/0x271
[  394.285722]  [<ffffffff812d81ce>] __driver_attach+0x5d/0x81
[  394.285730]  [<ffffffff812d8171>] ? __driver_attach+0x0/0x81
[  394.285738]  [<ffffffff812d6f4f>] bus_for_each_dev+0x56/0x82
[  394.285747]  [<ffffffff812d7c1c>] driver_attach+0x1e/0x20
[  394.285754]  [<ffffffff812d77f6>] bus_add_driver+0xf1/0x25e
[  394.285762]  [<ffffffff812d8432>] driver_register+0x9b/0x108
[  394.285773]  [<ffffffffa04e9000>] ? ene_init+0x0/0x12 [ene_ir]
[  394.285781]  [<ffffffff8129790f>] pnp_register_driver+0x21/0x23
[  394.285791]  [<ffffffffa04e9010>] ene_init+0x10/0x12 [ene_ir]
[  394.285802]  [<ffffffff810021a1>] do_one_initcall+0x5e/0x155
[  394.285812]  [<ffffffff8107cc7d>] sys_init_module+0xa6/0x1e4
[  394.285821]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
[  394.285826] Code: c7 83 f0 00 00 00 20 00 00 00 48 89 df c7 83 f4 00 00 00 32 00 00 00 c7 83 f8 00 00 00 03 00 00 00 88 83 fc 00 00 00 48 8b 43 08 <c7> 80 34 03 00 00 80 d1 f0 08 e8 11 f0 ff ff 41 80 8e 9c 00 00 
[  394.285892] RIP  [<ffffffffa04e47d6>] ene_probe+0x76f/0x814 [ene_ir]
[  394.285903]  RSP <ffff880138851d28>
[  394.285907] CR2: 0000000000000334
[  394.285960] ---[ end trace 2dacd001df6b3652 ]---
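
The oops above can be cross-checked by hand. As a rough sketch (user-space C, not kernel code; the helper and type names are made up for illustration), the bytes at the `<c7>` marker in the Code: line decode as a 32-bit store of an immediate to a base register plus a 32-bit displacement. Since RAX is 0 in the register dump, the displacement should match CR2.

```c
#include <stdint.h>

/* Bytes copied from the oops "Code:" line, starting at the marked <c7>. */
const uint8_t fault_insn[10] = {
    0xc7, 0x80, 0x34, 0x03, 0x00, 0x00, 0x80, 0xd1, 0xf0, 0x08
};

/* Hypothetical result type for this sketch. */
struct mov_imm32 {
    int      ok;        /* 1 if the bytes matched the expected form */
    unsigned base_reg;  /* ModRM r/m field; 0 means RAX (no REX prefix) */
    uint32_t disp;      /* 32-bit displacement added to the base register */
    uint32_t imm;       /* 32-bit immediate being stored */
};

/* Read a little-endian 32-bit value from a byte buffer. */
uint32_t le32(const uint8_t *p)
{
    return (uint32_t)p[0] | ((uint32_t)p[1] << 8) |
           ((uint32_t)p[2] << 16) | ((uint32_t)p[3] << 24);
}

/* Decode only the one form seen in this oops: C7 /0 with mod=10b (disp32). */
struct mov_imm32 decode_mov_imm32(const uint8_t *insn)
{
    struct mov_imm32 r = { 0, 0, 0, 0 };
    uint8_t modrm = insn[1];

    if (insn[0] != 0xc7)            /* opcode C7 /0: MOV r/m32, imm32 */
        return r;
    if ((modrm >> 6) != 2)          /* mod = 10b: base register + disp32 */
        return r;
    if (((modrm >> 3) & 7) != 0)    /* reg field must be /0 for this MOV */
        return r;
    if ((modrm & 7) == 4)           /* r/m = 100b would mean a SIB byte */
        return r;

    r.ok = 1;
    r.base_reg = modrm & 7;
    r.disp = le32(insn + 2);
    r.imm = le32(insn + 6);
    return r;
}
```

Calling `decode_mov_imm32(fault_insn)` gives base register 0 (RAX), displacement 0x334 and immediate 0x08f0d180. With RAX = 0 the store targets address 0x334, which matches CR2, and 0x08f0d180 is 150000000 in decimal, which would be consistent with a 150000 ms timeout written in microseconds.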

Comment 20 Kyle McMartin 2010-12-23 03:46:00 UTC
dev->rdev is uninitialized at the call-site:

/* Initialize default settings */
static void ene_setup_default_settings(struct ene_device *dev)
{
...
        /* Set reasonable default timeout */
        dev->rdev->timeout = MS_TO_US(150000);
}

/* probe entry */
static int ene_probe(struct pnp_dev *pnp_dev, const struct pnp_device_id *id)
{
...
        ene_rx_setup_hw_buffer(dev);
        ene_setup_default_settings(dev);
        ene_setup_hw_settings(dev);

        device_set_wakeup_capable(&pnp_dev->dev, true);
        device_set_wakeup_enable(&pnp_dev->dev, true);

        error = rc_register_device(rdev);
        if (error < 0)
                goto error;

        dev->rdev = rdev;
...
}
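
The ordering problem in the excerpt above can be modelled in a few lines of user-space C. This is only a sketch under stated assumptions: the `model_*` structs and functions are hypothetical stand-ins for the kernel types, `MODEL_MS_TO_US` is assumed to multiply by 1000, and the NULL check exists only so the model can report the problem instead of crashing (the kernel code in the excerpt has no such check). The actual patch shipped in the fixed build is not shown in this report; assigning `dev->rdev` before it is used is simply the most direct way to avoid the dereference.

```c
#include <stddef.h>

/* Hypothetical stand-ins for the kernel's rc device and ene_device. */
struct model_rc_dev {
    unsigned int timeout;
};

struct model_ene_device {
    struct model_rc_dev *rdev;
};

/* Assumption: milliseconds to microseconds is a multiply by 1000. */
#define MODEL_MS_TO_US(ms) ((ms) * 1000)

/*
 * Mirrors the shape of ene_setup_default_settings(): it writes through
 * dev->rdev. Returns -1 where the real code would fault on a NULL pointer.
 */
int model_setup_default_settings(struct model_ene_device *dev)
{
    if (dev->rdev == NULL)
        return -1;
    dev->rdev->timeout = MODEL_MS_TO_US(150000);
    return 0;
}

/* Order from the excerpt: settings are applied before dev->rdev is set. */
int model_probe_buggy(struct model_ene_device *dev, struct model_rc_dev *rdev)
{
    int ret = model_setup_default_settings(dev);  /* dev->rdev still NULL */

    dev->rdev = rdev;
    return ret;
}

/* One possible reordering: attach rdev first, then apply the defaults. */
int model_probe_fixed(struct model_ene_device *dev, struct model_rc_dev *rdev)
{
    dev->rdev = rdev;
    return model_setup_default_settings(dev);
}
```

In this model `model_probe_buggy()` returns -1 and leaves the timeout untouched, while `model_probe_fixed()` returns 0 and sets the timeout to 150000000.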

Comment 21 Kyle McMartin 2010-12-23 03:52:11 UTC
It's a non-issue in 2.6.37, since the code has been re-written to use the proper input stuff, looks like it's only an lirc issue...

Comment 22 Kyle McMartin 2010-12-23 03:52:46 UTC
http://koji.fedoraproject.org/koji/taskinfo?taskID=2686028

Please confirm this scratch build fixes the issues with ene_ir and I'll quickly push a stable update for it.

Thanks, Kyle

Comment 23 Paul Lambert 2010-12-23 04:06:36 UTC
Results of attempting to install the 10-73 patch: since no user or group named mockbuild exists, the installer stated it was using root, but it still failed.


warning: user mockbuild does not exist - using root
warning: group mockbuild does not exist - using root
########################################### [100%]
error: unpacking of archive failed on file /root/rpmbuild/SOURCES/linux-2.6.35.tar.bz2;4d12ca38: cpio: read failed - Bad file descriptor
error: kernel-2.6.35.10-73.fc14.src.rpm cannot be installed

Comment 24 Kyle McMartin 2010-12-23 04:13:25 UTC
just wait for the build to finish.

Comment 25 Jarod Wilson 2010-12-23 04:46:48 UTC
(In reply to comment #20)
> dev->rdev is uninitialized at the call-site:

Gah. Nasty. At least it's an obvious fix. I should have seen that one.

(In reply to comment #21)
> It's a non-issue in 2.6.37, since the code has been re-written to use the
> proper input stuff, looks like it's only an lirc issue...

Nothing at all to do with lirc. The code in 2.6.37 is actually older than what's in the Fedora kernel now; the Fedora code is from the v4l/dvb media_tree's staging/for_v2.6.38 branch, where the primary rc-core interfaces have been rewritten.

http://git.linuxtv.org/media_tree.git?a=commitdiff;h=1639cc0c5348957c076c0304005632d4a1b63252

So this bug actually does appear to be present in the upstream v4l/dvb tree too. At least it should be trivial to fix.

Comment 26 Alexei Panov 2010-12-23 13:18:54 UTC
Oh, and my Toshiba A660 is also affected by this bug.
My workaround is to disable CIR in the BIOS settings or to blacklist the ene_ir module.

Comment 27 Kyle McMartin 2010-12-23 16:19:39 UTC
https://koji.fedoraproject.org/koji/taskinfo?taskID=2686028

Try that, Alexei.

Comment 28 Alexei Panov 2010-12-23 16:48:55 UTC
Thank you Kyle.

Yes, kernel 2.6.35.10-73 works fine.

$ dmesg | grep ene_ir
[    8.116334] ene_ir: chip is 0x3926 - kbver = 0x00, rev = 0xd2
[    8.116337] ene_ir: PLL freq = 1000
[    8.116339] ene_ir: KB3926D or higher detected
[    8.116351] ene_ir: Firmware regs: ec 00
[    8.116352] ene_ir: Hardware features:
[    8.116354] ene_ir: * Uses GPIO 40 for IR demodulated input
[    8.116355] ene_ir: * Uses new style input buffer
[    8.116397] ene_ir: Hardware uses 2 extended buffers:
[    8.116399] ene_ir:   0xfa48 - len : 15
[    8.116401] ene_ir:   0xfb24 - len : 17
[    8.116403] ene_ir: Total buffer len = 40
[    9.133352] ene_ir: driver has been succesfully loaded
[   10.167652] rc rc0: lirc_dev: driver ir-lirc-codec (ene_ir) registered at minor = 0

Comment 29 Bill Gianopoulos 2010-12-23 18:07:51 UTC
*** Bug 665416 has been marked as a duplicate of this bug. ***

Comment 30 vitor.dominor 2010-12-23 19:09:27 UTC
I would also like to say thanks for this quick bug fix.
I was in the same situation: after updating to this kernel version, Linux failed to boot and gave the same kind of messages reported here.
With this patched version (73), I am now able to successfully boot Fedora 14.

Comment 31 Dario Castellarin 2010-12-23 19:57:33 UTC
I confirm -74 works fine here too, thank you!

Comment 32 Jarod Wilson 2010-12-23 20:35:54 UTC
*** Bug 664191 has been marked as a duplicate of this bug. ***

Comment 33 Jarod Wilson 2010-12-23 20:37:16 UTC
Kyle, do you want to ship the patch upstream, or shall I? (Or Maxim?)

Comment 34 Fedora Update System 2010-12-23 20:47:52 UTC
kernel-2.6.35.10-74.fc14 has been submitted as an update for Fedora 14.
https://admin.fedoraproject.org/updates/kernel-2.6.35.10-74.fc14

Comment 35 Kyle McMartin 2010-12-23 20:50:40 UTC
*** Bug 665392 has been marked as a duplicate of this bug. ***

Comment 36 Kyle McMartin 2010-12-23 20:51:26 UTC
Great, thanks for testing folks, I've submitted -74 as an update today.

Comment 37 Fedora Update System 2010-12-25 00:26:05 UTC
kernel-2.6.35.10-74.fc14 has been pushed to the Fedora 14 testing repository.  If problems still persist, please make note of it in this bug report.
If you want to test the update, you can install it with
su -c 'yum --enablerepo=updates-testing update kernel'
You can provide feedback for this update here: https://admin.fedoraproject.org/updates/kernel-2.6.35.10-74.fc14

Comment 38 Andy Bentley 2010-12-25 12:11:53 UTC
kernel-2.6.35.10-74.fc14 from updates-testing works and fixes this bug for me (as in the original report). Thank you all for pushing through the start of the holiday. :-) Merry Christmas!

Comment 39 Fedora Update System 2010-12-26 19:54:22 UTC
kernel-2.6.35.10-74.fc14 has been pushed to the Fedora 14 stable repository.  If problems still persist, please make note of it in this bug report.

Comment 40 MartinG 2011-01-09 21:26:24 UTC
I've got similar problems with the kernel-2.6.35.10-74.fc14.x86_64 on F14 - unable to boot. It halts after the modprobe message, but complains about usb instead. I don't have the exact message at hand right now.

It is an Asus P5E-VM HDMI, smolt profile: http://www.smolts.org/client/show/pub_a42197f6-2e32-4adc-baaa-549b5b9c5a4b

Kernel 2.6.35.9-64.fc14.x86_64 works fine.

Same issue reported by someone else here:
http://forums.fedoraforum.org/showthread.php?p=1432695

Comment 41 Jarod Wilson 2011-01-10 06:11:59 UTC
(In reply to comment #40)
> I've got similar problems with the kernel-2.6.35.10-74.fc14.x86_64 on F14 -
> unable to boot. It halts after the modprobe message, but complains about usb
> instead. I don't have the exact message at hand right now.
> 
> It is an Asus P5E-VM HDMI, smolt profile:
> http://www.smolts.org/client/show/pub_a42197f6-2e32-4adc-baaa-549b5b9c5a4b
> 
> Kernel 2.6.35.9-64.fc14.x86_64 works fine.
> 
> Same issue reported by someone else here:
> http://forums.fedoraforum.org/showthread.php?p=1432695

This is a different issue from what's reported in this bug. This bug deals with the ene_ir driver. Your issue appears to be a usb one, introduced between -65 and -67, according to the link you provided. And this bug is already closed. Please open a new bugzilla for your issue.

Comment 42 antonio montagnani 2011-01-23 21:03:54 UTC
I am suffering from the same problem, but it happens only seldom. When it does, I have to boot with a previous kernel; after that it restarts fine.

Comment 43 antonio montagnani 2011-02-06 14:40:43 UTC
Not completely solved in my case for the ene_ir driver; it happens randomly. I opened a separate bug, 672086.