Bug 1297554 - [abrt] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [in:imjournal:831]
Summary: [abrt] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [in:imjournal:831]
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 22
Hardware: x86_64
OS: Linux
unspecified
unspecified
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL: https://retrace.fedoraproject.org/faf...
Whiteboard: abrt_hash:af48fd4cf345d75db0a924e5fa4...
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-01-11 20:25 UTC by Bugzy
Modified: 2016-03-14 22:26 UTC (History)
8 users (show)

Fixed In Version: 4.3.6-201
Clone Of:
Environment:
Last Closed: 2016-03-14 22:26:07 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)
File: dmesg (64.34 KB, text/plain)
2016-01-11 20:25 UTC, Bugzy
no flags Details

Description Bugzy 2016-01-11 20:25:25 UTC
Description of problem:
update kernel
reboot computer with wifi enabled.

This soft lockup is caused by an update to the rtlwifi driver between kernel 4.1.3 and kernel 4.1.4. The issue was reported in an unrelated bug #1154286. Wheras it is surpossed to be fixed in kerlen 4.3 with this commit[1]. It looks like the fedora kernel is either not carrying that patch, or that the patch doe not fully undo the original problem.

[1] https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/drivers/net/wireless/rtlwifi/rtl8821ae/hw.c?id=54328e64047a54b8fc2362c2e1f0fa16c90f739f

See also [2] and [3]

[2] https://bugzilla.opensuse.org/show_bug.cgi?id=944978
[3] http://marc.info/?l=linux-wireless&m=144373370103285&w=2

Additional info:
reporter:       libreport-2.6.3
NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [in:imjournal:831]
Modules linked in: xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle bnep iptable_security iptable_raw vfat fat intel_rapl x86_pkg_temp_thermal coretemp kvm_intel kvm crct10dif_pclmul uvcvideo crc32_pclmul crc32c_intel arc4 videobuf2_vmalloc videobuf2_core rtl8821ae videobuf2_memops v4l2_common videodev btcoexist rtl_pci rtlwifi media mac80211 snd_hda_codec_hdmi snd_hda_codec_conexant snd_hda_codec_generic cfg80211 snd_soc_rt5640 snd_soc_rl6231 asus_nb_wmi snd_hda_intel asus_wmi sparse_keymap snd_hda_codec snd_soc_core iTCO_wdt iTCO_vendor_support
 snd_hda_core snd_compress btusb btrtl btbcm btintel bluetooth rfkill snd_pcm_dmaengine snd_hwdep ac97_bus snd_seq elan_i2c acpi_als snd_seq_device snd_pcm joydev wmi snd_timer i2c_designware_platform i2c_designware_core kfifo_buf snd i2c_i801 soundcore snd_soc_sst_acpi industrialio shpchp lpc_ich dw_dmac dw_dmac_core tpm_tis tpm processor_thermal_device int3400_thermal intel_soc_dts_iosf acpi_thermal_rel mei_me mei iosf_mbi int3402_thermal int340x_thermal_zone nfsd auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc hid_multitouch i915 i2c_algo_bit drm_kms_helper drm serio_raw sdhci_acpi sdhci mmc_core video i2c_hid fjes
CPU: 0 PID: 831 Comm: in:imjournal Not tainted 4.3.3-300.fc23.x86_64 #1
Hardware name: ASUSTeK COMPUTER INC. TX201LA/TX201LA, BIOS TX201LA.215 06/17/2014
task: ffff880119a65a00 ti: ffff8800cc72c000 task.ti: ffff8800cc72c000
RIP: 0010:[<ffffffff810a508b>]  [<ffffffff810a508b>] __do_softirq+0x7b/0x290
RSP: 0000:ffff88011ee03f28  EFLAGS: 00000206
RAX: ffff8800cc730000 RBX: 0000000000000000 RCX: 00000000000006e0
RDX: 0000000000000014 RSI: 00000000cf80e911 RDI: ffff880119a65a00
RBP: ffff88011ee03f78 R08: 00000014cf3f34a4 R09: ffff88011ee03db8
R10: 0000000000000000 R11: 0000000000000004 R12: 0000000000016c80
R13: ffff8800cc72c000 R14: ffff8800cc72ff58 R15: 0000000000000000
FS:  00007f34b27af700(0000) GS:ffff88011ee00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000560e3340efe4 CR3: 00000000cd31f000 CR4: 00000000001406f0
Stack:
 004040401ee0f168 ffff8800cc730000 00000000fffbcc2d 000000050000000a
 000002021ee0f0d8 0000000000000000 0000000000016c80 ffff8800cc72c000
 ffff8800cc72ff58 0000000000000000 ffff88011ee03f90 ffffffff810a54a5
Call Trace:

Comment 1 Bugzy 2016-01-11 20:25:33 UTC
Created attachment 1113669 [details]
File: dmesg

Comment 2 Bugzy 2016-01-25 21:19:57 UTC
Got some more information on this. It looks like the original patch from kernel 4.3.x did not fix the problem, and that a new patch[1] has been included in kernel 4.4 to resolve the problem. Is there any chance of back porting this patch to fedora kernel 4.3.4 ?


[1] https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/drivers/net/wireless/realtek/rtlwifi/rtl8821ae?id=eeec5d0ef7ee54a75e09e861c3cc44177b8752c7

Comment 3 Josh Boyer 2016-01-26 17:18:01 UTC
(In reply to Bugzy from comment #2)
> Got some more information on this. It looks like the original patch from
> kernel 4.3.x did not fix the problem, and that a new patch[1] has been
> included in kernel 4.4 to resolve the problem. Is there any chance of back
> porting this patch to fedora kernel 4.3.4 ?
> 
> 
> [1]
> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/
> drivers/net/wireless/realtek/rtlwifi/
> rtl8821ae?id=eeec5d0ef7ee54a75e09e861c3cc44177b8752c7

How about we test it first?

Please test this scratch build with the patch included when it completes and let us know how it works for this issue:

http://koji.fedoraproject.org/koji/taskinfo?taskID=12691482

Comment 4 Bugzy 2016-01-29 21:18:46 UTC
Hi @Josh,
-- First, I like to say a great big thanks for going through the trouble of including the patch and building a kernel that I could test with (that actually made my day, I did the victory dance and all that ;)).

-- Second, it took me a while to figure out that I had to click on the  "Descendants 	build" to get the generated rpms, so I ended up searching for and installing the wrong kernel packages [1], which of course failed.

-- Third, after installing the right packages, my system came up fine and the wireless was working again, which got me doing another dance.

-- Fourth, another GREAT BIG THANK YOU, you are SUPER AWESOME.

FYI: I am using and testing on fc22 (I could not upgrade to fc 23 cause it would have rendered my system inoperable)


[1] http://koji.fedoraproject.org/koji/taskinfo?taskID=12676566

Comment 5 Bugzy 2016-01-29 21:19:59 UTC
Just in case I was not clear above, the patched kernel http://koji.fedoraproject.org/koji/taskinfo?taskID=12691482 worked

Comment 6 Bugzy 2016-03-12 04:57:00 UTC
Fixed in Kernel 4.3.6-201

Comment 7 Laura Abbott 2016-03-14 22:26:07 UTC
Thank you for reporting and testing


Note You need to log in before you can comment on or make changes to this bug.