Bug 1352674 - [RHELSA-7.3] 4.5.0-0.43.el7 module-load tested LWD
Summary: [RHELSA-7.3] 4.5.0-0.43.el7 module-load tested LWD
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: kernel-aarch64
Version: 7.3
Hardware: aarch64
OS: Linux
Priority: high
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: Iyappan Subramanian
QA Contact: Jeff Bastian
URL:
Whiteboard:
Depends On:
Blocks: 1250212
Reported: 2016-07-04 16:49 UTC by PaulB
Modified: 2016-09-12 18:37 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-09-12 18:37:40 UTC
Target Upstream Version:
Embargoed:



Description PaulB 2016-07-04 16:49:40 UTC
Description of problem:
We hit the local watchdog (LWD) while running /kernel/misc/module-load.
A modprobe process was stuck in uninterruptible sleep (state D) and did not return:
 modprobe        D fffffe000009463c     0 17063  17020 0x00000200
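
For readers less familiar with the task summary line above, here is a minimal Python sketch that splits it into named fields. The field names (saved PC, free stack, parent PID, thread flags) are my reading of the kernel's task-dump layout for kernels of this era and are not stated anywhere in this report, so treat them as assumptions.

```python
import re

# Field names are assumptions based on the kernel's task-dump layout:
# comm, state, saved PC, free stack, pid, parent pid, thread flags.
TASK_LINE = re.compile(
    r"^(?P<comm>\S+)\s+(?P<state>[A-Za-z])\s+(?P<pc>[0-9a-f]{16})\s+"
    r"(?P<free_stack>\d+)\s+(?P<pid>\d+)\s+(?P<ppid>\d+)\s+"
    r"(?P<flags>0x[0-9a-f]+)$"
)


def parse_task_line(line):
    """Return a dict of fields from one task summary line, or None."""
    m = TASK_LINE.match(line.strip())
    if m is None:
        return None
    fields = m.groupdict()
    # Convert the numeric columns so callers can compare them directly.
    for key in ("free_stack", "pid", "ppid"):
        fields[key] = int(fields[key])
    fields["flags"] = int(fields["flags"], 16)
    return fields


# The line quoted in this report.
task = parse_task_line(
    "modprobe        D fffffe000009463c     0 17063  17020 0x00000200"
)
```

The state column is the part that matters here: `D` means the task is in uninterruptible sleep, which is why the hung-task detector in the console log keeps reporting PID 17063.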

Version-Release number of selected component (if applicable):
 distro: RHEL-7.3-20160629.n.0 Server aarch64
 kernel: 4.5.0-0.43.el7 
 task: /kernel/misc/module-load 1.4-23

How reproducible:
 unknown

Steps to Reproduce:
1. Install an ARM host with distro RHEL-7.3-20160629.n.0 Server aarch64
2. Install kernel 4.5.0-0.43.el7 
3. Run /kernel/misc/module-load 1.4-23
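
The internals of the module-load task are not shown in this report; the console log only shows it loading and then unloading 8021q. As a rough way to exercise that one cycle by hand, here is a minimal Python sketch. It assumes root privileges and assumes that plain `modprobe` / `modprobe -r` approximates what the task does; the helper names `run_step` and `load_unload` are hypothetical.

```python
import subprocess


def run_step(cmd, timeout):
    """Run cmd and return its exit code, or None if it is still
    running after `timeout` seconds."""
    proc = subprocess.Popen(cmd)
    try:
        return proc.wait(timeout=timeout)
    except subprocess.TimeoutExpired:
        # Deliberately no kill() followed by wait(): a task stuck in
        # D state ignores signals, so waiting for it to die would
        # hang this script as well.
        return None


def load_unload(module="8021q", timeout=120, modprobe="modprobe"):
    """Load then unload one module, reporting the first step that
    fails or does not finish in time. Requires root."""
    steps = [
        ("load", [modprobe, module]),
        ("unload", [modprobe, "-r", module]),
    ]
    for name, cmd in steps:
        rc = run_step(cmd, timeout)
        if rc is None:
            return f"{name} hung"
        if rc != 0:
            return f"{name} failed ({rc})"
    return "ok"
```

Calling `load_unload()` as root on an affected host would be expected to return `"unload hung"` if the problem reproduces. The 120-second default is chosen only to mirror the hung-task threshold seen in the console log.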

Actual results:
https://beaker.engineering.redhat.com/jobs/1390332
https://beaker.engineering.redhat.com/recipes/2841022#task42630139
http://beaker-archive.app.eng.bos.redhat.com/beaker-logs/2016/07/13903/1390332/2841022/console.log
---<-snip->---
[  528.711245] restraintd[4942]: ** Running task: 42630139 [/kernel/misc/module-load] 
** Attempting to load 8021q... ** 
[  528.921768] 8021q: 802.1Q VLAN Support v1.8 
** Attempting to unload 8021q... ** 
[  589.043663] restraintd[4942]: *** Current Time: Sat Jul 02 02:38:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  649.090509] restraintd[4942]: *** Current Time: Sat Jul 02 02:39:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[-- MARK -- Sat Jul  2 06:40:00 2016] 
[  709.094700] restraintd[4942]: *** Current Time: Sat Jul 02 02:40:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  720.083748] INFO: task modprobe:17063 blocked for more than 120 seconds. 
[  720.090432]       Tainted: G        W   E  ------------   4.5.0-0.43.el7.aarch64 #1 
[  720.098058] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. 
[  720.105858] modprobe        D fffffe000009463c     0 17063  17020 0x00000200 
[  720.112907] Call trace: 
[  720.115353] [<fffffe000009463c>] __switch_to+0x5c/0x68 
[  720.120486] [<fffffe0000785fd8>] __schedule+0x1d8/0x68c 
[  720.125702] [<fffffe00007864c4>] schedule+0x38/0x90 
[  720.130563] [<fffffe0000788f14>] schedule_timeout+0x18c/0x244 
[  720.136296] [<fffffe000010baa4>] wait_woken+0x54/0x94 
[  720.141332] [<fffffe0000659028>] rtnl_link_unregister+0x108/0x10c 
[  720.147416] [<fffffdfffc8e3538>] vlan_netlink_fini+0x10/0x20 [8021q] 
[  720.153751] [<fffffdfffc8e34ec>] vlan_cleanup_module+0x2c/0x68 [8021q] 
[  720.160263] [<fffffe0000146a60>] SyS_delete_module+0x1e8/0x23c 
[  720.166075] [<fffffe0000091a8c>] __sys_trace_return+0x0/0x4 
[  769.092057] restraintd[4942]: *** Current Time: Sat Jul 02 02:41:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  829.089556] restraintd[4942]: *** Current Time: Sat Jul 02 02:42:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  840.172916] INFO: task modprobe:17063 blocked for more than 120 seconds. 
[  840.179598]       Tainted: G        W   E  ------------   4.5.0-0.43.el7.aarch64 #1 
[  840.187226] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. 
[  840.195027] modprobe        D fffffe000009463c     0 17063  17020 0x00000200 
[  840.202079] Call trace: 
[  840.204523] [<fffffe000009463c>] __switch_to+0x5c/0x68 
[  840.209656] [<fffffe0000785fd8>] __schedule+0x1d8/0x68c 
[  840.214874] [<fffffe00007864c4>] schedule+0x38/0x90 
[  840.219747] [<fffffe0000788f14>] schedule_timeout+0x18c/0x244 
[  840.225483] [<fffffe000010baa4>] wait_woken+0x54/0x94 
[  840.230526] [<fffffe0000659028>] rtnl_link_unregister+0x108/0x10c 
[  840.236603] [<fffffdfffc8e3538>] vlan_netlink_fini+0x10/0x20 [8021q] 
[  840.242946] [<fffffdfffc8e34ec>] vlan_cleanup_module+0x2c/0x68 [8021q] 
[  840.249452] [<fffffe0000146a60>] SyS_delete_module+0x1e8/0x23c 
[  840.255274] [<fffffe0000091a8c>] __sys_trace_return+0x0/0x4 
[  889.094210] restraintd[4942]: *** Current Time: Sat Jul 02 02:43:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  949.058661] restraintd[4942]: *** Current Time: Sat Jul 02 02:44:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 
[  960.261846] INFO: task modprobe:17063 blocked for more than 120 seconds. 
---<-snip->---
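
To triage a long console log like the one above without reading it line by line, a small parser can pull out each hung-task report and its call trace. The following is a minimal Python sketch written against the message format visible in this excerpt; the function name `hung_task_reports` and the dictionary keys are hypothetical.

```python
import re

# "[  720.083748] INFO: task modprobe:17063 blocked for more than 120 seconds."
HUNG = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<comm>[^:]+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds\."
)
# "[  720.147416] [<fffffdfffc8e3538>] vlan_netlink_fini+0x10/0x20 [8021q]"
FRAME = re.compile(
    r"^\[\s*\d+\.\d+\]\s+\[<[0-9a-f]+>\]\s+"
    r"(?P<sym>\S+?)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<mod>[^\]]+)\])?"
)


def hung_task_reports(lines):
    """Collect hung-task reports and the stack frames that follow each."""
    reports = []
    current = None
    for line in lines:
        m = HUNG.match(line)
        if m:
            current = {
                "time": float(m.group("ts")),
                "comm": m.group("comm"),
                "pid": int(m.group("pid")),
                "threshold": int(m.group("secs")),
                "frames": [],  # (symbol, module or None), innermost first
            }
            reports.append(current)
            continue
        if current is None:
            continue
        f = FRAME.match(line)
        if f:
            current["frames"].append((f.group("sym"), f.group("mod")))
        elif current["frames"]:
            # First non-frame line after the trace ends this report.
            current = None
    return reports
```

Passing an open file works, for example `hung_task_reports(open("console.log"))` for a locally saved copy of the log. In the excerpt above the reports land at roughly 720 s, 840 s and 960 s of uptime, one per 120-second threshold, all for PID 17063; the first two carry the same stack, with modprobe waiting in `rtnl_link_unregister` after being called from the 8021q module's cleanup path.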

Expected results:
 successful completion of module-load task

Additional info:

Comment 3 Mark Langsdorf 2016-07-22 14:34:05 UTC
Iyappan, please see if you can reproduce this and if so, resolve it.

Comment 4 John Feeney 2016-08-12 18:49:10 UTC
This is another A3 Mustang. Can I assume it has not been seen on a B0, Merlin or any other system we test on?

Comment 7 Jon Masters 2016-08-29 18:31:23 UTC
Let's see how this goes with a -5 kernel.

Comment 9 Chris Tatman 2016-09-12 18:37:40 UTC
Closing this one as fixed per discussion today.

--Chris

