Description of problem: We hit LWD during /kernel/misc/module-load, due to defunct process. modprobe D fffffe000009463c 0 17063 17020 0x00000200 Version-Release number of selected component (if applicable): distro: RHEL-7.3-20160629.n.0 Server aarch64 kernel: 4.5.0-0.43.el7 task: /kernel/misc/module-load 1.4-23 How reproducible: unknown Steps to Reproduce: 1. Install ARM host with distro: RHEL-7.3-20160629.n.0 Server aarch64 2. Install kernel 4.5.0-0.43.el7 3. Run /kernel/misc/module-load 1.4-23 Actual results: https://beaker.engineering.redhat.com/jobs/1390332 https://beaker.engineering.redhat.com/recipes/2841022#task42630139 http://beaker-archive.app.eng.bos.redhat.com/beaker-logs/2016/07/13903/1390332/2841022/console.log ---<-snip->--- [ 528.711245] restraintd[4942]: ** Running task: 42630139 [/kernel/misc/module-load] ** Attempting to load 8021q... ** [ 528.921768] 8021q: 802.1Q VLAN Support v1.8 ** Attempting to unload 8021q... ** [ 589.043663] restraintd[4942]: *** Current Time: Sat Jul 02 02:38:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 649.090509] restraintd[4942]: *** Current Time: Sat Jul 02 02:39:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [-- MARK -- Sat Jul 2 06:40:00 2016] [ 709.094700] restraintd[4942]: *** Current Time: Sat Jul 02 02:40:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 720.083748] INFO: task modprobe:17063 blocked for more than 120 seconds. [ 720.090432] Tainted: G W E ------------ 4.5.0-0.43.el7.aarch64 #1 [ 720.098058] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 720.105858] modprobe D fffffe000009463c 0 17063 17020 0x00000200 [ 720.112907] Call trace: [ 720.115353] [<fffffe000009463c>] __switch_to+0x5c/0x68 [ 720.120486] [<fffffe0000785fd8>] __schedule+0x1d8/0x68c [ 720.125702] [<fffffe00007864c4>] schedule+0x38/0x90 [ 720.130563] [<fffffe0000788f14>] schedule_timeout+0x18c/0x244 [ 720.136296] [<fffffe000010baa4>] wait_woken+0x54/0x94 [ 720.141332] [<fffffe0000659028>] rtnl_link_unregister+0x108/0x10c [ 720.147416] [<fffffdfffc8e3538>] vlan_netlink_fini+0x10/0x20 [8021q] [ 720.153751] [<fffffdfffc8e34ec>] vlan_cleanup_module+0x2c/0x68 [8021q] [ 720.160263] [<fffffe0000146a60>] SyS_delete_module+0x1e8/0x23c [ 720.166075] [<fffffe0000091a8c>] __sys_trace_return+0x0/0x4 [ 769.092057] restraintd[4942]: *** Current Time: Sat Jul 02 02:41:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 829.089556] restraintd[4942]: *** Current Time: Sat Jul 02 02:42:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 840.172916] INFO: task modprobe:17063 blocked for more than 120 seconds. [ 840.179598] Tainted: G W E ------------ 4.5.0-0.43.el7.aarch64 #1 [ 840.187226] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 840.195027] modprobe D fffffe000009463c 0 17063 17020 0x00000200 [ 840.202079] Call trace: [ 840.204523] [<fffffe000009463c>] __switch_to+0x5c/0x68 [ 840.209656] [<fffffe0000785fd8>] __schedule+0x1d8/0x68c [ 840.214874] [<fffffe00007864c4>] schedule+0x38/0x90 [ 840.219747] [<fffffe0000788f14>] schedule_timeout+0x18c/0x244 [ 840.225483] [<fffffe000010baa4>] wait_woken+0x54/0x94 [ 840.230526] [<fffffe0000659028>] rtnl_link_unregister+0x108/0x10c [ 840.236603] [<fffffdfffc8e3538>] vlan_netlink_fini+0x10/0x20 [8021q] [ 840.242946] [<fffffdfffc8e34ec>] vlan_cleanup_module+0x2c/0x68 [8021q] [ 840.249452] [<fffffe0000146a60>] SyS_delete_module+0x1e8/0x23c [ 840.255274] [<fffffe0000091a8c>] __sys_trace_return+0x0/0x4 [ 889.094210] restraintd[4942]: *** Current Time: Sat Jul 02 02:43:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 949.058661] restraintd[4942]: *** Current Time: Sat Jul 02 02:44:15 2016 Localwatchdog at: Sat Jul 02 03:37:14 2016 [ 960.261846] INFO: task modprobe:17063 blocked for more than 120 seconds. ---<-snip->--- Expected results: successful completion of module-load task Additional info:
Iyappan, please see if you can reproduce this and if so, resolve it.
This is another A3 Mustang. Can I assume it has not been seen on a B0, Merlin or any other system we test on?
Let's see how this goes with a -5 kernel.
Closing this one as fixed per discussion today. --Chris