| Summary: | BUG: soft lockup - CPU#1 stuck for 67s! [while trying to free memory] | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Andy Lawrence <dr.diesel> | ||||||
| Component: | kernel | Assignee: | Kernel Maintainer List <kernel-maint> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||||
| Severity: | medium | Docs Contact: | |||||||
| Priority: | unspecified | ||||||||
| Version: | 15 | CC: | gansalmon, itamar, james.bottomley, jonathan, kernel-maint, llevet, madhu.chinakonda | ||||||
| Target Milestone: | --- | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | x86_64 | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2011-06-24 11:39:09 UTC | Type: | --- | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Attachments: |
|
||||||||
|
Description
Andy Lawrence
2011-04-08 14:17:21 UTC
Created attachment 490805 [details]
dmesg dump
Created attachment 490806 [details]
messages dump
This is similar to https://bugzilla.redhat.com/show_bug.cgi?id=643661 but a different process name. The debug kernel has some heavy debugging options enabled, does this also happen with a non-debug kernel? Also, your system looks to be under heavy memory pressure. Yeah, I loaded the debug kernel hoping to provide more useful output when the bug hit. Funny you mention memory, this has not happened since I installed another 4G of memory. Before it only had 2G (no swap either), but I just installed it 2 days ago so not much time with it. Thanks I keep seeing something similar in an attempted Fedora 15 Beta install. I've also got a Lenovo Sandybridge laptop with 2GB of ram. Best guess is that it's writeout related because anything that touches the filesystem hangs after it happens. I can trigger it reliably by untarring a 90GB tar file of someone's home directory. Whatever it is, it's also occurring with upstream, so I'll report it there James, since I added that additional 4G of memory it has not happened once! Still a bug but.... Please post the upstream tracker once you file. Thanks (In reply to comment #7) > Please post the upstream tracker once you file. I'm not sure what you mean by this. The thread is going on here: http://marc.info/?t=130392066000001 The current theory is that it's a bad interaction between the shrinkers and the cgroup memory controller. One apparent workaround is just to disable the cgroup memory controller (In reply to comment #8) > (In reply to comment #7) > > Please post the upstream tracker once you file. > > I'm not sure what you mean by this. The thread is going on here: > You mentioned posting this upstream, I'm guessing at bugzilla.kernel.org? If so please scroll to the top of this bug report and add it to the "External Tracker" box. Thanks Hi, Same append for me. BUG: soft lockup - CPU#1 stuck for 67s! [kswapd0:35] I'm on fedora 15 kernel : 2.6.38.6-27.fc15.x86_64 #1 SMP My cpu is Intel(R) Core(TM) i3-2100T CPU @ 2.50GHz MotherBoard : Gigabyte Technology Co., Ltd. H67N-USB3-B3/H67N-USB3-B3, BIOS F5 03/31/2011 I have 2GB of memory.My HDD is in ahci mode. I can reprodure the bug by this : dd if=/dev/zero of=/root/test.dat bs=50M count=50 dd if=/root/test.dat of=/dev/null dd if=/dev/zero of=/root/test.dat bs=50M count=50 and lockup at during this last command. The computer responding to ping, but enable to have access to anything. I have try disable cpuspeed and add cgroup_disable=memory to kernel command with no success, the computer hang each time you solicit all the memory. Thanks. Ludo. Hi, The bug is corrected on version 2.6.38.8 http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=f06590bd718ed950c98828e30ef93204028f3210 So, waiting for a new build of FC15's kernel. Thanks. Ludo. Hi,
I'm just update kernel to 2.6.38.8-32.fc15.x86_64 but the is already a problem.
My computer don't hang now, but kswapd0 still take 100% off on core after my test :
3 times after boot :
dd if=/dev/zero of=/root/test.dat bs=50M count=50
dd if=/root/test.dat of=/dev/null
and the result :
top - 23:13:27 up 12 min, 1 user, load average: 1.01, 0.99, 0.62
Tasks: 106 total, 2 running, 104 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 25.2%sy, 0.0%ni, 74.8%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 1937388k total, 1093916k used, 843472k free, 7596k buffers
Swap: 3899388k total, 0k used, 3899388k free, 906172k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
35 root 20 0 0 0 0 R 99.8 0.0 10:17.97 kswapd0
1 root 20 0 37076 4268 1496 S 0.0 0.2 0:00.93 systemd
2 root 20 0 0 0 0 S 0.0 0.0 0:00.00 kthreadd
3 root 20 0 0 0 0 S 0.0 0.0 0:00.00 ksoftirqd/0
6 root RT 0 0 0 0 S 0.0 0.0 0:00.00 migration/0
...
You can see : uptime 12 min and cpu for kswapd0 10 min and don't stop ...
Thanks.
Ludo.
(In reply to comment #12) > Hi, > > I'm just update kernel to 2.6.38.8-32.fc15.x86_64 but the is already a problem. > My computer don't hang now, but kswapd0 still take 100% off on core after my > test : That is bug 712019 . Since the computer no longer hangs with softlockup messages, which is what the original report was about, I'll close this bug. |