Red Hat Bugzilla – Bug 1473242
kernel BUG at mm/page_alloc.c:1877
Last modified: 2017-12-25 22:57:37 EST
Created attachment 1301639
3 Kernel Bug Dumps
Description of problem:
When performing backups using rsync to an ext4 filesystem on top of LUKS and LVM, the system eventually locks up completely and dumps a kernel BUG trace. I've attached several of those traces. Note that even though some of the traces are marked as "tainted", there is an untainted trace as well. This has been happening for some time, perhaps since the 4.9 or 4.10 series of kernels under F25. I've only just tracked down what was going on.
I've tried all sorts of BIOS settings, memory tests, disk tests, disabling swap, changing the destination disk, etc., but none of that has made any difference.
Version-Release number of selected component (if applicable):
Linux rex.homenet.lan 4.11.10-300.fc26.x86_64 #1 SMP Wed Jul 12 17:05:39 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
Steps to Reproduce:
1. See above
Please let me know what other output you require. Thanks
Created attachment 1301665
Kernel Dump with XFS
It seems ext4 is not to blame. The same lockup occurs when using XFS.
I suspect the memory layout of your particular system is tripping something. I'd recommend reporting this bug upstream since it's been so consistent. You can use firstname.lastname@example.org or file a bugzilla.kernel.org bug which will also get converted into e-mail.
Reported upstream as bug 196443.
For what it is worth: I have been hit by the same bug running CentOS 7 with kernel 3.10.0-514.26.2.el7.x86_64.
I have hooked my findings onto Ian's upstream bug.
I'm not so sure it is the same bug, Sigurd. For what it's worth, this issue persists with 4.14.6-200.fc26, yet 4.10.17 remains rock solid.
Is there anything that can be done, Laura?