Bug 1473242 - kernel BUG at mm/page_alloc.c:1877
kernel BUG at mm/page_alloc.c:1877
Status: NEW
Product: Fedora
Classification: Fedora
Component: kernel (Show other bugs)
26
x86_64 Linux
unspecified Severity high
: ---
: ---
Assigned To: Kernel Maintainer List
Fedora Extras Quality Assurance
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2017-07-20 05:55 EDT by Ian Chapman
Modified: 2017-12-25 22:57 EST (History)
11 users (show)

See Also:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed:
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)
3 Kernel Bug Dumps (15.39 KB, text/plain)
2017-07-20 05:55 EDT, Ian Chapman
no flags Details
Kernel Dump with XFS (6.45 KB, text/plain)
2017-07-20 07:02 EDT, Ian Chapman
no flags Details

  None (edit)
Description Ian Chapman 2017-07-20 05:55:57 EDT
Created attachment 1301639 [details]
3 Kernel Bug Dumps

Description of problem:

When performing backups using rsync to an ext4 filesystem on top of LUKs & LVM, the system eventually locks up completely and dumps a kernel BUG trace. I've attached several of those traces. Note, even though some of the traces are marked as "tainted", there is an untainted traces also. This has been happening for sometime... perhaps since the 4.9 or 4.10 series of kernels under F25. I've only just tracked down what was going on.

I've tried all sorts of BIOS settings, memory tests, disk tests, disabling swap, changing the destination disk etc.etc but none of that has made any difference. 


Version-Release number of selected component (if applicable):

Linux rex.homenet.lan 4.11.10-300.fc26.x86_64 #1 SMP Wed Jul 12 17:05:39 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

rsync-3.1.2-5.fc26.x86_64
cryptsetup-1.7.5-1.fc26.x86_64
lvm2-2.02.168-6.fc26.x86_64



How reproducible:


Steps to Reproduce:
1. See above
2.
3.

Actual results:


Expected results:


Additional info:

Please let me know what other output you require. Thanks
Comment 1 Ian Chapman 2017-07-20 07:02 EDT
Created attachment 1301665 [details]
Kernel Dump with XFS

Seems EXT4 is not to blame. The same occurs when using XFS
Comment 2 Laura Abbott 2017-07-20 10:44:37 EDT
I suspect the memory layout of your particular system is tripping something. I'd recommend reporting this bug upstream since it's been so consistent. You can use linux-mm@kvack.org or file a bugzilla.kernel.org bug which will also get converted into e-mail.
Comment 3 Ian Chapman 2017-07-22 06:34:55 EDT
Reported upstream as bug 196443
Comment 5 Sigurd Urdahl 2017-09-01 09:44:40 EDT
For what it is worth: I have been hit by the same bug running CentOS7 with kernel3.10.0-514.26.2.el7.x86_64

I have hooked my findings onto Ian's upstream bug.
Comment 6 Ian Chapman 2017-12-25 22:57:37 EST
I'm not so sure it is the same bug Sigurd. For what it's worth this issue persists with 4.14.6-200.fc26 yet 4.10.17 remains rock solid.

Is there anything that can be done Laura?

Note You need to log in before you can comment on or make changes to this bug.