Bug 472581

Summary: Kernel panic during hibernate on Tyan S2881
Product: Red Hat Enterprise Linux 5 Reporter: Orion Poplawski <orion>
Component: kernelAssignee: John Feeney <jfeeney>
Status: CLOSED CURRENTRELEASE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: medium Docs Contact:
Priority: medium    
Version: 5.2CC: dzickus
Target Milestone: rc   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2009-03-04 11:23:17 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
dmesg from S2881 system
none
dmesg from S2882 system
none
Don't save aperture memory none

Description Orion Poplawski 2008-11-21 21:05:17 UTC
Description of problem:

Going into hibernate on Tyan S2881 based system panics running 2.6.18-92.1.18.el5.

Hand copied:

Process: bash
Trace:
swsusp_arch_suspend+0x7c/0x80
swsusp_suspend+0x2c/0x51
pm_suspend_disk+0x42/0xce
enter_state+0x5e/0x19b
state_store+0x5e/0x79
sysfs_write_file+0xb9/0xe8
vfs_write+0x45/0x6e
tracesys+0xd5/0xe0

Code: 48 8b 04 11 48 89 04 16 48 83 c2 08 48 81 fa 00 10 00 00 75
RIP: swsusp_save+0x27a/0x335
RSP: <ffff81017861de10>
CR2: <ffff81000c000000>

Kernel panic - not syncing: Fatal exception

Tried to capture with serial console, but all I get is:

Disabling non-boot CPUs ...
Breaking affinity for irq 4
CPU 1 is now offline
CPU1 is down
Stopping tasks: ==============================================================================================|
Shrinking memory... done (60097 pages freed)
 usbdev2.1_ep81: PM: suspend 0->1, parent 2-0:1.0 already 1
 usbdev1.1_ep81: PM: suspend 0->1, parent 1-0:1.0 already 1

Comment 1 Matthew Garrett 2009-02-17 00:38:23 UTC
Could you attach the full dmesg for this system? Does booting with the

iommu=soft

kernel commandline argument avoid the oops?

Comment 2 Matthew Garrett 2009-02-17 00:41:09 UTC
Also, is it possible to test 5.3 on this system? A bug was fixed that would lead to failed hibernation on amd64 systems with 4GB or more of RAM.

Comment 3 Orion Poplawski 2009-02-18 17:26:18 UTC
Created attachment 332422 [details]
dmesg from S2881 system

iommu=soft does appear to prevent the crash, however it appears to disable wake on lan as well.  Or maybe wake on lan has never worked from hibernate and I just haven't been able to do it before.  wake on lan from power off works.

5.3 will have to wait for a bit, but we will be moving to it soon.

Comment 4 Orion Poplawski 2009-02-18 17:30:36 UTC
Created attachment 332423 [details]
dmesg from S2882 system

Also seeing on S2882 system.  Have crash dumps as well.

Comment 5 Matthew Garrett 2009-02-18 17:37:29 UTC
Created attachment 332426 [details]
Don't save aperture memory

The bug is caused by the system attempting to save and restore the contents of the iommu aperture. This patch should fix the problem and has been integrated into the 5.3 release.

Comment 6 Orion Poplawski 2009-02-18 18:31:02 UTC
I've installed the 2.6.18-128.1.1 kernel and indeed hibernate is now working.  Thanks!

I am having trouble with wake on lan though.  When I powerdown the NIC is left running (NIC light on the machine is on).  When I use hibernate, the NIC is turned off.  Does the kernel have any control over this, or is this a BIOS only thing?

Comment 7 Matthew Garrett 2009-02-18 18:38:28 UTC
Wake on LAN sounds like #476501. Could you subscribe to that bug and provide the information requested there?

Comment 8 Orion Poplawski 2009-02-18 18:49:44 UTC
Will do.  Feel free to close this bug.