Bug 678964 - kernel panic while suspending to disk
Keywords:
Status: CLOSED WORKSFORME
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.8
Hardware: Unspecified
OS: Linux
Priority: medium
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: John Feeney
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2011-02-21 05:29 UTC by Hushan Jia
Modified: 2013-01-10 07:34 UTC

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2011-05-11 19:37:17 UTC
Target Upstream Version:



Description Hushan Jia 2011-02-21 05:29:23 UTC
Description of problem:
The kernel panics while suspending to disk via 'echo disk > /sys/power/state'.

Version-Release number of selected component (if applicable):
2.6.18-238.el5

How reproducible:
most of the time

Steps to Reproduce:
1. echo disk > /sys/power/state
Actual results:
The kernel panics while saving the image data pages.

Expected results:
The system suspends to disk successfully.
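
The reproduction step above can be scripted roughly as follows. This is a minimal sketch, not part of the original report: the function names are hypothetical, and it assumes the standard sysfs file /sys/power/state, which lists the supported sleep states separated by spaces. On a real system the write needs root and does not return until the machine resumes (or panics).

```python
from pathlib import Path


def supported_states(state_text):
    """Split the contents of a power state file into state names."""
    return state_text.split()


def request_suspend_to_disk(state_path="/sys/power/state"):
    """Write 'disk' to the state file if the kernel advertises that state.

    Returns False without writing when 'disk' is not listed.
    The default path is the real sysfs file; pass another path for dry runs.
    """
    path = Path(state_path)
    if "disk" not in supported_states(path.read_text()):
        return False
    path.write_text("disk\n")  # equivalent to: echo disk > state
    return True
```

Passing the path of an ordinary file instead of the sysfs file exercises the check without suspending anything.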

Additional info:
[root@hp-dl320g5-01 power]# cat state
disk 
[root@hp-dl320g5-01 power]# echo disk > state
Disabling non-boot CPUs ...
Breaking affinity for irq 3
Breaking affinity for irq 138
Breaking affinity for irq 154
Breaking affinity for irq 233
CPU 1 is now offline
CPU1 is down
Stopping tasks: ======================================================================================|
Shrinking memory... done (19156 pages freed)
Saving image data pages (121280 pages) ...     <1>Unable to handle kernel paging request at 000000003d1af038 RIP: 
 [<ffffffff800ac61d>] snapshot_read_next+0x111/0x1bc
PGD 1f1f5067 PUD 220aa067 PMD 0 
Oops: 0000 [1] SMP 
last sysfs file: /power/state
CPU 0 
Modules linked in: autofs4 hidp rfcomm l2cap bluetooth lockd sunrpc ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp bnx2i cnic ipv6 xfrm_nalgo crypto_api uio cxgb3i cxgb3 libiscsi_tcp loop dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec i2c_core dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport be2net tpm_tis sg be2iscsi 8021q tpm pcspkr i3000_edac edac_mc libiscsi2 tpm_bios tg3 shpchp scsi_transport_iscsi2 hpilo serio_raw scsi_transport_iscsi dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod ata_piix libata sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 3385, comm: bash Not tainted 2.6.18-238.el5 #1
RIP: 0010:[<ffffffff800ac61d>]  [<ffffffff800ac61d>] snapshot_read_next+0x111/0x1bc
RSP: 0018:ffff81001f1f9d78  EFLAGS: 00010206
RAX: ffff810000001000 RBX: 0000000000000000 RCX: 000000003d1af030
RDX: 0000000000000002 RSI: 0000000000000003 RDI: ffff810006265000
RBP: ffff81001f1f9da8 R08: 00000000fffffff4 R09: 000000000000003e
R10: ffffffff804525a0 R11: 0000000000000000 R12: 0000000000001000
R13: 000000000001d9c0 R14: 0000000000000000 R15: 0000000000000001
FS:  00002b7cdfc81f50(0000) GS:ffffffff80425000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000003d1af038 CR3: 00000000224ed000 CR4: 00000000000006e0
Process bash (pid: 3385, threadinfo ffff81001f1f8000, task ffff81003de1e0c0)
Stack:  0000000000000000 00000000000004bc ffff81001f1f9e30 ffffffff800ad424
 0000000000000000 ffff81003f2d8000 0000000000001000 0000000000000001
 0000000000000000 ffff81003d1af000 0000000000000000 ffff810006265000
Call Trace:
 [<ffffffff800ad424>] swsusp_write+0x1c8/0x471
 [<ffffffff801cfa3f>] resume_device+0x90/0xa7
 [<ffffffff800ab8a9>] pm_suspend_disk+0x5b/0xce
 [<ffffffff800aa77a>] enter_state+0x5e/0x19b
 [<ffffffff800aa926>] state_store+0x5e/0x79
 [<ffffffff8010fee2>] sysfs_write_file+0xb9/0xe8
 [<ffffffff80016a81>] vfs_write+0xce/0x174
 [<ffffffff80017339>] sys_write+0x45/0x6e
 [<ffffffff8005d28d>] tracesys+0xd5/0xe0


Code: 48 8b 41 08 48 63 d2 48 89 04 d7 48 8b 49 10 89 f2 48 ff c6 
RIP  [<ffffffff800ac61d>] snapshot_read_next+0x111/0x1bc
 RSP <ffff81001f1f9d78>
CR2: 000000003d1af038
 <0>Kernel panic - not syncing: Fatal exception
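
A hedged reading of the oops above: the first four bytes of the Code line (48 8b 41 08) match the x86-64 encoding of a 64-bit load from 8(%rcx) into %rax, and RCX + 8 equals the faulting address reported in CR2. The sketch below checks that arithmetic using values copied from the trace. It assumes the Code line begins at the faulting instruction, and the helper name decode_load_disp8 is made up for illustration. It does not identify the root cause.

```python
# Values copied verbatim from the oops.
RCX = 0x000000003D1AF030
CR2 = 0x000000003D1AF038
CODE = bytes.fromhex(
    "48 8b 41 08 48 63 d2 48 89 04 d7 48 8b 49 10 89 f2 48 ff c6"
)

# Low eight 64-bit registers in x86-64 encoding order.
REGS = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi"]


def decode_load_disp8(code):
    """Decode 'REX.W 8B /r' with an 8-bit displacement and no SIB byte.

    Returns (destination register, base register, displacement).
    Only this one instruction form is handled; anything else raises.
    """
    rex, opcode, modrm = code[0], code[1], code[2]
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    if rex != 0x48 or opcode != 0x8B or mod != 1 or rm == 4:
        raise ValueError("not a plain 64-bit load with disp8")
    disp = code[3] - 256 if code[3] >= 0x80 else code[3]  # sign-extend
    return REGS[reg], REGS[rm], disp


dest, base, disp = decode_load_disp8(CODE)
print(f"mov {disp:#x}(%{base}),%{dest}")
print(f"RCX + disp = {RCX + disp:#x}, CR2 = {CR2:#x}")
```

If that reading is right, the pointer held in RCX (0x3d1af030) is what was dereferenced. It lies well below the kernel addresses that appear elsewhere in the register dump, such as RDI.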

Comment 1 Hushan Jia 2011-02-21 05:31:17 UTC
Occasionally the suspend succeeds:
[root@hp-dl320g5-01 power]# echo disk > state
Disabling non-boot CPUs ...
Breaking affinity for irq 3
Breaking affinity for irq 138
Breaking affinity for irq 154
Breaking affinity for irq 233
CPU 1 is now offline
CPU1 is down
Stopping tasks: ======================================================================================|
Shrinking memory... done (19157 pages freed)
Saving image data pages (125393 pages) ...   4%<6>tg3: eth0: Link is up at 1000 Mbps, full duplex.
done
Wrote 501572 kbytes in 49.93 seconds (10.04 MB/s)
S|
sd 0:0:0:0: [sda] Stopping disk
Power down.
acpi_power_off called
 The server is not powered on.  The Virtual Serial Port is not available.
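
The figures in the successful run above are internally consistent, which a few lines of arithmetic can confirm. This is a sketch under stated assumptions: a 4 KiB page size (usual on x86-64) and truncating integer arithmetic for the throughput, chosen here only because it reproduces the logged 10.04 MB/s. The function names are hypothetical.

```python
PAGE_SIZE_KB = 4  # assumption: 4 KiB pages


def image_kbytes(pages, page_kb=PAGE_SIZE_KB):
    """Size of the saved image in kbytes for a given page count."""
    return pages * page_kb


def throughput_mb_s(kbytes, seconds):
    """Throughput as an 'N.NN' string, truncated rather than rounded.

    Works in whole centiseconds and kbytes per second, then divides by 1000.
    """
    centisecs = round(seconds * 100)
    kps = kbytes * 100 // centisecs
    return f"{kps // 1000}.{(kps % 1000) // 10:02d}"


# 125393 pages and 49.93 seconds are taken from the log.
print(image_kbytes(125393))            # kbytes written
print(throughput_mb_s(501572, 49.93))  # MB/s
```

With these inputs the page count maps exactly to the 501572 kbytes reported, so the image was written in full on this run.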

Comment 4 Hushan Jia 2011-02-22 07:28:06 UTC
Set regression based on the results.

Comment 9 Matthew Garrett 2011-05-11 19:37:17 UTC
Can't reproduce with 2.6.18-259.el5

