Bug 539521 - Call trace error display when resume from suspend to disk (ide block) - pvclock related
Summary: Call trace error display when resume from suspend to disk (ide block) - pvclo...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.5
Hardware: All
OS: Linux
low
medium
Target Milestone: rc
: ---
Assignee: Glauber Costa
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
: 527927 (view as bug list)
Depends On:
Blocks: 528898 560640
TreeView+ depends on / blocked
 
Reported: 2009-11-20 13:24 UTC by Suqin Huang
Modified: 2013-01-11 02:37 UTC (History)
11 users (show)

Fixed In Version: kernel-2.6.18-187.el5
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2010-03-30 07:42:46 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
suspend to disk (22.09 KB, image/png)
2009-11-20 13:24 UTC, Suqin Huang
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2010:0178 0 normal SHIPPED_LIVE Important: Red Hat Enterprise Linux 5.5 kernel security and bug fix update 2010-03-29 12:18:21 UTC

Description Suqin Huang 2009-11-20 13:24:39 UTC
Created attachment 372486 [details]
suspend to disk

Description of problem:
Call trace error display when resume from suspend to disk (ide block)

Version-Release number of selected component (if applicable):
kvm-83-135.el5

How reproducible:
most of time

Steps to Reproduce:
1. install vm with if=ide
/usr/libexec/qemu-kvm -drive file=rhel5.4-64.qcow2,if=ide  -no-hpet -rtc-td-hack -smp 8 -m 4G -uuid `uuidgen` -net nic,model=e1000,macaddr=00:A6:78:FA:0D:95,vlan=0 -net tap,vlan=0,script=/etc/qemu-ifup -cpu qemu64,+sse2 -boot n -vnc :16 -monitor stdio -name 5.4-64 -no-kvm-pit-reinjection
2. boot vm 
/usr/libexec/qemu-kvm -drive file=rhel5.4-64.qcow2,if=ide  -no-hpet -rtc-td-hack -smp 8 -m 4G -uuid `uuidgen` -net nic,model=e1000,macaddr=00:A6:78:FA:0D:95,vlan=0 -net tap,vlan=0,script=/etc/qemu-ifup -cpu qemu64,+sse2 -boot c -vnc :16 -monitor stdio -name 5.4-64 -no-kvm-pit-reinjection
3. # echo > disk /sys/power/state
4. Resume vm again after suspend
  
Actual results:
Call trace error display, guest can not resume successfully.

Expected results:


Additional info:

1. guest 
rhel5.4-64 with 2.6.18-164.6.1.el5 kernel

2. host
processor	: 7
vendor_id	: GenuineIntel
cpu family	: 6
model		: 26
model name	: Intel(R) Core(TM) i7 CPU         920  @ 2.67GHz
stepping	: 4
cpu MHz		: 2668.000
cache size	: 8192 KB
physical id	: 0
siblings	: 8
core id		: 3
cpu cores	: 4
apicid		: 7
fpu		: yes

3. attach the error

Comment 2 Gleb Natapov 2010-01-04 13:07:03 UTC
Test on real HW (with same amount cpus) several times please.

Comment 3 Eduardo Habkost 2010-01-04 20:34:31 UTC
Why was this bug moved to MODIFIED without any comment indicating why, and which version fixes it?

Comment 4 Lawrence Lim 2010-01-05 00:50:09 UTC
Could you please follow up Comment #2?

Comment 5 Suqin Huang 2010-01-05 06:28:08 UTC
work well on real HW with 8 cpus, 12G mem, scsi block.

1. # echo disk > /sys/power/state 
2. can access network after resume, firefox, gedit still work after resume.
3. repeat 10 times.

4. host kernel:
2.6.18-164.6.1.el5
5. cpu:
processor	: 7
vendor_id	: GenuineIntel
cpu family	: 6
model		: 26
model name	: Intel(R) Core(TM) i7 CPU         920  @ 2.67GHz
stepping	: 4
cpu MHz		: 1600.000
cache size	: 8192 KB

Comment 6 Gleb Natapov 2010-01-05 06:35:19 UTC
During suspend to disk block subsystem plays major role. Do you happen to have ide HW to test with? Not necessary 8 cpus, 2 may be enough.

Comment 7 Suqin Huang 2010-01-05 08:03:11 UTC
(In reply to comment #5)
> work well on real HW with 8 cpus, 12G mem, scsi block.

Check the block again, it's ide block.

Comment 8 Gleb Natapov 2010-01-05 08:09:49 UTC
(In reply to comment #7)
> (In reply to comment #5)
> > work well on real HW with 8 cpus, 12G mem, scsi block.
> 
> Check the block again, it's ide block.  

ide or sata? I doubt you'll find 8 cpu box with ide disk.

Comment 9 Gleb Natapov 2010-01-05 09:46:57 UTC
(In reply to comment #8)
> (In reply to comment #7)
> > (In reply to comment #5)
> > > work well on real HW with 8 cpus, 12G mem, scsi block.
> > 
> > Check the block again, it's ide block.  
> 
> ide or sata? I doubt you'll find 8 cpu box with ide disk.  

No need to do real HW testing. I found out that disabling kvm clock solves the problem. Unfortunately no-kvmclock kernel option does not take any effect so I had to disable kvm clock in qemu for testing. Glauber can you look at this?

Comment 10 Glauber Costa 2010-01-20 17:39:04 UTC
sidenote: 

no-kvmclock does not work on x86_64 RHEL5 because the option parsing for that is done too late in the game. You'd have to pass clock=<something> (like pmtimer or tsc) in the command line.

I can't reproduce this issue, because I can't even put the machine to disk.
What I get is this:

Stopping tasks: ==================================================================================
 stopping tasks timed out after 120 seconds (1 tasks remaining):
  khungtaskd
Restarting tasks...<6> Strange, khungtaskd not stopped


However, by code inspection, I have a slight idea of what's going on. I am brewing a new scratch build that will be ready shortly at:

http://brewweb.devel.redhat.com/brew/taskinfo?taskID=2214588

Comment 11 Glauber Costa 2010-01-20 20:38:15 UTC
Ok... according to Gleb, that was an unrelated regression introduced in 5.5

So I am building another rpm ontop of 5.4.z

To be ready shortly. Here it is:

http://brewweb.devel.redhat.com/brew/taskinfo?taskID=2215204

Comment 12 Suqin Huang 2010-01-21 10:22:40 UTC
repeat 10 times, can not reproduce with new built package.
guest: rhel5.4-x86_64
1. can access network after resume.
2. can edit files after resume.

CLI:
/usr/libexec/qemu-kvm -drive file=/root/rhel5.4-64-ide.bak,if=ide  -no-hpet -rtc-td-hack -smp 8 -m 4G -uuid `uuidgen` -net nic,model=e1000,macaddr=00:A6:78:FA:0D:95,vlan=0 -net tap,vlan=0,script=/etc/qemu-ifup -cpu qemu64,+sse2 -boot c -vnc :6  -monitor stdio -name 5.4-64 -no-kvm-pit-reinjection

Comment 13 Glauber Costa 2010-01-28 12:05:07 UTC
*** Bug 527927 has been marked as a duplicate of this bug. ***

Comment 21 errata-xmlrpc 2010-03-30 07:42:46 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2010-0178.html


Note You need to log in before you can comment on or make changes to this bug.