700752 – 32-bit PV guest crash on restore on x64_86 host

Bug 700752 - 32-bit PV guest crash on restore on x64_86 host

Summary: 32-bit PV guest crash on restore on x64_86 host

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 5
Classification:	Red Hat
Component:	kernel-xen
Sub Component:
Version:	5.7
Hardware:	x86_64
OS:	Linux
Priority:	unspecified
Severity:	urgent
Target Milestone:	rc
Target Release:	5.8
Assignee:	Igor Mammedov
QA Contact:	Virtualization Bugs
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	514490 784372
TreeView+	depends on / blocked

Reported:	2011-04-29 10:11 UTC by Igor Mammedov
Modified:	2012-02-21 03:46 UTC (History)
CC List:	17 users (show)
Fixed In Version:	kernel-2.6.18-305.el5
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2012-02-21 03:46:53 UTC
Target Upstream Version:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
[RHEL5.9 Xen PATCH] x86: fix a few 32-on-64 compat mode issues (5.59 KB, patch) 2011-12-29 00:34 UTC, Igor Mammedov	no flags	Details \| Diff
View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHSA-2012:0150	0	normal	SHIPPED_LIVE	Moderate: Red Hat Enterprise Linux 5.8 kernel update	2012-02-21 07:35:24 UTC

Description Igor Mammedov 2011-04-29 10:11:41 UTC

Description of problem:
32-bit PV guest crash on restore on x64_86 host.

Version-Release number of selected component (if applicable):
host:
xen-3.0.3-129.el5
kernel-xen-2.6.18-257.el5
guest:
kernel-PAE-2.6.38.2-9.fc15

How reproducible:
100%

Steps to Reproduce:
1. install 32-bit FC15 as PV guest on RHEL5
2. xm save fc15_32 /tmp/dump.fc15
3. xm restore /tmp/dump.fc15
  
Actual results:
Guest crash on restore

Expected results:
guest should run after restore.

Additional info:
excerpt fro hypervisor.log:

(XEN) mm.c:630:d16 Non-privileged (16) attempt to map I/O space 2712536c5e
(XEN) mm.c:3619:d16 ptwr_emulate: fixing up invalid PAE PTE c082712536c5e063
(XEN) /builddir/build/BUILD/kernel-2.6.18/xen/include/asm/mm.h:225:d16 Error pfn 36c5b: rd=ffff8300afdca080, od=ffff8300af1f0080, caf=80000002, taf=00000000e8000001
(XEN) mm.c:654:d16 Error getting mfn 36c5b (pfn 3b325b) from L1 entry 0000000036c5b063 for dom16
(XEN) mm.c:3619:d16 ptwr_emulate: fixing up invalid PAE PTE 0000000036c5b063
(XEN) Pagetable walk from 00000000eb40ff98:
(XEN)  L4[0x000] = 0000000066ad4027 0000000000027494
(XEN)  L3[0x003] = 0000000066eae027 0000000000027492
(XEN)  L2[0x15a] = 0000000237400067 0000000000001600 
(XEN)  L1[0x00f] = 80000002373f8061 000000000002b40f
(XEN) domain_crash_sync called from entry.S
(XEN) Domain 16 (vcpu#0) crashed on cpu#1:
(XEN) ----[ Xen-3.1.2-257.el5  x86_64  debug=n  Not tainted ]----
(XEN) CPU:    1
(XEN) RIP:    0061:[<00000000c04047ee>]
(XEN) RFLAGS: 0000000000010213   CONTEXT: guest
(XEN) rax: 00000000f4d78000   rbx: 00000000eb4aea40   rcx: 000000000000007b
(XEN) rdx: 00000000eb4aea40   rsi: 00000000eb410024   rdi: 00000000c0827117
(XEN) rbp: 00000000eb40ffa0   rsp: 00000000eb40ffa0   r8:  0000000000000000
(XEN) r9:  0000000000000000   r10: 0000000000000000   r11: 0000000000000000
(XEN) r12: 0000000000000000   r13: 0000000000000000   r14: 0000000000000000
(XEN) r15: 0000000000000000   cr0: 000000008005003b   cr4: 00000000000006f0
(XEN) cr3: 0000000250002000   cr2: 00000000f4d78008
(XEN) ds: 007b   es: 007b   fs: 00d8   gs: 00e0   ss: 0069   cs: 0061
(XEN) Guest stack trace from esp=eb40ffa0:
(XEN)   eb41001c c0827152 36c5d063 80000002 36c5c063 80000002 36c5b062 00000000
(XEN)   00000008 eb4aea40 00000000 80000002 36c58063 80000002 36c57063 80000002
(XEN)   36c56063 80000002 36c55063 80000002 36c54063 80000002 36c53063 80000002
(XEN) mm.c:630:d0 Non-privileged (16) attempt to map I/O space 00000000


gdb guest kernel stack trace:
#0  0xc04047ee in xen_read_cr2 () at arch/x86/xen/mmu.c:1244
#1  0xc0827152 in read_cr2 (regs=0xeb410024, error_code=0)
    at /usr/src/debug/kernel-2.6.38.fc15/linux-2.6.38.i686/arch/x86/include/asm/paravirt.h:64
#2  do_page_fault (regs=0xeb410024, error_code=0) at arch/x86/mm/fault.c:974
#3  <signal handler called>
...

Comment 1 Igor Mammedov 2011-05-04 19:54:16 UTC

Bisecting of upstream kernel shows following offending commit:

commit 448f2831934381e9d3c4d93e700ba7bbe14612dc
Author: Jeremy Fitzhardinge <jeremy.fitzhardinge>
Date:   Wed Dec 15 13:19:33 2010 +0000

    xen: add m2p override mechanism

Comment 2 Chuck Ebbert 2011-05-05 11:07:45 UTC

(In reply to comment #1)
> Bisecting of upstream kernel shows following offending commit:
> 
> commit 448f2831934381e9d3c4d93e700ba7bbe14612dc
> Author: Jeremy Fitzhardinge <jeremy.fitzhardinge>
> Date:   Wed Dec 15 13:19:33 2010 +0000
> 
>     xen: add m2p override mechanism

Comment 3 Igor Mammedov 2011-05-09 11:51:35 UTC

Bug reproduces only on more than 1 numa nodes machines,
tested it on 2 sockets intel/amd boxes.

Guset can be restored successfully by turning off 'numa' option on xen hv commandline. 

In addition tested it with SLES 11 host (xen 4.0), it also works as expected
with and without 'numa=on' option.

Looks like xen 3.0 bug.

Comment 4 Andrew Jones 2011-12-15 17:36:28 UTC

Based on comment 3 this sounds like a RHEL5 Xen HV problem. Moving it there.

Comment 5 Igor Mammedov 2011-12-16 16:14:19 UTC

Just retested with latest RHEL5.8 beta xen and FC16 (3.1.5-2.fc16.i686.PAE).

(XEN) mm.c:630:d2 Non-privileged (2) attempt to map I/O space 003c092a
(XEN) mm.c:3624:d2 ptwr_emulate: could not get_page_from_l1e()
(XEN) Pagetable walk from 00000000ebc0eff4:
(XEN)  L4[0x000] = 0000000087ef9027 00000000000018b9
(XEN)  L3[0x003] = 0000000087bff027 0000000000001fb7
(XEN)  L2[0x15e] = 000000008a49b067 0000000000003bd2 
(XEN)  L1[0x00e] = 800000030ecb0061 000000000002bc0e
(XEN) domain_crash_sync called from entry.S
(XEN) Domain 2 (vcpu#0) crashed on cpu#0:
(XEN) ----[ Xen-3.1.2-300.el5  x86_64  debug=n  Not tainted ]----
(XEN) CPU:    0
(XEN) RIP:    0061:[<00000000c092a346>]
(XEN) RFLAGS: 0000000000010217   CONTEXT: guest
(XEN) rax: 0000000000000000   rbx: 0000000000000000   rcx: 000000000000007b
(XEN) rdx: 00000000ebc63ed0   rsi: 00000000ebc0f078   rdi: 00000000c092a310
(XEN) rbp: 00000000ebc0f070   rsp: 00000000ebc0effc   r8:  0000000000000000
(XEN) r9:  0000000000000000   r10: 0000000000000000   r11: 0000000000000000
(XEN) r12: 0000000000000000   r13: 0000000000000000   r14: 0000000000000000
(XEN) r15: 0000000000000000   cr0: 000000008005003b   cr4: 00000000000026b0
(XEN) cr3: 000000030f364000   cr2: 00000000ebc0eff8
(XEN) ds: 007b   es: 007b   fs: 00d8   gs: 00e0   ss: 0069   cs: 0061
(XEN) Guest stack trace from esp=ebc0effc:
(XEN)   80000003 00000007 00000001 ebc63ed0 ebc0d010 00000000 00000000 00000028
(XEN)   00000001 ebc0d030 ebc0d038 ebc030e0 ebc00300 00000003 00000001 ebc0d048
(XEN)   ebc0d050 ebc0c0c0 ebc00330 00000000 00000000 00000000 00000000 00000000
(XEN)   00000000 f0000000 00000000 ebc0f138 c092a310 ebc0f0b4 c0927997 00000000
(XEN)   0000007b ebc63ed0 ebc0f138 c092a310 ebc0f0b4 f5298000 0000007b 0000007b
(XEN)   000000d8 000000e0 ffffffff c04053ce 00010061 00010017 ebc0f130 c092a34b
(XEN)   00000000 f0000001 0001a1ff ebc63ed0 00000000 00000000 00000000 00000028
(XEN)   00000000 ebc0f060 c0b9e340 00000000 ebc003c0 00000000 00000000 00000001
(XEN)   00000000 00000000 ebc0f104 ebc0f104 00000000 74656e03 00000000 00000000
(XEN)   00000000 f0000002 00000000 ebc0f1f8 c092a310 ebc0f174 c0927997 00000000
(XEN)   0000007b ebc63ed0 ebc0f1f8 c092a310 ebc0f174 f5298000 0000007b 0000007b
(XEN)   000000d8 000000e0 ffffffff c04053ce 00010061 00010017 ebc0f1f0 c092a34b
(XEN)   00000000 f0000003 0005416d ebc63ed0 00000000 00000000 00000000 00000028
(XEN)   c0945280 ebc0f120 c0b9e340 ebdb5240 00000000 00000000 00000000 00000001
(XEN)   00000000 00000000 ebc0f1c4 ebc0f1c4 00000000 00736602 00000000 00000000
(XEN)   00000000 f0000004 00000000 ebc0f2b8 c092a310 ebc0f234 c0927997 00000000
(XEN)   0000007b ebc63ed0 ebc0f2b8 c092a310 ebc0f234 f5298000 0000007b 0000007b
(XEN)   000000d8 000000e0 ffffffff c04053ce 00010061 00010017 ebc0f2b0 c092a34b
(XEN)   00000000 f0000005 0002416d ebc63ed0 00000000 00000000 00000000 00000028
(XEN)   c0945280 00000000 ebc0f180 00000000 00000000 00000000 00000000 00000001

Comment 6 Igor Mammedov 2011-12-29 00:34:16 UTC

Created attachment 549882 [details]
[RHEL5.9 Xen PATCH] x86: fix a few 32-on-64 compat mode issues

Bug uncovered by commit 448f28319343 in upstream kernel by introducing
a following call chain on restore:

      xen_arch_post_suspend
        xen_setup_shared_info
          set_fixmap
            xen_set_fixmap
              mfn_pte
                mfn_to_pfn
                  m2p_find_override
                    spin_lock_irqsave
                      ...
                        xen_save_fl
                          vcpu = percpu_read(xen_vcpu);
                          flags = !vcpu->evtchn_upcall_mask;

where guest tries to access invalid vcpu_info memory area. Since
VCPUOP_register_vcpu_info isn't properly handled by xen for 32-on-64 case
and as result mapping for that memory area isn't saved in restore file.
Consequently on restore xen_vcpu points to a not mapped memory.

xen-unstable c/s 17612 fixes issue by properly handling VCPUOP_register_vcpu_info for 32-on-64bit case.

Comment 7 Igor Mammedov 2011-12-29 00:38:53 UTC

Probably guest was lucky on amd box with numa=off. Since I was able to reproduce BUG on intel systems with several cpus without numa=on.

Comment 8 Paolo Bonzini 2012-01-02 12:55:14 UTC

What about 5.8 or at least 5.8.z?  This is in some sense a regression.

Comment 9 Igor Mammedov 2012-01-02 12:57:28 UTC

why is it regression, it was there forever.

Comment 10 Paolo Bonzini 2012-01-02 13:09:28 UTC

It shows as a regression when you upgrade the guest; migration no longer works.

Comment 11 Andrew Jones 2012-01-02 14:08:05 UTC

I agree with 5.8.z, but it's not a high enough priority issue to slip 5.8's schedule.

Comment 13 Jarod Wilson 2012-01-18 17:02:00 UTC

Patch(es) available in kernel-2.6.18-305.el5
You can download this test kernel (or newer) from http://people.redhat.com/jwilson/el5/
Detailed testing feedback is always welcomed.
If you require guidance regarding testing, please ask the bug assignee.

Comment 15 Jinxin Zheng 2012-01-20 11:04:02 UTC

(In reply to comment #7)
> Probably guest was lucky on amd box with numa=off. Since I was able to
> reproduce BUG on intel systems with several cpus without numa=on.

On what intel system did you reproduce it without numa on please?

Comment 16 Igor Mammedov 2012-01-20 15:03:06 UTC

(In reply to comment #15)
> (In reply to comment #7)
> > Probably guest was lucky on amd box with numa=off. Since I was able to
> > reproduce BUG on intel systems with several cpus without numa=on.
> 
> On what intel system did you reproduce it without numa on please?

on my ws (it's dell-pet610 with 2 sockets) and the other one I do not remember.

Comment 17 Jinxin Zheng 2012-01-21 07:05:05 UTC

I reproduced this on an Intel(E5620) machine with 2 sockets, with the RHEL 5.7 release kernel-xen (-274).

The guest is Fedora-16 32 bit PV with kernel-PAE-3.1.0-7.fc16.i686.

With kernel-xen-274, either with numa=on or off, the guest crashes upon restore. `xm dmesg` shows the crash message:
Domain 2 (vcpu#0) crashed on cpu#15:
...

With kernel-xen-305, the guest does not crash on restore, either with numa on or off.

Also tried RHEL5 and RHEL6 32bit PV guests. This bug does not apply to them.

Comment 18 Laszlo Ersek 2012-01-21 13:01:36 UTC

Thanks for verifying, Jinxin! Can you please move the bug to VERIFIED?

Comment 20 errata-xmlrpc 2012-02-21 03:46:53 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHSA-2012-0150.html

Note You need to log in before you can comment on or make changes to this bug.