Bug 215372
Summary: | Live/Non-live migration sometimes fails between two machines
---|---
Product: | Red Hat Enterprise Linux 5
Reporter: | Chris Lalancette <clalance>
Component: | xen
Assignee: | Glauber Costa <gcosta>
Status: | CLOSED CURRENTRELEASE
Severity: | medium
Priority: | medium
Version: | 5.0
CC: | gcosta, konradr, tao, xen-maint
Hardware: | All
OS: | Linux
Fixed In Version: | 5.0.0
Doc Type: | Bug Fix
Last Closed: | 2007-01-08 14:15:18 UTC
Description
Chris Lalancette
2006-11-13 18:25:52 UTC
Do those messages in xm dmesg refer to a single failure or to multiple ones? Are your logs always this messy, with messages truncated like this? If so, maybe we should file a separate bug for it, since it renders the log useless.

From what I have been able to dig up so far, the mfn we are trying to pin to an L1 table has a forbidden flag set. Moreover, it is an unused flag:

(XEN) DOM0: (file=mm.c, line=572) Bad L1 flags 80000000

(Flags are currently only defined up to 4 digits.) What remains to be answered is how such a flag got there. Does the failure also happen with normal Linux guests, or only with the xm-test instance?

What I have so far: alloc_l3_table calls get_page_and_type_from_pagenr(l3e_get_pfn(pl3e[i]), PGT_l2_page_table | PGT_pae_xen_l2, d) (note the flags PGT_l2_page_table | PGT_pae_xen_l2), which in turn calls get_page_type() with the flags unmodified. The latter contains the following piece of code:

ASSERT(!(x & PGT_pae_xen_l2));

The build is probably running with ASSERTs turned off, so this does not trigger a bug. But it gives strong reason to believe something may be wrong here. This call in alloc_l3_table() is only made for the entry at index 3. And according to your report:

(XEN) DOM0: (file=mm.c, line=1063) Failure in alloc_l3_table: entry 3

All of these are inside conditionals, so I'm not 100% sure, but they are rather good clues that I'll be following up over the next few days.

Aren't you able to see the whole line in the xm dmesg log?

Created attachment 143303: xend.log-failed_migration
Created attachment 143304: xm_dmesg-failed_migration
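
To make the "forbidden flag" reasoning in the comments concrete, here is a minimal C sketch of a flag-range check. It is not Xen's code: DEFINED_FLAGS_MASK, undefined_flag_bits(), l1_flags_ok() and self_check() are hypothetical names, and the mask simply assumes that only the low 4 hex digits carry defined flags, as the comment suggests. It also uses an explicit runtime check rather than an assertion, since an assertion that is compiled out would silently let such a value through.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical mask: assumes only the low 16 bits (4 hex digits) hold
 * defined flags. This is NOT Xen's real L1 disallow mask. */
#define DEFINED_FLAGS_MASK 0x0000FFFFu

/* Return the flag bits that fall outside the assumed defined range. */
uint32_t undefined_flag_bits(uint32_t flags)
{
    return flags & ~DEFINED_FLAGS_MASK;
}

/* Explicit runtime check. Unlike assert(), it is not compiled out when
 * NDEBUG is defined. Returns true when no undefined bit is set. */
bool l1_flags_ok(uint32_t flags)
{
    uint32_t bad = undefined_flag_bits(flags);

    if (bad != 0) {
        fprintf(stderr, "Bad L1 flags %x\n", (unsigned)bad);
        return false;
    }
    return true;
}

/* Illustrative self-check using the value reported in the log. */
void self_check(void)
{
    assert(!l1_flags_ok(0x80000000u));   /* value from the xm dmesg line */
    assert(l1_flags_ok(0x00000063u));    /* only low bits set: accepted */
    assert(undefined_flag_bits(0x80000063u) == 0x80000000u);
}
```

Under these assumptions, the value 80000000 from the log is rejected because its only set bit lies above the assumed defined range, which is consistent with the observation that the flag is unused.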
Please test the RPMs that can be found at http://et.redhat.com/~gcosta/ and see if the problem still persists. Thanks.

Please update the status of this issue with either the package in #12 or the retest in #14. Can't really ack until we have more details.

Closing out based on comment 20.