Bug 646649
Summary: | DomU crash - "Unable to reduce memory reservation" | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | asilva <asilva> | ||||
Component: | kernel-xen | Assignee: | Xen Maintainance List <xen-maint> | ||||
Status: | CLOSED DUPLICATE | QA Contact: | Red Hat Kernel QE team <kernel-qe> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | low | ||||||
Version: | 5.5 | CC: | drjones, jmunilla, jruemker, lersek, mrezanin, pbonzini, smayhew, xen-maint | ||||
Target Milestone: | rc | ||||||
Target Release: | --- | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2011-03-08 16:36:50 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 514489 | ||||||
Attachments: |
|
Description
asilva
2010-10-25 20:10:17 UTC
There aren't that many ways for this hypercall to fail, so this is strange. We could narrow it down some more if we had 'xm dmesg', but 'xm dmesg' with more logging turned on, i.e. boot with 'loglvl=all guest_loglvl=all' on Xen's command line. Checking the load and memory usage of the host during the problem might be interesting as well. One thing we may be able to check with the core of the guest we have is if all the pages in the rx_pfn_array look valid. Did we get a sos report from the host at the time of the crash to look at? I don't think the guest core will tell us any more, although you could check if the balloon driver is running on the long-shot that this crash is a side-effect of bug 653262, possibly making this bug a dup of that bug. dmesg and 'xm dmesg' from the host would be useful in checking that theory too, as another guest could have been ballooning. I don't think the balloon driver is involved. The only cases in which *de*crease_reservation fails, is if the arguments are wrong. The guest core could help us find why guest_remove_page is failing (that's the workhorse of decrease_reservation in the hypervisor). If the customer desires a workaround, I suggest that the customer switches the netfront module to copying by using the "rx_copy=1" module option. However, a guest without this option and running on a hypervisor with "loglvl=all guest_loglvl=all" is necessary in order to collect the required information. (In reply to comment #4) > Did we get a sos report from the host at the time of the crash to look at? > > I don't think the guest core will tell us any more, although you could check if > the balloon driver is running on the long-shot that this crash is a side-effect > of bug 653262, possibly making this bug a dup of that bug. dmesg and 'xm dmesg' > from the host would be useful in checking that theory too, as another guest > could have been ballooning. We did get a sosreport from dom0, but its from a while after the initial incident. The core is from Sep 27 and the sos from Oct 19. However I am attaching it anyways. Created attachment 462417 [details]
dom0 sosreport
Unfortunately nothing interesting in the sos :( Waiting from info on 5.6 beta. As a workaround, the guest netfront module could also be started with the rx_copy=1 option. |