Bug 450953
| Summary: | el4u6 xenU guest kernel lockup due to mm_unpinned_lock and runqueue spinlock deadlock | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 4 | Reporter: | Herbert van den Bergh <herbert.van.den.bergh> | ||||||
| Component: | kernel-xen | Assignee: | Chris Lalancette <clalance> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | Martin Jenner <mjenner> | ||||||
| Severity: | high | Docs Contact: | |||||||
| Priority: | high | ||||||||
| Version: | 4.6 | CC: | bmr, clalance, cward, greg.marsden, jwest, riel, rpacheco, tao, uobergfe, vgoyal, xen-maint | ||||||
| Target Milestone: | rc | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | x86_64 | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2009-05-18 19:15:53 UTC | Type: | --- | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Bug Depends On: | |||||||||
| Bug Blocks: | 461304, 470304 | ||||||||
| Attachments: |
|
||||||||
|
Description
Herbert van den Bergh
2008-06-11 21:13:09 UTC
Created attachment 309000 [details]
fix for mm_unpinned_lock / runq deadlock
The patch looks good to me. Updating PM score. This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. OK. I posted this patch to upstream Xen, and they pointed out that not only does 2.6.18 probably not have this issue, but also that the patch isn't entirely sufficient, since you might still deadlock with the mm->page_table_lock. Upstream 2.6.18 does this a different way (which the RHEL-5 kernel already has); I've backported the changes, and built a test kernel available at http://people.redhat.com/clalance/bz450953/. Can the reporter please download the test kernel, try it out, and report back test results? Thank you, Chris Lalancette Created attachment 324399 [details]
Backport of upstream xen-unstable c/s 10343 and 10995
This is a backport of xen-unstable c/s 10343 and 10995, that I believe will fix the issue. Instead of changing the locking, this actually defers the mm_pin time until activate_mm, which means that we should avoid the deadlock in the first place.
Chris, the upstream patches are indeed better. I'll go NACK Larry's patches :) Committed in 78.20.EL . RPMS are available at http://people.redhat.com/vgoyal/rhel4/ Any updates here? Has this issue been resolved in the RHEL 4.8 Beta? later kernel? An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHSA-2009-1024.html |