Bug 361931
Summary: | [Stratus 4.7 bug] iounmap may sleep while holding vmlist_lock, causing a deadlock. | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Peter Martuccelli <peterm> | ||||
Component: | kernel | Assignee: | Larry Woodman <lwoodman> | ||||
Status: | CLOSED ERRATA | QA Contact: | Martin Jenner <mjenner> | ||||
Severity: | urgent | Docs Contact: | |||||
Priority: | urgent | ||||||
Version: | 4.6 | CC: | andriusb, dmair, jbaron, jskrabal, lwoodman, smcgrath, tao, vgoyal | ||||
Target Milestone: | rc | Keywords: | ZStream | ||||
Target Release: | --- | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | GSSApproved | ||||||
Fixed In Version: | RHSA-2008-0665 | Doc Type: | Bug Fix | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2008-07-24 19:19:20 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 240187, 367631, 422551, 430698, 433267 | ||||||
Attachments: |
|
Description
Kimball Murray
2007-11-01 14:57:32 UTC
This is a very timing-and-scheduling-sensitive bug. It has been around for a long time, and we never hit it through RHEL4.4 and RHEL4.5 until last week. One of our drivers, at system startup, does an ioremap_nocache, followed by an iounmap. Meanwhile, we must have changed something in the system that changes the timing of events such that the init_mm.mmap_sem is taken by some other thread at just the wrong time. Now our system hits this bug frequently on bootup, but changing anything in the way the system starts can make this go away again. So it's hard to assess the exposure acurately. Created attachment 245981 [details]
Patch to fix this problem
This patch fixes this problem:
This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. Simon, can you confirm that the patch below fixes the issue on 4.6? (Since this may get proposed for 4.6.z). Thanks! Will do patching, rebuilding and will submit to Systems System Test group for verification tomorrow morning. Good, Stratus Systems Test group have been running the patched 2.6.9-67.0.4 kernel for several hours now, and are unable to repro the problem. Previously, on one system, the problem would occur in about an hour. They will continue tests for 24 hours before giving a definiative sign-off on the patch. Stratus Systems Test group have run tests successfully for 24 hours without reproducing problem, or encountering any new ones. Therefore, patch is GOOD. On Friday Feb 15th Vivek G. agreed to include patch in next 4.7 build, with no objections from Larry W. This bug is cloned for inclusion in the 4.6.z stream with: https://bugzilla.redhat.com/show_bug.cgi?id=433267 Committed in 68.12. RPMS are available at http://people.redhat.com/vgoyal/rhel4/ What test procedures did Stratus Systems Test group use to test this? An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHSA-2008-0665.html |