Bug 134736
Summary: | kernel panic in md driver (md lacks proper locking of device lists) | |
---|---|---|---|
Product: | Red Hat Enterprise Linux 3 | Reporter: | Paul Clements <paul.clements>
Component: | kernel | Assignee: | Doug Ledford <dledford>
Status: | CLOSED ERRATA | QA Contact: | Brian Brock <bbrock>
Severity: | high | Priority: | medium
Version: | 3.0 | CC: | dwmw2, james.bottomley, petrides, riel
Hardware: | All | OS: | Linux
Fixed In Version: | RHSA-2006-0437 | Doc Type: | Bug Fix
Last Closed: | 2006-07-20 13:17:21 UTC | Bug Blocks: | 170417, 181405

Description
Paul Clements
2004-10-05 20:30:32 UTC
David, it's not entirely clear: has the patch at http://marc.theaimsgroup.com/?l=linux-raid&m=106393738529573&w=2 been tested at the customer site, or was it another patch of your own?

David, no, I only tried to add locking to ITERATE_MDDEV(). I was thinking that the patch didn't apply cleanly to RHEL3 and that there were some fix-ups to be made.

We've applied the fix in Neil Brown's email to a Red Hat 8 kernel 2.4.20-28.7 and run into a locking failure in the seq_file interface. The down_read(&all_mddevs_sem); call needs to be moved from md_seq_next() to md_seq_start() in order to avoid a proc file bug which will cause the system to hang. I've attached our complete patch. With these locking changes, the system is stable for us and no longer oopses.

Created attachment 122629
locking patch against 2.4.20-28.7
This patch is modified from Neil Brown's original to apply against 2.4.20-28.7
and also has the proc file locking problem fixed.
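
For anyone reading along without the attachment, here is a rough userspace sketch of why the lock placement matters. It is not the attached patch and not the real md code: a plain reader counter stands in for all_mddevs_sem, and every demo_* name is hypothetical. The one fact it relies on is the seq_file contract that stop() is called after every start(), even when start() finds nothing, so a lock released in stop() has to be taken in start() rather than in next().

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for an md device on the global list. */
struct demo_dev {
    int id;
    struct demo_dev *next;
};

static struct demo_dev dev2 = { 2, NULL };
static struct demo_dev dev1 = { 1, &dev2 };
static struct demo_dev dev0 = { 0, &dev1 };
static struct demo_dev *all_devs = &dev0;

/* Reader count standing in for the kernel rw_semaphore (assumption:
 * single-threaded demo, so a plain int is enough to show the balance). */
static int readers;

static void demo_down_read(void) { readers++; }
static void demo_up_read(void)   { assert(readers > 0); readers--; }

/* start(): take the read lock exactly once, then seek to position pos. */
static struct demo_dev *demo_seq_start(size_t pos)
{
    struct demo_dev *d;

    demo_down_read();
    for (d = all_devs; d != NULL && pos > 0; d = d->next)
        pos--;
    return d;   /* may be NULL; stop() still runs afterwards */
}

/* next(): no locking here; it relies on the lock taken in start(). */
static struct demo_dev *demo_seq_next(struct demo_dev *d)
{
    assert(readers == 1);
    return d->next;
}

/* stop(): release the lock taken in start(). */
static void demo_seq_stop(void)
{
    demo_up_read();
}

/* One start/next.../stop cycle; returns how many devices were visited. */
static int demo_walk(size_t pos)
{
    struct demo_dev *d;
    int visited = 0;

    for (d = demo_seq_start(pos); d != NULL; d = demo_seq_next(d))
        visited++;
    demo_seq_stop();
    return visited;
}
```

Calling demo_walk() with a position past the end of the list exercises the case where start() returns NULL and next() never runs. With the lock taken in start(), the release in stop() is still balanced; had it been taken in next() instead, stop() would release a lock that was never acquired on that cycle.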
I modified the patch to work with a RHEL3 kernel and with the md event interface we have in RHEL3. It then passed my testing, and I've submitted it internally for review.

A fix for this problem has just been committed to the RHEL3 U8 patch pool this evening (in kernel version 2.4.21-40.10.EL).

Adding a couple dozen bugs to CanFix list so I can complete the stupid advisory.

A kernel has been released that contains a patch for this problem. Please verify if your problem is fixed with the latest available kernel from the RHEL3 public beta channel at rhn.redhat.com.

Reverting to ON_QA.

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2006-0437.html