Bug 543270 - Fix deadlock in multipath when removing a device
Summary: Fix deadlock in multipath when removing a device
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.5
Hardware: All
OS: Linux
medium
medium
Target Milestone: rc
: ---
Assignee: Mikuláš Patočka
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
: 555266 (view as bug list)
Depends On:
Blocks: 527501
TreeView+ depends on / blocked
 
Reported: 2009-12-02 02:27 UTC by Mikuláš Patočka
Modified: 2010-11-28 05:33 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2010-03-30 07:36:42 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
The backported patch (4.41 KB, patch)
2009-12-02 02:27 UTC, Mikuláš Patočka
no flags Details | Diff


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2010:0178 0 normal SHIPPED_LIVE Important: Red Hat Enterprise Linux 5.5 kernel security and bug fix update 2010-03-29 12:18:21 UTC

Comment 1 RHEL Program Management 2009-12-02 02:31:40 UTC
This request was evaluated by Red Hat Product Management for inclusion in a Red
Hat Enterprise Linux maintenance release.  Product Management has requested
further review of this request by Red Hat Engineering, for potential
inclusion in a Red Hat Enterprise Linux Update release for currently deployed
products.  This request is not yet committed for inclusion in an Update
release.

Comment 2 Don Zickus 2009-12-11 19:31:42 UTC
in kernel-2.6.18-179.el5
You can download this test kernel from http://people.redhat.com/dzickus/el5

Please update the appropriate value in the Verified field
(cf_verified) to indicate this fix has been successfully
verified. Include a comment with verification details.

Comment 5 Barry Donahue 2010-02-19 17:54:57 UTC
    What are the steps to reproduce this problem?

Comment 6 Tom Coughlan 2010-02-25 21:44:29 UTC
<mikulas> coughlan: this is a race condition, when failing a path and removing the mpath device simultaneously causes the deadlock.
<mikulas> coughlan: if you want to reliably reproduce the race, you must artifically delay the thread that submits the events.
<mikulas> coughlan: I would recommend adding a msleep(10000) (ten seconds) at the beginning of trigger_event function.
<mikulas> then, fail the path and within 10 seconds, remove the device. I think that should trigger the bug.
<mikulas> (note, by removing the device, I mean "dmsetup remove <device>")
<mikulas> if it doesn't work (something during device removal uses the same event thread), another option how to reproduce it would be to remove schedule_work(&m->trigger_event); from dm-mpath.c

Comment 8 errata-xmlrpc 2010-03-30 07:36:42 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2010-0178.html

Comment 9 Mikuláš Patočka 2010-11-28 05:33:12 UTC
*** Bug 555266 has been marked as a duplicate of this bug. ***


Note You need to log in before you can comment on or make changes to this bug.