Bug 127066
| Summary: | Panic is occurring in the I/O completion interrupt handling for the character interface driver (sg). |
|---|---|
| Product: | Red Hat Enterprise Linux 3 |
| Component: | kernel |
| Version: | 3.0 |
| Hardware: | All |
| OS: | Linux |
| Status: | CLOSED ERRATA |
| Severity: | medium |
| Priority: | medium |
| Reporter: | Heather Conway <conway_heather> |
| Assignee: | Tom Coughlan <coughlan> |
| QA Contact: | Brian Brock <bbrock> |
| CC: | dledford, petrides, riel, rkenna |
| Doc Type: | Bug Fix |
| Last Closed: | 2005-05-18 13:27:41 UTC |
| Bug Blocks: | 132991 |

Description
Heather Conway
2004-07-01 13:14:56 UTC
Created attachment 101559
patch for RHEL 3.0 U2 sg.cb
The patch looks okay to me. I would like to have it reviewed on the linux-scsi list, and incorporated upstream if possible. Would the author of this patch be willing to post this to linux-scsi? (Use a unified diff when posting to the Linux lists.) If not, I will post it. If it passes review, the patch will be in U4.

Tom

Tom, would you please take the lead on this issue and post the patch for me? Thanks.

Steven Tweedie reviewed the patch, and does not think it is a sufficient fix:

It's an interruptible task, so there's nothing at all to stop user-space from signalling or timing out or otherwise continuing on its own independently of the wake_up_interruptible(). In that case, there's nothing to stop the race happening *before* we take the copy of sfp->async_qp. All we're doing here is fixing the most likely cause of the wakeup/read()/sg_release(). The patch seems to be saying that there are plenty of other ways of reaching the same race which are not addressed by the patch. Don't we really need to move the kill_fasync() up to within the locking, before we let go of the command completely?

I am looking at whether we can find a suitable solution before U4 freezes.

Tom

A patch has been proposed upstream, and looks like it will be accepted.
http://marc.theaimsgroup.com/?l=linux-scsi&m=109936088901128&w=2
Please test to confirm this fixes your problem. This is a candidate for RHEL 3 U5.

This bug was found when PowerPath contained a volume manager, and the problem isn't being replicated. The PowerPath team is reviewing the code and will provide an update.

To clarify, there is no longer a volume manager included in the PowerPath package and the problem isn't being replicated.

A fix for this problem has just been committed to the RHEL3 U5 patch pool this evening (in kernel version 2.4.21-27.13.EL).

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.
http://rhn.redhat.com/errata/RHSA-2005-294.html
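
For readers following the review comment about moving kill_fasync() "up to within the locking", the ordering it asks for can be sketched in user space. This is only an illustrative analogue under stated assumptions, not the sg driver code and not the patch that was accepted upstream: the names file_state, async_queue, complete_command, release_file and run_demo are hypothetical, a pthread mutex stands in for the driver's locking, and a counter stands in for the signal that kill_fasync() would send.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical stand-ins for the driver's per-open-file state.
 * These are NOT the real sg structures. */
struct async_queue {
    int signals_sent;             /* counts notifications delivered */
};

struct file_state {
    pthread_mutex_t lock;         /* protects the fields below */
    struct async_queue *async_qp; /* NULL once the file is released */
    int done;                     /* set when the command has completed */
};

/* Completion path: notify while still holding the lock, and only then
 * mark the command done, so the release path cannot free async_qp in
 * between. Returns 1 if a notification was delivered, 0 if the file
 * had already been released. */
int complete_command(struct file_state *fs)
{
    int notified = 0;

    pthread_mutex_lock(&fs->lock);
    if (fs->async_qp != NULL) {
        fs->async_qp->signals_sent++;  /* analogue of kill_fasync() */
        notified = 1;
    }
    fs->done = 1;                      /* command is let go of last */
    pthread_mutex_unlock(&fs->lock);
    return notified;
}

/* Release path: takes the same lock before tearing the queue down. */
void release_file(struct file_state *fs)
{
    pthread_mutex_lock(&fs->lock);
    free(fs->async_qp);
    fs->async_qp = NULL;
    pthread_mutex_unlock(&fs->lock);
}

/* Exercises both orderings and returns the number of notifications
 * delivered, or -1 if allocation fails. */
int run_demo(void)
{
    struct file_state fs = {
        .lock = PTHREAD_MUTEX_INITIALIZER,
        .async_qp = NULL,
        .done = 0,
    };
    int total = 0;

    fs.async_qp = calloc(1, sizeof(*fs.async_qp));
    if (fs.async_qp == NULL)
        return -1;

    total += complete_command(&fs);  /* queue alive: notification sent */
    release_file(&fs);
    assert(fs.async_qp == NULL);
    total += complete_command(&fs);  /* queue gone: skipped, no stale access */
    return total;
}
```

Calling run_demo() from a small main() exercises a completion before and after release. The point of the sketch is that the notification and the teardown are serialized by one lock, so a completion that arrives after release finds a NULL queue instead of freed memory.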