Bug 176879
Summary: | shost->host_busy count is set to -1 | |
---|---|---|---
Product: | Red Hat Enterprise Linux 4 | Reporter: | tom phelan <tap>
Component: | kernel | Assignee: | Chip Coldwell <coldwell>
Status: | CLOSED ERRATA | QA Contact: | Brian Brock <bbrock>
Severity: | medium | Docs Contact: |
Priority: | medium | |
Version: | 4.0 | CC: | andriusb, coughlan, egoggin, eric.moore, jbaron
Target Milestone: | --- | |
Target Release: | --- | |
Hardware: | All | |
OS: | Linux | |
Whiteboard: | | |
Fixed In Version: | RHBA-2007-0304 | Doc Type: | Bug Fix
Doc Text: | | Story Points: | ---
Clone Of: | | Environment: |
Last Closed: | 2007-05-02 00:02:02 UTC | Type: | ---
Regression: | --- | Mount Type: | ---
Documentation: | --- | CRM: |
Verified Versions: | | Category: | ---
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: |
Cloudforms Team: | --- | Target Upstream Version: |
Embargoed: | | |
Bug Depends On: | | |
Bug Blocks: | 217097 | |
Attachments: | | |
Description
tom phelan
2006-01-03 22:39:02 UTC
It looks to me like the problem is that mptscsih_qcmd, the LSILogic queuecommand callout in mptscsih.c, is both calling the scsi command's io done callback and returning FAILED to the scsi mid-layer's queuecommand function whenever the LUN of the command is greater than the mptscsih configured/derived "last lun". It should be doing one or the other, but certainly not both. Doing both causes the scsi mid-layer's host_busy and device_busy values for the adapter's host structure to be decremented twice instead of once, thereby causing the -1.

Failing the command with a scsi status of DID_BAD_TARGET, calling the io done callback on the failed command, and returning 0 to queuecommand is the thing to do. This amounts to a one-line change to the RHEL4 mptscsih.c driver: returning 0 instead of FAILED whenever the command's LUN is greater than "last_lun".

Created attachment 146313
patch that implements the recommendation from comment #1

I've built a kernel that incorporates the patch in comment #3:

http://people.redhat.com/coldwell/bugs/kernel/176879/

If someone at VMware (Tom Phelan or Ed Groggin?) could please verify that the problem is fixed in the test kernel, and if Eric Moore at LSI is willing to sign off on the patch, then I will submit the patch for inclusion in the RHEL4.5 kernel.

Thanks,
Chip

Chip, by when should this verification be done in order to have the patch included in the RHEL4 U5 distro?

Thanks,
Ed

(In reply to comment #5)
> Chip, when should this verify be done by in order to have the patch included in
> the RHEL4 U5 distro?

As soon as possible. I really should have had that patch submitted before Christmas (my fault).

Chip

Chip - Sorry for being late; however, I only came to know about this patch on 1/23/2007. The suggested patch in comment #3 is fine. We've added this patch to our internal driver stream.

Committed in stream U5 build 45. A test kernel with this patch is available from http://people.redhat.com/~jbaron/rhel4/

QE ack for 4.5.
Patch is in the -52 kernel.

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2007-0304.html
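
The double decrement described in the original report can be illustrated with a small userspace model. This is only a sketch, not the mptscsih driver or the actual patch: every name in it (model_host, model_cmd, dispatch, qcmd_buggy, qcmd_fixed, MODEL_LAST_LUN) is hypothetical, the constants are stand-ins, and the mid-layer accounting is reduced to a single counter on the assumption that a nonzero queuecommand return makes the mid-layer release the slot it reserved.

```c
/* Userspace model only; none of these names are the kernel's real API. */
#define MODEL_LAST_LUN       7            /* assumed "last lun" for this model */
#define MODEL_FAILED         0x2003       /* stand-in for a nonzero queuecommand return */
#define MODEL_DID_BAD_TARGET (0x04 << 16) /* stand-in host status for a bad target */

struct model_host { int host_busy; };
struct model_cmd  { struct model_host *host; unsigned lun; int result; };

typedef void (*done_fn)(struct model_cmd *);
typedef int (*queue_fn)(struct model_cmd *, done_fn);

/* Completion path: releases the slot reserved for the command. */
static void model_done(struct model_cmd *cmd)
{
    cmd->host->host_busy--;
}

/* Shared body: rejected_rc is what the driver returns for an out-of-range LUN. */
static int qcmd_common(struct model_cmd *cmd, done_fn done, int rejected_rc)
{
    if (cmd->lun > MODEL_LAST_LUN) {
        cmd->result = MODEL_DID_BAD_TARGET;
        done(cmd);              /* command is completed here ... */
        return rejected_rc;     /* ... so this must not also signal failure */
    }
    cmd->result = 0;
    done(cmd);                  /* model: valid commands complete immediately */
    return 0;
}

/* Behaviour described in the report: completes the command AND returns FAILED. */
static int qcmd_buggy(struct model_cmd *cmd, done_fn done)
{
    return qcmd_common(cmd, done, MODEL_FAILED);
}

/* Behaviour recommended in the report: completes the command and returns 0. */
static int qcmd_fixed(struct model_cmd *cmd, done_fn done)
{
    return qcmd_common(cmd, done, 0);
}

/* Mid-layer model: reserve a slot, call the driver, and release the slot
 * again if the driver reports that it did not take the command. */
static int dispatch(struct model_host *host, queue_fn qcmd, unsigned lun)
{
    struct model_cmd cmd = { host, lun, 0 };

    host->host_busy++;
    if (qcmd(&cmd, model_done) != 0)
        host->host_busy--;      /* second decrement in the buggy case */
    return host->host_busy;
}
```

In this model, calling dispatch on a fresh host with qcmd_buggy and a LUN above MODEL_LAST_LUN leaves host_busy at -1, while the same call with qcmd_fixed leaves it at 0. That matches the reasoning in the report that the command should be either completed or rejected, never both.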