Bug 352001
Summary: | I/O errors are thrown on FC storage lun not assigned to the host server. | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Shyam kumar Iyer <shyam_iyer> | ||||
Component: | kernel | Assignee: | Tom Coughlan <coughlan> | ||||
Status: | CLOSED NOTABUG | QA Contact: | Martin Jenner <mjenner> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | low | ||||||
Version: | 5.1 | CC: | andriusb, berthiaume_wayne, coughlan, dzickus, jfeeney, wwlinuxengineering | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | All | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2007-12-19 22:57:53 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 217106 | ||||||
Attachments: |
|
Description
Shyam kumar Iyer
2007-10-25 10:17:22 UTC
Created attachment 237251 [details]
/var/log/messages file
/dev/sdb is the FC storage lun which has not been assigned to the host server
but still throws I/O errors.
I'm not entirely sure what Naviagent is, but from the sounds of the problem and the messages in the log file, it certainly looks like a race condition in the Naviagent software. The Emulex driver is actually working properly from what I can see. It is getting an async notification of a new device on the fabric (when the fabric came up, the device was there) and it adds the device to the SCSI layer, the SCSI layer successfully gets an INQUIRY through to the device, then it starts getting failures when it attempts to send the remaining commands it normally sends during device scan (READ_CAPACITY and so on). Based on what I've seen, and a rather limited knowledge of Naviagent, I would guess that once the Naviagent software is brought up, it is possibly adding the devices, then realizing they aren't exported to this machine and removing them, or something like that. In the meantime, sometimes the Emulex driver notices the device between the add/remove, and sometimes it doesn't, resulting in what you see. In order to be any more help than this, I would need to know more about the Naviagent software and it's role in device discovery (or alternatively, someone inside Red Hat that knows more about it would have to take over for me...Cc:ing Tom Coughlan since he might know if someone else is knowledgeable in Naviagent setups). Thanks for looking at this Doug. I'll ask Wayne at EMC to take a look. The errors are on LUNZ. This is fake LUN that provides a path for in-band comunication with the Clariion controller: Oct 17 19:44:31 aknode5 kernel: lpfc 0000:04:00.0: 0:1303 Link Up Event x1 received Data: x1 xf7 x10 x0 Oct 17 19:44:31 aknode5 kernel: Vendor: DGC Model: LUNZ Rev: 0322 Oct 17 19:44:31 aknode5 kernel: Type: Direct-Access ANSI SCSI revision: 04 Oct 17 19:44:31 aknode5 kernel: sdb : READ CAPACITY failed. Oct 17 19:44:31 aknode5 kernel: sdb : status=1, message=00, host=0, driver=08 Oct 17 19:44:31 aknode5 kernel: sd: Current: sense key: Illegal Request Oct 17 19:44:31 aknode5 kernel: Add. Sense: Logical unit not supported This happens when the WWID of the HBA port is not properly registered with the Clariion. This may also happen when there are no LUNs assigned. Wayne? Hey Wayne, any updates to this? Thanks! This is expected behavior. As Tom pointed out, this is a fake LUN used for in- band communications via sg() between the host (Naviagent) and the array. Once LUNs are assigned tothe storage group on teh array teh LUNZ will no longer be seen. |