Bug 470610 - [Emulex 5.3 bug] Update lpfc to version 8.2.0.33.3p
Summary: [Emulex 5.3 bug] Update lpfc to version 8.2.0.33.3p
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.3
Hardware: All
OS: Linux
high
urgent
Target Milestone: rc
: ---
Assignee: Tom Coughlan
QA Contact: Mike Gahagan
URL:
Whiteboard:
Depends On:
Blocks: 357171 373081 431464
TreeView+ depends on / blocked
 
Reported: 2008-11-07 23:14 UTC by Jamie Wellnitz
Modified: 2009-06-20 08:05 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2009-01-20 19:48:54 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
patch for Emulex lpfc 8.2.0.33.3p - applies on top of 8.2.0.33.2p (1.11 KB, patch)
2008-11-07 23:14 UTC, Jamie Wellnitz
no flags Details | Diff


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2009:0225 0 normal SHIPPED_LIVE Important: Red Hat Enterprise Linux 5.3 kernel security and bug fix update 2009-01-20 16:06:24 UTC

Description Jamie Wellnitz 2008-11-07 23:14:03 UTC
Created attachment 322912 [details]
patch for Emulex lpfc 8.2.0.33.3p - applies on top of 8.2.0.33.2p

There's a bug in 8.2.0.33.2p (in kernel 2.6.18-122.el5) involving Fibre Channel discovery.

Symptoms:
lpfc HBAs fail to discover all the targets after a switch reboot when connected to a QLogic switch.  The fix is two lines (actually a single line moved by a few lines), plus the version change.

Following are the events that cause this issue.

- After a switch reboot HBA receives a link up event
- Driver starts discovery by querying the name server and sending PLOGI to the targets.
- The PLOGI for one target fails with LS_RJT
- The driver started a delay timer to retry the PLOGI after one second. The NLP_NPR_2B_DISC flag of this target is set and target is in NPR state now.
- Before delay timer expires, the HBA received an RSCN for the target and lpfc_device_recov_npr_node state machine function is called. This function  resets the NLP_NPR_2B_DISC flag and calls lpfc_cancel_retry_delay_tmo function.

The counters for keeping track of the number of targets in discovery is updated in lpfc_cancel_retry_delay_tmo function based on NLP_NPR_2B_DISC flag. These counters are not updated because NLP_NPR_2B_DISC flag is cleared before calling lpfc_cancel_retry_delay_tmo function. This leaves the HBA in FC_NDISC_ACTIVE state and results in the driver not responding to any RSCN events.

Fix:
Moved the clearing of NLP_NPR_2B_DISC flag after the lpfc_cancel_retry_delay_tmo function call in lpfc_device_recov_npr_node function.

Comment 5 Don Zickus 2008-12-09 21:04:53 UTC
in kernel-2.6.18-126.el5
You can download this test kernel from http://people.redhat.com/dzickus/el5

Comment 7 Chris Ward 2008-12-16 16:29:26 UTC
~~~ Attention Partners ~~~ The *last* RHEL 5.3 Snapshot 6 is now available at partners.redhat.com. A fix for this bug should be present. Please test and update this bug with test results as soon as possible.  If the fix present in Snap6 meets all the expected requirements for this bug, please add the keyword PartnerVerified. If any new bugs are discovered, please CLONE this bug and describe the issues encountered there.

Comment 8 Jamie Wellnitz 2008-12-17 02:38:08 UTC
2.6.18-126.el5 has 8.2.0.33.3p and looks good.

Comment 10 errata-xmlrpc 2009-01-20 19:48:54 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2009-0225.html


Note You need to log in before you can comment on or make changes to this bug.