Bug 699016

Summary: Emulex adapters failing CONFIG_PORT command following PCIe bus EEH injection on some IBM platforms
Product: Red Hat Enterprise Linux 6
Reporter: IBM Bug Proxy <bugproxy>
Component: kernel
Assignee: Steve Best <sbest>
Status: CLOSED CURRENTRELEASE
QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high
Docs Contact:
Priority: unspecified
Version: 6.1
CC: balkov, jkachuck, rlary, sbest, vaios.papadimitriou
Target Milestone: rc
Target Release: ---
Hardware: ppc64
OS: All
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2011-10-07 19:54:27 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Attachments:
Description: Sets needs_freset on Emulex PCIe adapters
Flags: none

Description IBM Bug Proxy 2011-04-22 16:00:41 UTC
---Problem Description---
On IBM Power 7 platforms with the new P7 PCI Host Bridge, Emulex PCIe adapters require a 
fundamental reset, rather than the default hot reset, to recover from IOA PCI bus errors.

The latest version on which this has been seen is RHEL 6.1 SS3.
  
---Additional Hardware Info---
IBM P7 Platforms with new P7 PCI Host Bridges:
9179-MHB	
9179-MHC		
8406-70Y	
8406-71Y		
8205-E6B	
8202-E4B	
8231-E2B
8231-E1C
8231-E2C	
9119-FHB	
9125-F2C	 

---uname output---
Linux isquad1.austin.ibm.com 2.6.32-130.el6.ppc64 #1 SMP Tue Apr 5 19:52:26 EDT 2011 
ppc64 ppc64 
   
---Steps to Reproduce---
Use the powerpc errinjct tool to inject a PCI IOA bus error on an Emulex PCIe adapter.
Observe that the adapter fails to recover because mailbox command 0x88 (CONFIG_PORT) times out.
 
lpfc 0002:01:00.0: 0:0442 Adapter failed to init, mbxCmd x88 CONFIG_PORT, mbxStatus x0 
Data: x0
lpfc 0002:01:00.0: 0:0442 Adapter failed to init, mbxCmd x88 CONFIG_PORT, mbxStatus x0 
Data: x0
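
When scanning a captured log for this failure, the mailbox command and status can be pulled out of the 0442 message mechanically. The following is only a sketch in C: the function name parse_lpfc_0442 is hypothetical (it is not part of lpfc or any tool), and it assumes the message has the same layout as the lines shown above, with the wrapped "Data:" part ignored.

```c
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper: returns 1 and fills *cmd and *status when the line
 * is a "0442 Adapter failed to init" message in the layout shown above,
 * otherwise returns 0 and leaves the outputs in an unspecified state.
 */
static int parse_lpfc_0442(const char *line, unsigned *cmd, unsigned *status)
{
    /* Only look at the init-failure message, not other lpfc output. */
    const char *p = strstr(line, ":0442 Adapter failed to init");
    if (p == NULL)
        return 0;

    /* The mailbox command is printed as hex after a literal 'x'. */
    p = strstr(p, "mbxCmd x");
    if (p == NULL || sscanf(p, "mbxCmd x%x", cmd) != 1)
        return 0;

    /* The mailbox status follows in the same form. */
    p = strstr(p, "mbxStatus x");
    if (p == NULL || sscanf(p, "mbxStatus x%x", status) != 1)
        return 0;

    return 1;
}
```

For the lines above, this would report command 0x88 (CONFIG_PORT) with status 0x0, which matches the time-out described in the steps.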

Patch has been submitted to Emulex for review and submission to Red Hat.

Patch has been tested on various P7 and P6 platforms with both PCIe and PCI-X adapters.

Testing confirmed there were no regressions and that the patch allows recovery of injected EEH 
errors in all cases tested.

The patch affects only PowerPC platforms and does not affect AER recovery paths on Intel 
architecture platforms.

Without this patch, customers using newer P7 platforms could be exposed to a system crash or 
loss of access to data should an unexpected PCI bus error be detected.  With this patch, such 
PCI bus errors would be recovered via PowerPC Enhanced Error Handling (EEH).

Comment 1 IBM Bug Proxy 2011-04-22 16:00:47 UTC
Created attachment 494259 [details]
Sets needs_freset on Emulex PCIe adapters

Comment 3 RHEL Program Management 2011-04-23 06:01:30 UTC
Since RHEL 6.1 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 4 IBM Bug Proxy 2011-06-09 17:10:58 UTC
------- Comment From rlary.com 2011-06-09 13:02 EDT-------
Upstream commit for the patch set that includes the lpfc needs_freset patch:

http://git.kernel.org/?p=linux/kernel/git/jejb/scsi-misc-2.6.git;a=commit;h=0558056c1ecd177f2621fc2a0484d565270f7ae1

- When the driver initializes a device that is on a PCIe bus, it sets
PCI's "needs fundamental reset" bit so that EEH uses a fundamental
reset instead of a hot reset for recovery.
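
To make the described behaviour concrete, here is a minimal user-space sketch in C. It is not the lpfc patch or kernel code: struct mock_pci_dev, mock_driver_init and mock_eeh_pick_reset are hypothetical stand-ins that only model the decision, assuming the driver sets the flag at init time and the recovery path consults it later.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the kernel's PCI device structure. */
struct mock_pci_dev {
    bool is_pcie;       /* device sits on a PCIe bus */
    bool needs_freset;  /* models the "needs fundamental reset" bit */
};

enum mock_reset_type {
    MOCK_HOT_RESET,          /* default recovery reset */
    MOCK_FUNDAMENTAL_RESET   /* requested through the flag */
};

/* Driver init step: request a fundamental reset for PCIe devices only. */
static void mock_driver_init(struct mock_pci_dev *dev)
{
    if (dev->is_pcie)
        dev->needs_freset = true;
}

/* Recovery step: choose the reset type from the flag set at init. */
static enum mock_reset_type mock_eeh_pick_reset(const struct mock_pci_dev *dev)
{
    return dev->needs_freset ? MOCK_FUNDAMENTAL_RESET : MOCK_HOT_RESET;
}
```

In this model a PCI-X adapter leaves the flag clear and keeps the default hot reset, which is consistent with the report that PCI-X adapters showed no regressions.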

Comment 5 IBM Bug Proxy 2011-09-09 16:00:27 UTC
------- Comment From rlary.com 2011-09-09 11:56 EDT-------
Confirmed that this patch is present in lpfc driver version 8.3.5.45.2p in RHEL 6.2 kernel 2.6.32-195.

Comment 6 RHEL Program Management 2011-10-07 15:31:47 UTC
Since RHEL 6.2 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 7 IBM Bug Proxy 2011-10-07 19:09:23 UTC
------- Comment From rlary.com 2011-10-07 14:06 EDT-------
(In reply to comment #14)
> Since RHEL 6.2 External Beta has begun, and this bug remains
> unresolved, it has been rejected as it is not proposed as
> exception or blocker.
> Red Hat invites you to ask your support representative to
> propose this request, if appropriate and relevant, in the
> next release of Red Hat Enterprise Linux.

Hello, I am assuming this is a boilerplate message that was blanket-attached to bugzillas, and that the patch I submitted and tested has not been and will not be removed from future RHEL 6.2 kernels?

Comment 8 Steve Best 2011-10-07 19:54:27 UTC
I talked to Richard. This patch came in under the feature that updated the driver for RHEL 6.2, and he has confirmed that the patch is in the beta build, so I am closing this bz as we no longer need it.

-Steve