Bug 670903

Summary: [RHEL6.0] megaraid_sas stalls after driver is reset
Product: Red Hat Enterprise Linux 6 Reporter: Debbie Johnson <dejohnso>
Component: kernelAssignee: Tomas Henzl <thenzl>
Status: CLOSED DUPLICATE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high Docs Contact:
Priority: urgent    
Version: 6.0CC: bo.yang, coughlan, dhoward, fhrbata, kzhang, mzywusko, ndoane, plyons, sforsber, syeghiay
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: 665427 Environment:
Last Closed: 2011-04-11 11:26:24 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 665427    
Bug Blocks: 667141    

Comment 1 Debbie Johnson 2011-01-19 16:12:57 UTC
NEC reported same problem on RHEL5 Bug 665427 ([RHEL5.6] megaraid_sas stalls after driver is reset ).
As far as I've checked 71.el6 our latest RHEL6 kernel (98.el6) on brew, but it seems only part of patch (at http://marc.info/?l=linux-scsi&m=129018442416761&w=2) is applied so far.

Would  you please open Bugzilla to fix this problem on RHEL6 too?


Current Situation:

  - NEC reported there's same problem with RHEL5 bug 665427 on RHEL6.

What TAM had done:

  - As far as TAM checked code on 71.el6 and 98.el6, a part of patch at http://marc.info/?l=linux-scsi&m=129018442416761&w=2 (plesae see below) is not applied yet.

---- a not applied part of patch at http://marc.info/?l=linux-scsi&m=129018442416761&w=2 ----
> @@ -2487,7 +2484,9 @@ megasas_transition_to_ready(struct megas
>  		case MFI_STATE_FAULT:
>  
>  			printk(KERN_DEBUG "megasas: FW in FAULT state!!\n");
> -			return -ENODEV;
> +			max_wait = MEGASAS_RESET_WAIT_TIME;
> +			cur_state = MFI_STATE_FAULT;
> +			break;
>  
>  		case MFI_STATE_WAIT_HANDSHAKE:
>  			/*

  - Also, here's code in 71.el6 and 98.el6

---- kernel-2.6.32-71.el6.src.rpmbuild-bp/linux-2.6.32-71.el6.x86_64/drivers/scsi/megaraid/megaraid_sas.c ----
2406         while (fw_state != MFI_STATE_READY) {
2407 
2408                 abs_state = instance->instancet->read_fw_status_reg(instance->reg_set);
2409 
2410                 switch (fw_state) {
2411 
2412                 case MFI_STATE_FAULT:
2413 
2414                         printk(KERN_DEBUG "megasas: FW in FAULT state!!\n");
2415                         return -ENODEV;
----

---- kernel-2.6.32-98.el6.src.rpmbuild-bp/linux-2.6.32-98.el6.x86_64/drivers/scsi/megaraid/megaraid_sas.c ----
2407         while (fw_state != MFI_STATE_READY) {
2408 
2409                 abs_state = instance->instancet->read_fw_status_reg(instance->reg_set);
2410 
2411                 switch (fw_state) {
2412 
2413                 case MFI_STATE_FAULT:
2414 
2415                         printk(KERN_DEBUG "megasas: FW in FAULT state!!\n");
2416                         return -ENODEV;
----

Comment 3 RHEL Program Management 2011-02-01 06:21:37 UTC
This request was evaluated by Red Hat Product Management for
inclusion in the current release of Red Hat Enterprise Linux.
Because the affected component is not scheduled to be updated
in the current release, Red Hat is unfortunately unable to
address this request at this time. Red Hat invites you to
ask your support representative to propose this request, if
appropriate and relevant, in the next release of Red Hat
Enterprise Linux. If you would like it considered as an
exception in the current release, please ask your support
representative.

Comment 4 RHEL Program Management 2011-02-01 19:08:07 UTC
This request was erroneously denied for the current release of
Red Hat Enterprise Linux.  The error has been fixed and this
request has been re-proposed for the current release.

Comment 8 Tomas Henzl 2011-04-11 11:26:24 UTC
(In reply to comment #1)
> NEC reported same problem on RHEL5 Bug 665427 ([RHEL5.6] megaraid_sas stalls
> after driver is reset ).
> As far as I've checked 71.el6 our latest RHEL6 kernel (98.el6) on brew, but it
> seems only part of patch (at
> http://marc.info/?l=linux-scsi&m=129018442416761&w=2) is applied so far.
> 
> Would  you please open Bugzilla to fix this problem on RHEL6 too?

The patch you are asking for was added to the -130 kernel, added via bz#692673.
I'm closing this as duplicate, the z-stream bug will be created based on 692673.

*** This bug has been marked as a duplicate of bug 692673 ***

Comment 9 Tomas Henzl 2011-04-11 12:27:04 UTC
The z-stream bug - #695322