Bug 641193

Summary: [NetApp 5.6 bug] regression: allow offlined devs to be set to running
Product: Red Hat Enterprise Linux 5 Reporter: Mike Christie <mchristi>
Component: kernel Assignee: Mike Christie <mchristi>
Status: CLOSED ERRATA QA Contact: Storage QE <storage-qe>
Severity: urgent Docs Contact:
Priority: high    
Version: 5.6 CC: andriusb, bdonahue, coughlan, cward, dhoward, marting, mgoodwin, pbenas, tao, xdl-redhat-bugzilla
Target Milestone: rc Keywords: OtherQA, ZStream
Target Release: 5.6   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 643237 (view as bug list) Environment:
Last Closed: 2011-01-13 21:56:34 UTC Type: ---
Bug Depends On:    
Bug Blocks: 643237, 658934    
Attachments:
Description Flags
enable transition from offline to running none

Description Mike Christie 2010-10-08 01:14:13 UTC
Description of problem:

This patch
http://git.kernel.org/linus/5c10e63c943b4c67561ddc6bf61e01d4141f881f
was merged in RHEL 5.5. It does not allow devices that are offlined to be onlined.

The problem is that in RHEL 5, with qla4xxx and FC drivers using the FC class, a device might be offlined due to a transport problem. Even after the problem is resolved, the devices are not usable until a user manually corrects the state.
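
For illustration only, the manual correction mentioned above is usually done through the SCSI device's sysfs `state` attribute. The sketch below assumes the device is exposed at `/sys/block/<dev>/device/state`; the helper name `set_scsi_device_running` and the `sysfs_root` parameter are hypothetical and not part of this report. On a kernel that refuses the offline-to-running transition, the write would be expected to fail with an `OSError`.

```python
import os


def set_scsi_device_running(dev_name, sysfs_root="/sys"):
    """Set an offlined SCSI device back to 'running' via its sysfs state file.

    dev_name is a block device name such as 'sdb' (hypothetical example).
    Returns the state read back after the attempt.
    """
    state_path = os.path.join(sysfs_root, "block", dev_name, "device", "state")

    # Read the current state; only touch devices that are actually offline.
    with open(state_path) as f:
        current = f.read().strip()
    if current != "offline":
        return current

    # Request the transition. The kernel may reject this write (OSError)
    # if it does not permit the offline -> running transition.
    with open(state_path, "w") as f:
        f.write("running\n")

    # Read the state back to confirm what the kernel now reports.
    with open(state_path) as f:
        return f.read().strip()
```

Typical use would be `set_scsi_device_running("sdb")` run as root; the `sysfs_root` argument exists only so the helper can be exercised against a scratch directory.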

Upstream and in RHEL 6, this is not a problem for FC because the FC class removes the device when dev_loss_tmo fires and then re-adds it when the problem is resolved. It is still a problem for qla4xxx, but upstream we are adding dev_loss_tmo functionality.

For RHEL 5, we cannot change the default dev_loss_tmo behavior so we should revert the patch.
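
As a rough illustration of what the revert changes, the following is a deliberately simplified model of a device state machine, not the kernel's actual transition table. The state names, the `RESTRICTED` and `REVERTED` tables, and `set_state` are all assumptions made for this sketch. The only difference between the two tables is whether "offline" is accepted as a source state for "running".

```python
# Hypothetical, simplified model. Each target state maps to the set of
# source states from which it may be entered. This is NOT the kernel's table.
RESTRICTED = {
    "running": {"created", "blocked"},
    "offline": {"created", "running", "blocked"},
    "blocked": {"created", "running"},
}

# With the restriction reverted, "offline" is again an accepted source
# state for "running"; every other entry is unchanged.
REVERTED = {**RESTRICTED, "running": RESTRICTED["running"] | {"offline"}}


def set_state(current, target, allowed):
    """Return the new state, or raise ValueError if the transition is refused."""
    if current not in allowed.get(target, set()):
        raise ValueError(f"illegal transition {current} -> {target}")
    return target
```

In this model, `set_state("offline", "running", RESTRICTED)` raises `ValueError`, while the same call with `REVERTED` succeeds, which mirrors the behavior this bug asks to restore.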




Comment 1 RHEL Program Management 2010-10-08 01:19:34 UTC
This request was evaluated by Red Hat Product Management for inclusion in a Red
Hat Enterprise Linux maintenance release.  Product Management has requested
further review of this request by Red Hat Engineering, for potential
inclusion in a Red Hat Enterprise Linux Update release for currently deployed
products.  This request is not yet committed for inclusion in an Update
release.

Comment 3 Jarod Wilson 2010-10-14 14:04:16 UTC
in kernel-2.6.18-227.el5
You can download this test kernel (or newer) from http://people.redhat.com/jwilson/el5

Detailed testing feedback is always welcomed.

Comment 5 Mike Christie 2010-10-25 18:18:54 UTC
Created attachment 455600 [details]
enable transition from offline to running

Comment 6 Martin George 2010-11-02 18:36:06 UTC
Patch looks good. Is this queued for the next 5.5.z release?

Comment 7 Ben Marzinski 2010-11-09 15:09:18 UTC
*** Bug 582683 has been marked as a duplicate of this bug. ***

Comment 8 Martin George 2010-11-09 15:28:51 UTC
*** Bug 632195 has been marked as a duplicate of this bug. ***

Comment 9 Petr Beňas 2010-12-01 15:09:15 UTC
Code review. 
Verified the patch is actually being applied in -227.el5 kernel.spec

 5983 Patch25982: linux-2.6-scsi-re-enable-transistions-from-offline-to-running.patch
11754 %patch25982 -p1

and the applied patch corresponds to the one attached to this BZ.

Comment 12 Chris Ward 2010-12-02 15:31:44 UTC
Reminder! There should be a fix present for this BZ in snapshot 3 -- unless otherwise noted in a previous comment.

Please test and update this BZ with test results as soon as possible.

Comment 15 errata-xmlrpc 2011-01-13 21:56:34 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2011-0017.html

Comment 16 Ben Marzinski 2011-03-14 19:49:11 UTC
*** Bug 678472 has been marked as a duplicate of this bug. ***