RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 643237 - [NetApp 6.1 bug] regression: allow offlined devs to be set to running
Summary: [NetApp 6.1 bug] regression: allow offlined devs to be set to running
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: kernel
Version: 6.0
Hardware: All
OS: Linux
high
high
Target Milestone: rc
: 6.1
Assignee: Mike Christie
QA Contact: Gris Ge
URL:
Whiteboard:
Depends On: 641193
Blocks: 660590
TreeView+ depends on / blocked
 
Reported: 2010-10-15 03:24 UTC by Mike Christie
Modified: 2011-07-06 15:12 UTC (History)
7 users (show)

Fixed In Version: kernel-2.6.32-91.el6
Doc Type: Bug Fix
Doc Text:
Prior to this update, when using Red Hat Enterprise Linux 6 with a qla4xxx driver and FC (Fibre Channel) drivers using the fc class, a device might have been put in the offline state due to a transport problem. Once the transport problem was resolved, the device was not usable until a user manually corrected the state. This update enables the transition from the offline state to the running state, thus, fixing the problem.
Clone Of: 641193
Environment:
Last Closed: 2011-05-19 12:21:14 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
revert patch that prevents changing device state from offline (842 bytes, patch)
2010-10-20 21:03 UTC, Mike Christie
no flags Details | Diff


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2011:0542 0 normal SHIPPED_LIVE Important: Red Hat Enterprise Linux 6.1 kernel security, bug fix and enhancement update 2011-05-19 11:58:07 UTC

Comment 2 RHEL Program Management 2010-10-15 03:28:38 UTC
Thank you for your bug report. This issue was evaluated for inclusion
in the current release of Red Hat Enterprise Linux. Unfortunately, we
are unable to address this request in the current release. Because we
are in the final stage of Red Hat Enterprise Linux 6 development, only
significant, release-blocking issues involving serious regressions and
data corruption can be considered.

If you believe this issue meets the release blocking criteria as
defined and communicated to you by your Red Hat Support representative,
please ask your representative to file this issue as a blocker for the
current release. Otherwise, ask that it be evaluated for inclusion in
the next minor release of Red Hat Enterprise Linux.

Comment 3 Mike Christie 2010-10-20 21:03:39 UTC
Created attachment 454673 [details]
revert patch that prevents changing device state from offline

For the IO stuck in a queue problem, it is possible if the device was offlined
by the scsi layer, then the fc class tried to delete it due to dev_loss_tmo,
then the IO could get stuck in the scsi/block layer queue. It could have gone
from blocked->offline->then it should go to the cancel and device delete state,
but due to a bug in the scsi layer, it would not and the queues would not get
started again so IO in the queue would be stuck with no timer running on it to
unjam us.

Attached is a patch that reverts the patch that caused the problem (note it is
slightly different than the RHEL 5 one).

Comment 4 RHEL Program Management 2010-10-20 23:09:44 UTC
This request was evaluated by Red Hat Product Management for inclusion
in a Red Hat Enterprise Linux maintenance release. Product Management has 
requested further review of this request by Red Hat Engineering, for potential
inclusion in a Red Hat Enterprise Linux Update release for currently deployed 
products. This request is not yet committed for inclusion in an Update release.

Comment 6 Andrius Benokraitis 2010-12-02 15:32:47 UTC
Mike - any idea when this is getting POSTed?

Comment 7 Andrius Benokraitis 2010-12-02 17:01:05 UTC
Requesting 6.0.z due this is blocking NetApp's 6.0 GA cert.

Comment 9 Mike Christie 2010-12-03 03:22:19 UTC
(In reply to comment #6)
> Mike - any idea when this is getting POSTed?

I was trying to work on a proper upstream fix first. I think I can send the patch in this bz that just reverts the bad patch now and then open a new bz for the proper fix. So tomorrow/this-weekend.

Comment 10 Andrius Benokraitis 2010-12-03 15:58:36 UTC
(In reply to comment #9)
> (In reply to comment #6)
> > Mike - any idea when this is getting POSTed?
> 
> I was trying to work on a proper upstream fix first. I think I can send the
> patch in this bz that just reverts the bad patch now and then open a new bz for
> the proper fix. So tomorrow/this-weekend.

Yeah, I don't think NetApp can wait until the whole upstream set is set for 6.1 since this is blocking 6.0 cert (and needs 6.0.z) so it sounds like if you could POST just the patch that fixes this ASAP that would be great.

Comment 12 Aristeu Rozanski 2010-12-15 16:07:47 UTC
Patch(es) available on kernel-2.6.32-91.el6

Comment 15 Martin Prpič 2011-02-23 15:05:20 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
Prior to this update, when using Red Hat Enterprise Linux 6 with a qla4xxx driver and FC (Fibre Channel) drivers using the fc class, a device might have been put in the offline state due to a transport problem. Once the transport problem was resolved, the device was not usable until a user manually corrected the state. This update enables the transition from the offline state to the running state, thus, fixing the problem.

Comment 18 Gris Ge 2011-03-11 06:23:13 UTC
NetApp and Qlogic verified this patch.

Code reivewed. Patch has been applied into kernel-2.6.32-120.

Set as Sanity only.

Comment 19 errata-xmlrpc 2011-05-19 12:21:14 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2011-0542.html


Note You need to log in before you can comment on or make changes to this bug.