Bug 752708

Summary: kernel: qla2xxx 0000:12:00.0: scsi(4:1:0): Abort command issued -- 1 46b19574 2002.
Product: Red Hat Enterprise Linux 5 Reporter: Harrison Han <xizhi.han>
Component: kernelAssignee: Red Hat Kernel Manager <kernel-mgr>
Status: CLOSED WONTFIX QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 5.3CC: emi2fast, xizhi.han
Target Milestone: rcFlags: pm-rhel: needinfo? (xizhi.han)
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-06-02 13:03:36 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
captured pictures none

Description Harrison Han 2011-11-10 07:50:07 UTC
Created attachment 532745 [details]
captured pictures

Description of problem:

kernel: 2.6.18-128.e15


We have one oracle RAC with 4 nodes,and the I/O(especially write) on the node 1 turned very slowly, it took 3~18 minutes to write the DB redo logs to archive log files, normally only need several seconds.the average I/O wait was 30 times of normal one.After node 1 reboot, the failure disappeared.


This only happened on one node, and the other nodes worked well, and after OS reboot of node 1, everything turned normally.


The vendor checked the hardware and storage, no failure found.



The below messages registered in /var/log/message during failure time.


Oct 30 16:49:01 mxrac01 auditd[7475]: Audit daemon rotating log files
Oct 31 18:58:00 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:1:0): Abort command issued -- 1 46b19574 2002.
Oct 31 18:58:01 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:0:1): Abort command issued -- 1 46b19608 2002.
Oct 31 18:59:07 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:0:1): Abort command issued -- 1 46b1a38a 2002.
Oct 31 19:00:11 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:1:0): Abort command issued -- 1 46b1abe9 2002.
Oct 31 19:00:11 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:1:0): Abort command issued -- 1 46b1acbe 2002.
Oct 31 19:01:15 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:1:0): Abort command issued -- 1 46b1add4 2002.
...............................................................................................................................................................................................................
#remark: until the node 1 restarted
...............................................................................................................................................................................................................
Oct 31 20:39:07 mxrac01 kernel: qla2xxx 0000:12:00.0: scsi(4:0:2): Abort command issued -- 1 46b6787e 2002.
 


Could you please help us?
One attachment added, please help check.

Thanks!
                                                                     Harrison

Comment 1 Emmanuel Segura 2013-08-01 13:02:57 UTC
I have the some problem using oracle rac 11G  with 6 nodes redhat 5.8, this happen only with server with qlogic hba on heavy load

Comment 2 RHEL Program Management 2014-03-07 13:35:09 UTC
This bug/component is not included in scope for RHEL-5.11.0 which is the last RHEL5 minor release. This Bugzilla will soon be CLOSED as WONTFIX (at the end of RHEL5.11 development phase (Apr 22, 2014)). Please contact your account manager or support representative in case you need to escalate this bug.

Comment 3 RHEL Program Management 2014-06-02 13:03:36 UTC
Thank you for submitting this request for inclusion in Red Hat Enterprise Linux 5. We've carefully evaluated the request, but are unable to include it in RHEL5 stream. If the issue is critical for your business, please provide additional business justification through the appropriate support channels (https://access.redhat.com/site/support).