Bug 802607

Summary: reset high-speed USB device (sata drive and USB connection)
Product: [Fedora] Fedora
Component: kernel
Version: 16
Hardware: x86_64
OS: Linux
Status: CLOSED ERRATA
Severity: high
Priority: unspecified
Reporter: George R. Goffe <grgoffe>
Assignee: Kernel Maintainer List <kernel-maint>
QA Contact: Fedora Extras Quality Assurance <extras-qa>
CC: gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda
Target Milestone: ---
Target Release: ---
Doc Type: Bug Fix
Last Closed: 2012-04-10 13:58:55 UTC
Attachments:
Description                                                        Flags
redacted /var/log/messages                                         none
gzip'd copy of parts of /var/log/messages during a problem event.  none
gzip'd copy of parts of /var/log/messages during a problem event.  none
Clip from /var/log/messages                                        none
gzip'd excerpts from /var/log/messages relating to this problem    none

Description George R. Goffe 2012-03-13 04:39:51 UTC
Description of problem:

Random device resets causing data loss.

Version-Release number of selected component (if applicable):

3.2.9-1.fc16.x86_64.debug

How reproducible:

The problem appears to occur randomly and is not particularly associated with heavy I/O. One failure occurred with the device NOT mounted and NO I/O activity.

Steps to Reproduce:
1. Connect the device and power it on.
2. Confirm that the system sees the device.
3. Mount the device.
4. Use the drive and wait for the problem to happen.
  
Actual results:

See above.

Expected results:

Normal drive operations.

Additional info:

Comment 1 George R. Goffe 2012-03-13 04:42:27 UTC
Created attachment 569546 [details]
redacted /var/log/messages

This bug closely resembles Bug 782280, "ext4 write took more than 2 minutes to complete".

Comment 2 Dave Jones 2012-03-22 16:45:30 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 3 Dave Jones 2012-03-22 16:49:54 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 4 Dave Jones 2012-03-22 16:59:32 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 5 George R. Goffe 2012-03-23 09:06:37 UTC
Dave,

I just tested and after a few minutes got the same result.

Do you need more information?

Regards,

George...

Comment 6 George R. Goffe 2012-03-23 20:48:52 UTC
Created attachment 572377 [details]
gzip'd copy of parts of /var/log/messages during a problem event.

Dave,

Here's the excerpt from /var/log/messages for the time when this latest problem appeared.

I'm wondering if my setup is the problem. I have a USB splitter (1 in, 4 out), and one of its 4 outputs is connected to a KingWin USB/SATA external connector.

Regards,

George...

Comment 7 George R. Goffe 2012-03-23 20:51:25 UTC
Dave,

This time I triggered the problem with the cp command instead of with "tar -cvlf - * | (cd /sdc1/wtf;tar -xvpf -)".

The key message looks like this: "   1210 [52205.619432] sd 10:0:0:0: [sdb] Unhandled error code"
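
For anyone trying to pull the relevant lines out of a large /var/log/messages, here is a minimal sketch in Python. It assumes the two strings of interest are the one quoted above ("Unhandled error code") and the one in the bug summary ("reset high-speed USB device"). The function name find_problem_lines is made up for illustration and is not part of any existing tool.

```python
import re

# Strings taken from this report: the bug summary and the message quoted above.
PATTERNS = (
    re.compile(r"reset high-speed USB device"),
    re.compile(r"Unhandled error code"),
)

# Kernel timestamp such as "[52205.619432]"; captures the seconds value.
TIMESTAMP = re.compile(r"\[\s*(\d+\.\d+)\]")


def find_problem_lines(lines):
    """Return a list of (timestamp_or_None, line) for lines matching a pattern.

    `lines` is any iterable of strings, e.g. an open log file.
    The name and return shape are illustrative assumptions.
    """
    hits = []
    for line in lines:
        if any(p.search(line) for p in PATTERNS):
            m = TIMESTAMP.search(line)
            ts = float(m.group(1)) if m else None
            hits.append((ts, line.rstrip("\n")))
    return hits
```

It could be used along the lines of `with open("/var/log/messages", errors="replace") as fh: hits = find_problem_lines(fh)`. Comparing the timestamps of consecutive hits would show whether the resets cluster around periods of activity or occur while the drive is idle.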

Comment 8 George R. Goffe 2012-04-10 07:04:24 UTC
Dave,

I have been running this kernel, "3.3.0-8.fc16.x86_64.debug #1 SMP Thu Mar 29 18:14:11 UTC 2012 x86_64 GNU/Linux", for the past 3+ days and have NOT seen this error. I would say that this bug is fixed now (knock on wood).

Thanks for your help!

George...

 00:01am  up 3 days 11:35,  16 users,  load average: 1.18, 1.00, 0.97

Comment 9 Josh Boyer 2012-04-10 13:58:55 UTC
Thank you for letting us know.

Comment 10 George R. Goffe 2012-04-12 09:26:37 UTC
Created attachment 577012 [details]
gzip'd copy of parts of /var/log/messages during a problem event.

Josh,

I spoke too soon. I'm enclosing an excerpt from /var/log/messages that shows this problem but there's also a stack dump and some other messages that I'm thinking you might find interesting. If I need to file a new bug for the stack dump, let me know and, if you can, please suggest who I should send it to.

Regards,

George...

Comment 11 George R. Goffe 2012-05-03 18:50:14 UTC
Created attachment 581932 [details]
Clip from /var/log/messages

Josh,

This just happened a few minutes ago after several days of blissful operation.

George...

Comment 12 George R. Goffe 2012-09-30 06:09:08 UTC
Josh,

This problem just reappeared when I kept a 1TB Seagate laptop drive busy. Effectively, the hardware has not changed except for the new drive. Sigh.

I'm enclosing excerpts from /var/log/messages. There's a trace dump in this file. Perhaps this will lead to a solution to this problem?

Regards,

George...

Comment 13 George R. Goffe 2012-09-30 06:10:16 UTC
Created attachment 619323 [details]
gzip'd excerpts from /var/log/messages relating to this problem