Bug 1103234

Summary: xHCI xhci_drop_endpoint called with disabled ep ffff8802234caa00
Product: [Fedora] Fedora
Reporter: Michael Monreal <michael.monreal>
Component: kernel
Assignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED INSUFFICIENT_DATA
QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: urgent
Docs Contact:
Priority: unspecified
Version: 20
CC: gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda, mchehab, sangu.fedora, villapla
Target Milestone: ---
Flags: jforbes: needinfo?
Target Release: ---
Hardware: x86_64
OS: Linux
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2014-12-10 15:01:54 UTC
Type: Bug

Description Michael Monreal 2014-05-30 13:51:38 UTC
Description of problem:
I ran into a series of problems with my (new) Western Digital My Book 4TB external USB3 hard drive:

- transfer to the disk seems to stop at times
- running fdisk on the disk takes about 10-20 seconds to load
- dmesg shows this:

[  286.433990] usb 4-3: new SuperSpeed USB device number 2 using xhci_hcd
[  286.445578] usb 4-3: New USB device found, idVendor=1058, idProduct=1230
[  286.445586] usb 4-3: New USB device strings: Mfr=2, Product=3, SerialNumber=1
[  286.445590] usb 4-3: Product: My Book 1230
[  286.445593] usb 4-3: Manufacturer: Western Digital
[  286.445596] usb 4-3: SerialNumber: 574343344531323635393932
[  286.460272] usb-storage 4-3:1.0: USB Mass Storage device detected
[  286.461034] scsi6 : usb-storage 4-3:1.0
[  286.461483] usbcore: registered new interface driver usb-storage
[  287.463172] scsi 6:0:0:0: Direct-Access     WD       My Book 1230     1059 PQ: 0 ANSI: 6
[  287.463441] scsi 6:0:0:1: Enclosure         WD       SES Device       1059 PQ: 0 ANSI: 6
[  287.464123] sd 6:0:0:0: Attached scsi generic sg2 type 0
[  287.464546] scsi 6:0:0:1: Attached scsi generic sg3 type 13
[  287.466129] sd 6:0:0:0: [sdb] Spinning up disk...
[  287.475357] scsi 6:0:0:1: Failed to get diagnostic page 0x8000002
[  287.475365] scsi 6:0:0:1: Failed to bind enclosure -19
[  287.475393] ses 6:0:0:1: Attached Enclosure device
[  288.467762] ...........ready
[  298.648205] sd 6:0:0:0: [sdb] 976746240 4096-byte logical blocks: (4.00 TB/3.63 TiB)
[  298.648582] sd 6:0:0:0: [sdb] Write Protect is off
[  298.648592] sd 6:0:0:0: [sdb] Mode Sense: 53 00 10 08
[  298.648927] sd 6:0:0:0: [sdb] No Caching mode page found
[  298.648933] sd 6:0:0:0: [sdb] Assuming drive cache: write through
[  298.649996] sd 6:0:0:0: [sdb] 976746240 4096-byte logical blocks: (4.00 TB/3.63 TiB)
[  298.650605] sd 6:0:0:0: [sdb] No Caching mode page found
[  298.650610] sd 6:0:0:0: [sdb] Assuming drive cache: write through
[  298.800377]  sdb: sdb1
[  298.802644] sd 6:0:0:0: [sdb] 976746240 4096-byte logical blocks: (4.00 TB/3.63 TiB)
[  298.803262] sd 6:0:0:0: [sdb] No Caching mode page found
[  298.803268] sd 6:0:0:0: [sdb] Assuming drive cache: write through
[  298.803272] sd 6:0:0:0: [sdb] Attached SCSI disk
[  334.053262] usb 4-3: Disable of device-initiated U1 failed.
[  339.059015] usb 4-3: Disable of device-initiated U2 failed.
[  339.161699] usb 4-3: reset SuperSpeed USB device number 2 using xhci_hcd
[  339.172654] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8802234caa00
[  339.172663] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8802234caa40

Version-Release number of selected component (if applicable):
kernel-3.14.4-200.fc20.x86_64

How reproducible:
100%

Steps to Reproduce:
1. attach drive
2. cause some I/O
OR
3. run "fdisk /dev/sdX"

Additional info:
There are other problems which could be related:
- GNOME Disks does not see SMART attributes for this drive (used to work)
- GNOME Disks does not see partitions on this drive (used to work)

I first thought the drive was broken, but it still works fine (no xHCI message, SMART displayed in GNOME Disks) on older Fedora releases (namely F17 and F18). I also tested on the F19 LiveCD, and there the problem shows up exactly as on F20.

Please tell me what other information I can provide to help fix this.

Comment 1 Michael Monreal 2014-05-30 14:00:29 UTC
Here are some more results of the tests I made:

F17 LiveCD
kernel-3.3.4-5.fc17.x86_64, udisks2-1.94.0-4.fc17.x86_64, libatasmart-0.18-2.fc17.x86_64
=> works perfectly: G-D sees partition and SMART, no xHCI error

F18 LiveCD
kernel-3.6.10-4.fc18.x86_64, udisks2-1.99.0-1.fc18.x86_64, libatasmart-0.19-3.fc18.x86_64
=> works perfectly: G-D sees partition and SMART
=> no xHCI error

F19 LiveCD
kernel-3.9.5-301.fc19.x86_64, udisks2-2.1.0-2.fc19.x86_64, libatasmart-0.19-4.fc19.x86_64
=> G-D does not see partition (only "free space" or "unallocated space")
=> G-D shows "SMART is not supported" (but SMART works with smartctl)
=> xHCI error

F20 LiveCD
kernel-3.11.10-301.fc20.x86_64, udisks2-2.1.1-1.fc20.x86_64, libatasmart-0.19-5.fc20.x86_64
=> G-D does not see partition (only "free space" or "unallocated space")
=> G-D shows "SMART is not supported" (but SMART works with smartctl)
=> xHCI error

F20 Install
kernel-3.14.4-200.fc20.x86_64, udisks2-2.1.2-2.fc20.x86_64, libatasmart-0.19-5.fc20.x86_64
=> G-D does not see partition (only "free space" or "unallocated space")
=> G-D shows "SMART is not supported" (but SMART works with smartctl)
=> xHCI error

So kernel-3.6.10 still works and kernel-3.9.5 does not. Any idea?
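
The test matrix above can be reduced to a last-good/first-bad pair mechanically. What follows is a minimal Python sketch, not part of the original report: the names `version_key` and `regression_range` are hypothetical, and it assumes the failure is monotonic across kernel versions (once broken, it stays broken). Since udisks2 and libatasmart also changed between these releases, the resulting range points at a suspect release, not necessarily at the kernel.

```python
import re

# (kernel package, works?) pairs taken from the LiveCD/install tests above.
RESULTS = [
    ("kernel-3.3.4-5.fc17.x86_64", True),
    ("kernel-3.6.10-4.fc18.x86_64", True),
    ("kernel-3.9.5-301.fc19.x86_64", False),
    ("kernel-3.11.10-301.fc20.x86_64", False),
    ("kernel-3.14.4-200.fc20.x86_64", False),
]


def version_key(kernel):
    """Extract the first x.y.z version as a sortable tuple of ints."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", kernel)
    if match is None:
        raise ValueError(f"no x.y.z version found in {kernel!r}")
    return tuple(int(part) for part in match.groups())


def regression_range(results):
    """Return (last_good, first_bad), assuming a monotonic regression."""
    last_good = None
    first_bad = None
    for kernel, works in sorted(results, key=lambda item: version_key(item[0])):
        if first_bad is not None:
            break  # everything after the first failure is outside the range
        if works:
            last_good = kernel
        else:
            first_bad = kernel
    return last_good, first_bad


last_good, first_bad = regression_range(RESULTS)
print("last good:", last_good)   # kernel-3.6.10-4.fc18.x86_64
print("first bad:", first_bad)   # kernel-3.9.5-301.fc19.x86_64
```

Sorting on the numeric tuple matters here: a plain string sort would place 3.11 and 3.14 before 3.3.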

Comment 2 Michael Monreal 2014-05-30 14:07:33 UTC
One more thing:

lspci | grep USB
00:14.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB xHCI Host Controller (rev 04)
00:1a.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #2 (rev 04)
00:1d.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #1 (rev 04)

The machine is a Thinkpad Edge S430, it only has two USB ports (both USB3 capable, so I cannot test this on a USB2-only port).

Comment 3 Michael Monreal 2014-06-05 19:32:55 UTC
I tried a recent Rawhide/F21 live ISO with kernel 3.15 rc7 and did /not/ run into the problem. Is it possible to try the 3.15 kernel on F20 to verify?

Comment 4 Michael Monreal 2014-06-05 22:07:33 UTC
After installing and booting the F21 kernel the problem was not fixed. I then recompiled the udisks2 SRPM and now the problem is gone - even on the original F20 kernel.

So this is not a kernel/driver bug after all. Please reassign to udisks2 if possible.

Comment 5 Juanjo Villaplana 2014-06-17 12:18:40 UTC
Similar issue here, Conceptronic chddockusb3 on a USB3 port:

# lspci | grep USB
00:14.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB xHCI (rev 04)
00:1a.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB EHCI #2 (rev 04)
00:1d.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB EHCI #1 (rev 04)

# dmesg
[  123.811985] scsi 4:0:0:0: Direct-Access     Maxtor 6 V080E0           VA11 PQ: 0 ANSI: 2
[  123.812782] sd 4:0:0:0: Attached scsi generic sg1 type 0
[  123.817057] sd 4:0:0:0: [sdb] 160084412 512-byte logical blocks: (81.9 GB/76.3 GiB)
[  123.819424] sd 4:0:0:0: [sdb] Write Protect is off
[  123.819439] sd 4:0:0:0: [sdb] Mode Sense: 03 00 00 00
[  123.819956] sd 4:0:0:0: [sdb] No Caching mode page found
[  123.819966] sd 4:0:0:0: [sdb] Assuming drive cache: write through
[  123.824988] sd 4:0:0:0: [sdb] No Caching mode page found
[  123.824995] sd 4:0:0:0: [sdb] Assuming drive cache: write through
[  123.849795]  sdb: unknown partition table
[  124.869215] sd 4:0:0:0: [sdb] No Caching mode page found
[  124.869220] sd 4:0:0:0: [sdb] Assuming drive cache: write through
[  124.869224] sd 4:0:0:0: [sdb] Attached SCSI disk
[  140.473469] xhci_hcd 0000:00:14.0: ERROR Unknown event condition, HC probably busted
[  170.912998] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  170.929182] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  170.929192] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  172.087212] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  172.103507] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  172.103511] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  174.210977] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  174.227058] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  174.227062] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  175.593702] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  175.609721] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  175.609725] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  176.979163] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  176.995608] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  176.995613] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  178.361953] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  178.377868] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  178.377871] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  179.729659] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  179.745940] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  179.745949] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  180.950305] xhci_hcd 0000:00:14.0: ERROR Unknown event condition, HC probably busted

# uname -a
Linux macarulla.si 3.14.7-200.fc20.x86_64 #1 SMP Wed Jun 11 22:38:05 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux

# rpm -qa "*udisks2*"
udisks2-2.1.3-2.fc20.x86_64
libudisks2-2.1.3-2.fc20.x86_64

Note that "udisks2-2.1.3-2.fc20" has been built from fc21.

The issue *persists* even using the updated version of udisks2.
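
The reset loop in the dmesg output above can be tallied with a short script, which makes it easier to compare runs across kernels. This is a rough Python sketch under stated assumptions: the function name `summarize` and the counter keys are hypothetical, the sample is an excerpt of the log lines quoted in this comment, and the matching relies on the exact message strings shown there, which may differ in other kernel versions.

```python
import re
from collections import Counter

# Matches "[  170.912998] message" and captures timestamp and message.
LINE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")

# Excerpt of the dmesg output quoted in this comment.
SAMPLE = """\
[  140.473469] xhci_hcd 0000:00:14.0: ERROR Unknown event condition, HC probably busted
[  170.912998] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  170.929182] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  170.929192] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
[  172.087212] usb 4-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  172.103507] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a400
[  172.103511] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d909a440
"""


def summarize(dmesg_text):
    """Count xHCI-related events and return the gaps between device resets."""
    counts = Counter()
    reset_times = []
    for raw in dmesg_text.splitlines():
        match = LINE.match(raw.strip())
        if match is None:
            continue  # skip lines without a kernel timestamp
        timestamp, message = float(match.group(1)), match.group(2)
        if "reset SuperSpeed USB device" in message:
            counts["resets"] += 1
            reset_times.append(timestamp)
        elif "xhci_drop_endpoint called with disabled ep" in message:
            counts["drop_endpoint_warnings"] += 1
        elif "HC probably busted" in message:
            counts["hc_errors"] += 1
    # Seconds between consecutive resets, rounded to milliseconds.
    gaps = [round(later - earlier, 3)
            for earlier, later in zip(reset_times, reset_times[1:])]
    return counts, gaps


counts, gaps = summarize(SAMPLE)
print(dict(counts))
print("seconds between resets:", gaps)
```

To run it against a full log, the text could be read from a saved `dmesg` dump instead of the `SAMPLE` excerpt.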

Comment 6 Justin M. Forbes 2014-11-13 16:02:36 UTC
*********** MASS BUG UPDATE **************

We apologize for the inconvenience.  There is a large number of bugs to go through and several of them have gone stale.  Due to this, we are doing a mass bug update across all of the Fedora 20 kernel bugs.

Fedora 20 has now been rebased to 3.17.2-200.fc20.  Please test this kernel update (or newer) and let us know if your issue has been resolved or if it is still present with the newer kernel.

If you have moved on to Fedora 21, and are still experiencing this issue, please change the version to Fedora 21.

If you experience different issues, please open a new bug report for those.

Comment 7 Justin M. Forbes 2014-12-10 15:01:54 UTC
This bug is being closed with INSUFFICIENT_DATA as there has not been a response in over 3 weeks. If you are still experiencing this issue, please reopen and attach the relevant data from the latest kernel you are running and any data that might have been requested previously.