Bug 1460789 - Ethernet on Dell's Thunberbolt 3 dock (TB16) randomly stops working
Summary: Ethernet on Dell's Thunberbolt 3 dock (TB16) randomly stops working
Keywords:
Status: NEW
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: rawhide
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-06-12 18:51 UTC by Christian Kellner
Modified: 2020-08-11 02:17 UTC (History)
26 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Type: Bug
Embargoed:


Attachments (Terms of Use)

Description Christian Kellner 2017-06-12 18:51:16 UTC
This is a Dell XPS 13 connected to the network via the TB16 dock.
Kernel is: Linux ag13.local 4.12.0-0.rc3.git0.2.fc27.x86_64 #1 SMP Tue May 30 19:36:51 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

Host controller of the dock:
09:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller

USB network interface in the dock:
/:  Bus 04.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/2p, 5000M
    |__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/7p, 5000M
        |__ Port 2: Dev 3, If 0, Class=Vendor Specific Class, Driver=r8152, 5000M

[32930.573816] usb 4-1.2: new SuperSpeed USB device number 3 using xhci_hcd
[32930.591744] usb 4-1.2: New USB device found, idVendor=0bda, idProduct=8153
[32930.591752] usb 4-1.2: New USB device strings: Mfr=1, Product=2, SerialNumber=6
[32930.591757] usb 4-1.2: Product: USB 10/100/1000 LAN
[32930.591761] usb 4-1.2: Manufacturer: Realtek
[32930.591766] usb 4-1.2: SerialNumber: 000001000000
[32930.739428] usb 4-1.2: reset SuperSpeed USB device number 3 using xhci_hcd

I *sometimes* get the following in the log and with that the ethernet port stops working. 
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec010 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec020 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec030 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec040 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec050 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec060 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec070 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Jun 12 19:00:04 ag13.local kernel: xhci_hcd 0000:09:00.0: Looking for event-dma 00000001c3eec080 trb-start 00000001c3eebfe0 trb-end 00000001c3eebfe0 seg-start 00000001c3eeb000 seg-end 00000001c3eebff0

earlier in the log (and maybe related):
Jun 12 18:39:08 ag13.local kernel: WARNING: CPU: 0 PID: 0 at net/core/dev.c:5411 net_rx_action+0x2d0/0x3c0
Jun 12 18:39:08 ag13.local kernel: Modules linked in: cdc_ether usbnet r8152 mii snd_usb_audio snd_usbmidi_lib snd_rawmidi rfcomm fuse ccm xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack libcrc32c iptable_mangle iptable_raw iptable_security ebtable_filter ebtables ip6table_filter ip6_tables cmac arc4 bnep sunrpc vfat fat hid_multitouch intel_rapl snd_soc_skl x86_pkg_temp_thermal snd_soc_skl_ipc intel_powerclamp snd_soc_sst_ipc ath10k_pci snd_soc_sst_dsp ath10k_core snd_hda_ext_core
Jun 12 18:39:08 ag13.local kernel:  coretemp snd_soc_sst_match kvm_intel snd_hda_codec_hdmi snd_soc_core kvm mei_wdt mac80211 snd_hda_codec_realtek snd_hda_codec_generic iTCO_wdt snd_compress iTCO_vendor_support i2c_designware_platform i2c_designware_core snd_pcm_dmaengine dell_wmi irqbypass ac97_bus crct10dif_pclmul crc32_pclmul snd_hda_intel dell_laptop dell_smbios dcdbas ghash_clmulni_intel snd_hda_codec intel_cstate uvcvideo intel_uncore videobuf2_vmalloc videobuf2_memops snd_hda_core videobuf2_v4l2 snd_hwdep intel_rapl_perf snd_seq videobuf2_core snd_seq_device ath snd_pcm videodev cfg80211 joydev hci_uart btusb snd_timer btrtl snd rtsx_pci_ms media soundcore memstick i2c_i801 idma64 btbcm mei_me mei btqca btintel intel_pch_thermal shpchp processor_thermal_device intel_lpss_pci intel_soc_dts_iosf bluetooth wmi intel_vbtn
Jun 12 18:39:08 ag13.local kernel:  soc_button_array acpi_als kfifo_buf int3403_thermal ecdh_generic intel_hid intel_lpss_acpi rfkill pinctrl_sunrisepoint intel_lpss sparse_keymap pinctrl_intel industrialio int340x_thermal_zone int3400_thermal acpi_thermal_rel tpm_tis tpm_tis_core acpi_pad tpm btrfs xor i915 rtsx_pci_sdmmc mmc_core raid6_pq i2c_algo_bit drm_kms_helper drm crc32c_intel rtsx_pci serio_raw i2c_hid video
Jun 12 18:39:08 ag13.local kernel: CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.12.0-0.rc3.git0.2.fc27.x86_64 #1
Jun 12 18:39:08 ag13.local kernel: Hardware name: Dell Inc. XPS 13 9360/05HM5Y, BIOS 1.3.2 01/18/2017
Jun 12 18:39:08 ag13.local kernel: task: ffffffffabe104c0 task.stack: ffffffffabe00000
Jun 12 18:39:08 ag13.local kernel: RIP: 0010:net_rx_action+0x2d0/0x3c0
Jun 12 18:39:08 ag13.local kernel: RSP: 0018:ffff8faabe403eb8 EFLAGS: 00010297
Jun 12 18:39:08 ag13.local kernel: RAX: 0000000000000041 RBX: 00000000ffffffff RCX: ffff8faaa4a81e50
Jun 12 18:39:08 ag13.local kernel: RDX: ffff8fa86f802000 RSI: 00000000fffffe01 RDI: ffffffffc0b476b0
Jun 12 18:39:08 ag13.local kernel: RBP: ffff8faabe403f28 R08: 0000000000000002 R09: 0000000000000000
Jun 12 18:39:08 ag13.local kernel: R10: 0000000000100000 R11: ffffffffab44a0b0 R12: ffff8faaa4a87ba0
Jun 12 18:39:08 ag13.local kernel: R13: 0000000000000040 R14: 0000000000000000 R15: 0000000000000003
Jun 12 18:39:08 ag13.local kernel: FS:  0000000000000000(0000) GS:ffff8faabe400000(0000) knlGS:0000000000000000
Jun 12 18:39:08 ag13.local kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 12 18:39:08 ag13.local kernel: CR2: 00007fd0eee69000 CR3: 000000013fe09000 CR4: 00000000003406f0
Jun 12 18:39:08 ag13.local kernel: Call Trace:
Jun 12 18:39:08 ag13.local kernel:  <IRQ>
Jun 12 18:39:08 ag13.local kernel:  __do_softirq+0xea/0x2bf
Jun 12 18:39:08 ag13.local kernel:  irq_exit+0xfb/0x100
Jun 12 18:39:08 ag13.local kernel:  do_IRQ+0x80/0xd0
Jun 12 18:39:08 ag13.local kernel:  common_interrupt+0x93/0x93
Jun 12 18:39:08 ag13.local kernel: RIP: 0010:cpuidle_enter_state+0x12b/0x2d0
Jun 12 18:39:08 ag13.local kernel: RSP: 0018:ffffffffabe03dc8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff3b
Jun 12 18:39:08 ag13.local kernel: RAX: ffff8faabe41a0c0 RBX: 0000179af34e9083 RCX: 000000000000001f
Jun 12 18:39:08 ag13.local kernel: RDX: 0000179af34e9083 RSI: ffff8faabe4178d8 RDI: 0000000000000000
Jun 12 18:39:08 ag13.local kernel: RBP: ffffffffabe03e08 R08: cccccccccccccccd R09: 0000000000000018
Jun 12 18:39:08 ag13.local kernel: R10: ffffffffabe03d98 R11: 000000000000070b R12: ffff8faabe423800
Jun 12 18:39:08 ag13.local kernel: R13: 0000000000000000 R14: 0000000000000006 R15: ffffffffabf81838
Jun 12 18:39:08 ag13.local kernel:  </IRQ>
Jun 12 18:39:08 ag13.local kernel:  ? cpuidle_enter_state+0x11b/0x2d0
Jun 12 18:39:08 ag13.local kernel:  cpuidle_enter+0x17/0x20
Jun 12 18:39:08 ag13.local kernel:  call_cpuidle+0x23/0x40
Jun 12 18:39:08 ag13.local kernel:  do_idle+0x18a/0x1e0
Jun 12 18:39:08 ag13.local kernel:  cpu_startup_entry+0x71/0x80
Jun 12 18:39:08 ag13.local kernel:  rest_init+0x77/0x80
Jun 12 18:39:08 ag13.local kernel:  start_kernel+0x450/0x471
Jun 12 18:39:08 ag13.local kernel:  ? early_idt_handler_array+0x120/0x120
Jun 12 18:39:08 ag13.local kernel:  x86_64_start_reservations+0x29/0x2b
Jun 12 18:39:08 ag13.local kernel:  x86_64_start_kernel+0x143/0x166
Jun 12 18:39:08 ag13.local kernel:  secondary_startup_64+0x9f/0x9f
Jun 12 18:39:08 ag13.local kernel: Code: 8b 7e 08 49 83 c6 18 44 89 e9 44 89 fa 4c 89 e6 41 ff d2 4d 8b 16 4d 85 d2 75 e4 4c 8b 75 90 44 89 f8 41 39 c5 0f 8d 95 fe ff ff <0f> ff 41 39 c5 0f 8f 93 fe ff ff 49 8b 44 24 10 a8 04 0f 85 88 
Jun 12 18:39:08 ag13.local kernel: ---[ end trace 3fda2d590db48512 ]---

Comment 1 Christian Kellner 2017-06-12 18:54:30 UTC
There is an upstream patch for the ASM1042A host controller[1] that has been reported to help with the issue (see corresponding launchpad issue[2]).

[1] http://www.spinics.net/lists/linux-usb/msg157958.html
[2] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1667750

Comment 2 Christian Kellner 2017-06-27 16:15:03 UTC
I added the v5 of the patch[1] to a kernel, scratch build:
https://koji.fedoraproject.org/koji/taskinfo?taskID=20206892

[1] http://www.spinics.net/lists/linux-usb/msg158473.html

Comment 3 Mario Limonciello 2017-07-17 20:44:12 UTC
v5 has landed in the maintainer's tree (to target to 4.13-rcX) along with CC to stable.
https://git.kernel.org/pub/scm/linux/kernel/git/mnyman/xhci.git/commit/?h=for-usb-linus&id=353a73c757c0b856bb95f5e73cf41b10b685258d

Comment 4 Martin Bříza 2017-07-26 10:32:17 UTC
After an initial hiccup with the LAN cable in the dock (and plugging it into a different socket), the performance is now much better (not sure if I can say it's perfect, yet) using the patched kernel.
Thanks!

Comment 5 Christian Kellner 2017-08-22 12:56:22 UTC
For future reference, the mentioned patch git merged upstream, as commit 9da5a1092b13468839b1a864b126cacfb72ad016
It also made it into stable, 4.12.4 I believe, as 5cc9b698a494827b15f74ef70a31d7911d00e52a

So I think this should be fixed (or at least better) in F26, because we currently ship 4.12.5-300.fc26.x86_64

Comment 6 Jiri Eischmann 2017-08-23 10:53:41 UTC
(In reply to Christian Kellner from comment #5)
> For future reference, the mentioned patch git merged upstream, as commit
> 9da5a1092b13468839b1a864b126cacfb72ad016
> It also made it into stable, 4.12.4 I believe, as
> 5cc9b698a494827b15f74ef70a31d7911d00e52a
> 
> So I think this should be fixed (or at least better) in F26, because we
> currently ship 4.12.5-300.fc26.x86_64

The network works, but sadly it corrupts packets. Martin says because of it he has difficulties to download things, connect to services...

Comment 7 Mario Limonciello 2017-08-23 16:12:50 UTC
@Jiri,

Are you sure that's a result of this patch?  This is the first report i've heard of that.

Comment 8 Christian Kellner 2017-08-25 12:10:12 UTC
@Mario, I think what Jiri means is that without the patch it doesn't work well at all but even with the patch the situation is not perfect. Let me cc Benjamin, maybe we can add a test in our Fedora Hardware test suit for that. We still have the TB16 dock in Munich right now, maybe we can be of help.

Comment 9 Jiri Eischmann 2017-08-25 12:16:55 UTC
I'll let Martin speak for himself because it was him who complained about it to me.
I've been using kernel 4.12.8 which should have the patch included since the morning and haven't experienced any noticeable problems with the network.

Comment 10 Martin Bříza 2017-08-25 12:37:27 UTC
Yes, for me, the Ethernet on the Docks is pretty broken. For example, when downloading a whole Koji build with about 13 packages, each time the download got broken at about 4th or 5th package, with (I think) a SSL handshake error. Also when downloading a Fedora ISO 4 times in a row, each of them got corrupted (md5 check just didn't pass).

Also, the USB performance of the dock is terrible, I'm not sure if this is related to the issue the patch in question is supposed to solve but after updating the laptop firmware to 1.2.1.0, my mouse and keyboard get disconnected very often. On the other hand, dock audio works just fine and one would assume all of these devices are on the same USB hub.

I'm currently working around this by plugging a USB-C adapter with ethernet into the Thunderbolt port on the docking station.

Comment 11 Benjamin Berg 2017-08-25 13:22:26 UTC
Martin, could you maybe try disabling RC checksum offloading and see if that helps? Then the corrupted packages should be discarded by the kernel (even if they are only corrupted during the transfer over USB). i.e. try again after running:

  ethtool --offload $DEVICE rx off

Comment 12 Mario Limonciello 2017-08-25 15:55:50 UTC
@Martin

Just to make sure - this is a TB16 not TB15 right?  This is sounding suspiciously like a hardware problem to me.

Comment 13 Jiri Eischmann 2017-08-25 16:16:47 UTC
(In reply to Mario Limonciello from comment #12)
> @Martin
> 
> Just to make sure - this is a TB16 not TB15 right?  This is sounding
> suspiciously like a hardware problem to me.

It's TB16.
You mean the ethernet or USB problem? I think we've started mixing two (most likely) unrelated problems. I have not been able to reproduce the ethernet problem for the whole day. Martin also has Windows 10 installed on his XPS 13, so he could try it there and if the problem still occurs it's very likely a hardware problem.

The USB one doesn't seem like a hardware problem because I'm affected by that, too, after the last firmware update. Devices connected to the USB ports don't work at all or just for a short period of time after they're plugged in.

Comment 14 Mario Limonciello 2017-08-25 16:21:59 UTC
Well i'm not sure if they're related, but since the Ethernet device is a USB device on the hub, I would suspect them to be.

Can you please clarify which XPS machine you guys are affected?  There are at least 4 different XPS models that support TB16.
Please comment your last working and last failed BIOS versions too.

Comment 15 Jiri Eischmann 2017-08-25 17:32:15 UTC
We both have XPS 13 9360. I had problems with Ethernet from the very beginning until I used a patched kernel. But after updating the firmware to 1.3.7 USB devices stopped working*. Now we're on 2.1.0 and they still don't work, no matter if we use the kernel patch or not. I have to have a USB hub connected directly to the laptop. The last working firmware for me was 1.3.5.

* It really depends on the type of the device. The mouse and keyboard don't work at all or just for a very short time after plugging in. I also have a USB sound card. It seems to work, the system identifies the sound card as an audio output, it plays sound, but there are audible corruptions (cracks etc) which don't occur when the sound card is connected directly to the laptop. What I'm experiencing with sound may be similar to what Martin is experiencing with the Ethernet.

Comment 16 Mario Limonciello 2017-08-25 17:51:45 UTC
Ah OK thanks.  I just poked around the Dell forums a little bit and you guys aren't the first ones reporting this on 9360 after upgrade.

http://en.community.dell.com/support-forums/laptop/f/3518/t/20017063?pi41097=1

I'll poke some of the Dell support guys to look at this, it sounds like it might have slipped through the cracks.

I also checked internally on what went into 1.3.6/1.3.7.
At least 1.3.6 had some tweaks for adressing noise which would be most suspicious to me as a possible impact.

For now, can you two downgrade to 1.3.5?  Fwupd probably won't let you, but you can place the .EXE file on a FAT32 partition and do it from F12 menu at POST I expect.

Comment 17 Jiri Eischmann 2017-08-25 18:03:50 UTC
We'll try to downgrade for the time being. BTW I also reported the issue to @DellCaresPRO like Barton George instructed me on Twitter. They said 10 days ago they had people looking into it, but there hasn't been any update since then, so I have no idea if someone is really looking into it and if they've made any progress, and who is "they".

Comment 18 Mario Limonciello 2017-08-25 19:23:29 UTC
I won't be able to shortcut the process by pinging people, but I understand this is being investigated, it will just take some time.

Comment 19 Martin Bříza 2017-08-28 09:05:50 UTC
(In reply to Benjamin Berg from comment #11)
> Martin, could you maybe try disabling RC checksum offloading and see if that
> helps? Then the corrupted packages should be discarded by the kernel (even
> if they are only corrupted during the transfer over USB). i.e. try again
> after running:
> 
>   ethtool --offload $DEVICE rx off

With this, it seems to work alright, thanks! Kernel 4.13.0-0.rc5.git1.1.fc27.x86_64 BTW.

(In reply to Mario Limonciello from comment #16)
> For now, can you two downgrade to 1.3.5?  Fwupd probably won't let you, but
> you can place the .EXE file on a FAT32 partition and do it from F12 menu at
> POST I expect.

I'm able to function this way so I'll probably not go for that - unless it'll be necessary to verify it actually happened between the mentioned versions. 
I'd rather track if there's a new release and then upgrade when it's out and see if it fixes the USB problem.

Comment 20 Martin Bříza 2017-08-30 12:17:03 UTC
Every now and then (especially when downloading large files), the ethernet simply stops working with the following log in dmesg.
Unloading the r8152 module results in gnome-shell dying. After reloading it, ethernet still doesn't work. Disconnecting the Dock in this state kills everything from GDM down to my user session.

[159642.248648] pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
[159642.248666] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)                                      
[159642.248680] pcieport 0000:00:1c.0:   device [8086:9d10] error status/mask=00001000/00002000                                                               
[159642.248690] pcieport 0000:00:1c.0:    [12] Replay Timer Timeout            
[159661.087306] xhci_hcd 0000:0a:00.0: port 1 resume PLC timeout
[159667.687492] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.687514] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc010 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.687610] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.687627] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc020 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.687722] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.687735] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc030 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.687829] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.687838] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc040 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.687971] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.687988] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc050 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.723135] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.723158] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc060 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.723202] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.723219] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc070 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159667.723353] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13                                        
[159667.723377] xhci_hcd 0000:09:00.0: Looking for event-dma 00000004694bc080 trb-start 00000003a0cfefe0 trb-end 00000003a0cfefe0 seg-start 00000003a0cfe000 seg-end 00000003a0cfeff0
[159677.276387] xhci_hcd 0000:0a:00.0: port 1 resume PLC timeout
[159680.097507] r8152 4-1.2:1.0 enp9s0u1u2: Tx timeout
[159680.119840] net_ratelimit: 329 callbacks suppressed
[159680.119852] r8152 4-1.2:1.0 enp9s0u1u2: Tx status -2
[159680.132667] r8152 4-1.2:1.0 enp9s0u1u2: Tx status -2
[159680.142497] r8152 4-1.2:1.0 enp9s0u1u2: Tx status -2
[159680.152489] r8152 4-1.2:1.0 enp9s0u1u2: Tx status -2
[159682.780110] usb 4-1.2: reset SuperSpeed USB device number 3 using xhci_hcd
[159700.007384] xhci_hcd 0000:09:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13

Comment 21 Mario Limonciello 2017-08-30 13:41:16 UTC
As I understand the particular problem linked with the issue in BIOS 1.3.6/1.37 adjusts a voltage regulator (to fix something else; this was an unanticipated/undiscovered regression).  I would recommend for now to downgrade to 1.3.5 until a fixed BIOS is issued.

Comment 22 Martin Bříza 2017-09-05 10:13:36 UTC
It got really annoying lately. How do I downgrade to 1.3.5, please? I can't find it on the Dell website and fwupd doesn't provide anything too.

Comment 23 Martin Bříza 2017-09-11 13:46:40 UTC
Running kernel-4.13.0-1.fc27.x86_64.

BIOS 2.2.1 finally hit the Dell website. I can confirm that with this, the USB overall experience is now much much better (except the occasional mouse stutter but that may as well be on the OS side). There seems to be no problem at all with the dock Ethernet adapter.

Comment 24 Martin Bříza 2017-10-10 11:30:15 UTC
On 4.13.4-300.fc27.x86_64, I still experience the SSL errors when downloading larger amounts of data, like git repositories and such. It gets fixed after disabling RC checksum offloading with the ethtool command you have provided before.

Comment 25 Gerben Meijer 2017-11-14 22:42:02 UTC
This is happening even on my 9560 with 4.13.9 vanilla; when running a background rsync backup job, packages downloaded in a Debian docker build frequently do not match their checksum and need multiple runs to succeed.

Comment 26 Gerben Meijer 2017-11-14 23:00:22 UTC
And just to illustrate my point, on 4.14.0 vanilla:

while true; do 
dd if=/nfsmount/debian-live-9.1.0-amd64-xfce+nonfree.iso bs=16M iflag=direct 2>/dev/null | sha1sum; done

With rx offload on (default):

489ed92b17aa9a4582899356d3123621b5d92189  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
f11ba5f624dbab5a52319801c28a7032cc9b5100  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
e925ff013c99a1b732a99aeaf5d3f1f02c8dfa40  -

With rx offload off:

742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -

Comment 27 Kai-Heng Feng 2018-01-16 08:50:49 UTC
Please try this patch:
https://lkml.org/lkml/2018/1/16/102

Comment 28 Gerben Meijer 2018-01-19 18:52:06 UTC
Applied to 4.14.14. Offload:

tcp-segmentation-offload: on
udp-fragmentation-offload: off
generic-segmentation-offload: on
generic-receive-offload: on
rx-vlan-offload: on
tx-vlan-offload: on

dd | sha1sum loop:

742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -
742462292c76189f63fc3e7af1acc9dec56c0a8d  -

Ran for 10 minutes, so looks like that patch works (doing around 90mbit/s of traffic).

Comment 29 Gerben Meijer 2018-02-21 14:56:01 UTC
On 4.15.4 I see a lot of:

Feb 21 15:43:31 localhost kernel: [18401.483078] pcieport 0000:00:1d.6: AER: Corrected error received: id=00ee
Feb 21 15:43:31 localhost kernel: [18401.483095] pcieport 0000:00:1d.6: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00ee(Transmitter ID)
Feb 21 15:43:31 localhost kernel: [18401.483097] pcieport 0000:00:1d.6:   device [8086:a11e] error status/mask=00001000/00002000
Feb 21 15:43:31 localhost kernel: [18401.483099] pcieport 0000:00:1d.6:    [12] Replay Timer Timeout  


Which may or may not be related. However, randomly, r8152 stops working entirely. Most recent dmesg:


Feb 21 15:43:42 localhost kernel: [18412.136941] ------------[ cut here ]------------
Feb 21 15:43:42 localhost kernel: [18412.136947] NETDEV WATCHDOG: enxa44cc8d0edff (r8152): transmit queue 0 timed out
Feb 21 15:43:42 localhost kernel: [18412.136969] WARNING: CPU: 1 PID: 0 at net/sched/sch_generic.c:323 dev_watchdog+0x215/0x220
Feb 21 15:43:42 localhost kernel: [18412.136972] Modules linked in: sg uas usb_storage rfcomm nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter ctr ccm xt_C
HECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter devlink cmac bnep binfmt_misc snd_usb_audio cdc_ether usbnet snd_usbmidi_lib r8152 snd_rawmidi snd_seq_device mii btusb btrtl uvcvideo btbcm btintel videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 bluetooth videodev videobuf2_core ecdh_generic joydev mousedev hid_multitouch snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic arc4 i2c_designware_platform
Feb 21 15:43:42 localhost kernel: [18412.137037]  i2c_designware_core iwlmvm input_leds i2c_hid mac80211 dell_smm_hwmon x86_pkg_temp_thermal crc32_pclmul iwlwifi crc32c_intel i915 snd_hda_intel ghash_clmulni_intel pcbc snd_hda_codec aesni_intel snd_hwdep aes_x86_64 snd_hda_core crypto_simd sha256_mb snd_pcm_oss glue_helper mcryptd snd_mixer_oss cryptd sha256_ssse3 snd_pcm snd_timer sha256_generic dell_smbios_wmi snd soundcore cfg80211 pcspkr int3400_thermal rtsx_pci acpi_thermal_rel intel_hid xhci_pci int3403_thermal processor_thermal_device mei_me xhci_hcd int340x_thermal_zone shpchp intel_lpss_pci mei intel_soc_dts_iosf intel_pch_thermal intel_lpss loop vhost_net tun vhost tap coretemp i2c_i801 kvm_intel kvm irqbypass uinput evdev nfsd ip_tables x_tables
Feb 21 15:43:42 localhost kernel: [18412.137101] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G     U           4.15.4 #5
Feb 21 15:43:42 localhost kernel: [18412.137104] Hardware name: Dell Inc. XPS 15 9560/05FFDN, BIOS 1.7.0 12/15/2017
Feb 21 15:43:42 localhost kernel: [18412.137108] RIP: 0010:dev_watchdog+0x215/0x220
Feb 21 15:43:42 localhost kernel: [18412.137112] RSP: 0018:ffff88087e443ea0 EFLAGS: 00010286
Feb 21 15:43:42 localhost kernel: [18412.137116] RAX: 0000000000000044 RBX: 0000000000000000 RCX: 0000000000000103
Feb 21 15:43:42 localhost kernel: [18412.137119] RDX: 0000000080000103 RSI: ffffffff82063a3a RDI: 00000000ffffffff
Feb 21 15:43:42 localhost kernel: [18412.137122] RBP: ffff8808581a645c R08: ffffffff815daba0 R09: 0000000000001cb5
Feb 21 15:43:42 localhost kernel: [18412.137125] R10: 0000000000000282 R11: ffffffff827c2f8d R12: ffff8808581a6478
Feb 21 15:43:42 localhost kernel: [18412.137127] R13: ffff8808581a6000 R14: 0000000000000001 R15: ffff8808562df880
Feb 21 15:43:42 localhost kernel: [18412.137131] FS:  0000000000000000(0000) GS:ffff88087e440000(0000) knlGS:0000000000000000
Feb 21 15:43:42 localhost kernel: [18412.137134] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 21 15:43:42 localhost kernel: [18412.137137] CR2: 0000005a52651000 CR3: 000000000220a006 CR4: 00000000003606e0
Feb 21 15:43:42 localhost kernel: [18412.137139] Call Trace:
Feb 21 15:43:42 localhost kernel: [18412.137143]  <IRQ>
Feb 21 15:43:42 localhost kernel: [18412.137148]  ? dev_graft_qdisc+0x70/0x70
Feb 21 15:43:42 localhost kernel: [18412.137152]  ? dev_graft_qdisc+0x70/0x70
Feb 21 15:43:42 localhost kernel: [18412.137158]  call_timer_fn+0x11/0x70
Feb 21 15:43:42 localhost kernel: [18412.137163]  expire_timers+0x8e/0xa0
Feb 21 15:43:42 localhost kernel: [18412.137169]  run_timer_softirq+0x9f/0x140
Feb 21 15:43:42 localhost kernel: [18412.137174]  ? timerqueue_add+0x52/0x80
Feb 21 15:43:42 localhost kernel: [18412.137180]  ? recalibrate_cpu_khz+0x10/0x10
Feb 21 15:43:42 localhost kernel: [18412.137185]  ? ktime_get+0x39/0x90
Feb 21 15:43:42 localhost kernel: [18412.137191]  __do_softirq+0xd6/0x1e0
Feb 21 15:43:42 localhost kernel: [18412.137197]  irq_exit+0x91/0xa0
Feb 21 15:43:42 localhost kernel: [18412.137203]  smp_apic_timer_interrupt+0x60/0x90
Feb 21 15:43:42 localhost kernel: [18412.137207]  apic_timer_interrupt+0x95/0xa0
Feb 21 15:43:42 localhost kernel: [18412.137210]  </IRQ>
Feb 21 15:43:42 localhost kernel: [18412.137215] RIP: 0010:cpuidle_enter_state+0x129/0x200
Feb 21 15:43:42 localhost kernel: [18412.137218] RSP: 0018:ffffc900031dfed8 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff11
Feb 21 15:43:42 localhost kernel: [18412.137222] RAX: ffff88087e440000 RBX: 0000000000000006 RCX: 000000000000001f
Feb 21 15:43:42 localhost kernel: [18412.137225] RDX: 20c49ba5e353f7cf RSI: ffffffff82063a3a RDI: ffffffff820657d3
Feb 21 15:43:42 localhost kernel: [18412.137228] RBP: ffff88087e469500 R08: 00000000000016e5 R09: 000000000000050e
Feb 21 15:43:42 localhost kernel: [18412.137231] R10: ffffc900031dfeb8 R11: 0000000000000c7f R12: 000010bee8e19d31
Feb 21 15:43:42 localhost kernel: [18412.137233] R13: 000010bee8be5300 R14: 0000000000000006 R15: 0000000000000000
Feb 21 15:43:42 localhost kernel: [18412.137239]  ? cpuidle_enter_state+0x11e/0x200
Feb 21 15:43:42 localhost kernel: [18412.137246]  do_idle+0x176/0x1b0
Feb 21 15:43:42 localhost kernel: [18412.137251]  cpu_startup_entry+0x14/0x20
Feb 21 15:43:42 localhost kernel: [18412.137256]  start_secondary+0x142/0x170
Feb 21 15:43:42 localhost kernel: [18412.137260]  secondary_startup_64+0xa5/0xb0
Feb 21 15:43:42 localhost kernel: [18412.137264] Code: 00 49 63 4c 24 e8 eb 8c 4c 89 ef c6 05 b3 06 ab 00 01 e8 3f e7 fd ff 89 d9 4c 89 ee 48 c7 c7 b8 f4 0f 82 48 89 c2 e8 3b 4c 86 ff <0f> ff eb be 0f 1f 80 00 00 00 00 48 c7 47 08 00 00 00 00 48 c7 
Feb 21 15:43:42 localhost kernel: [18412.137323] ---[ end trace a9a46c2c7b1cae38 ]---



Which is then followed by:

Feb 21 15:43:42 localhost kernel: [18412.137329] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx timeout
Feb 21 15:43:46 localhost kernel: [18417.000936] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx timeout
Feb 21 15:43:49 localhost kernel: [18419.814431] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx status -2
Feb 21 15:43:49 localhost kernel: [18419.824232] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx status -2
Feb 21 15:43:49 localhost kernel: [18419.834254] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx status -2
Feb 21 15:43:49 localhost kernel: [18419.844304] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx status -2
Feb 21 15:43:52 localhost kernel: [18422.120902] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx timeout
Feb 21 15:43:56 localhost kernel: [18426.984828] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx timeout
Feb 21 15:44:02 localhost kernel: [18432.104805] r8152 4-1.2:1.0 enxa44cc8d0edff: Tx timeout


Only a reboot fixes it. I am considering using the TB16 as a doorstop; it's more stable that way.

Comment 30 Jarod Wilson 2018-03-29 21:09:35 UTC
Looks like this is more of a firmware issue with these docks and/or a driver issue with the 8152, so I'm throwing this back onto the queue where it was.

Comment 31 marianne@tuxette.fr 2018-06-18 15:47:32 UTC
I think I have the same issue with my laptop and dock (Dell TB16). 
Laptop is new and installed in Fedora 28. All firmware are up-to-date. 

Ethernet works fine unless I want to transfert a large amount of data. Session (sftp, rsync or scp) cut abruptly after a few seconds. Nothing relevant appears in system logs.

If I offload the RC checksums (as suggested above) using :  ethtool --offload enp11s0u1u2 rx off  
Everything works fine. 

Tell me if you need more logs or informations

Comment 32 Mario Limonciello 2018-06-18 15:53:00 UTC
FYI this commit ended up landing related to this.  I would recommend to backport it.

https://github.com/torvalds/linux/commit/0b1655143df00ac5349f27b765b2ed13a3ac40ca

Comment 33 Jeremy Cline 2018-06-18 16:05:06 UTC
Hi Mario, thanks for the pointer. Fedora stable releases are currently on 4.16.15 so that fix should be in place. I've got a TB16 at home so I can also try to reproduce this on Fedora 28 this evening. 

marianne, adding the dmesg logs would be helpful. Thanks!

Comment 34 Ondřej Kolín 2018-08-07 13:52:56 UTC
Our bug report from Launchpad:

Hi.

Large amount of data gets corrupted when using the TB16 ethernet port. (rsync synchronization, etc... )

Linux E7490 4.15.0-29-generic #31-Ubuntu SMP Tue Jul 17 15:39:52 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

On my Fedora is this still an issue even with announced bugfix (link copied from this discussion #78.
Linux username-localdomain 4.17.9-200.fc28.x86_64 #1 SMP Mon Jul 23 21:41:29 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

It's fixed by turning the checksum offload off (tested on the Fedora .
sudo ethtool --offload enp11s0u1u2 rx off

https://bugs.launchpad.net/dell-sputnik/+bug/1729674

related in bugzilla:

Comment 35 Ondřej Kolín 2018-08-20 06:25:38 UTC
https://lkml.org/lkml/2018/8/20/42 There is a patch in upstream. Turn off the checksum offloading.

Comment 36 Tomáš Trnka 2018-09-12 14:36:19 UTC
The issue is not unique to the integrated NIC in the dock (so the current workaround in r8152 is not sufficient). I have a r8152-based TP-LINK UE300 USB3-to-GigE dongle connected to my TB16 dock and I'm getting the same packet corruption when I don't turn off rx checksum offloading.

usb 4-1.1.1: new SuperSpeed Gen 1 USB device number 5 using xhci_hcd
usb 4-1.1.1: New USB device found, idVendor=2357, idProduct=0601, bcdDevice=30.00
usb 4-1.1.1: New USB device strings: Mfr=1, Product=2, SerialNumber=6
usb 4-1.1.1: Product: USB 10/100/1000 LAN
usb 4-1.1.1: Manufacturer: TP-LINK
usb 4-1.1.1: SerialNumber: 000001000000

/:  Bus 04.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/2p, 5000M
    |__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/7p, 5000M
        |__ Port 1: Dev 3, If 0, Class=Hub, Driver=hub/4p, 5000M
            |__ Port 1: Dev 5, If 0, Class=Vendor Specific Class, Driver=r8152, 5000M
            |__ Port 4: Dev 6, If 0, Class=Hub, Driver=hub/2p, 5000M
        |__ Port 2: Dev 4, If 0, Class=Vendor Specific Class, Driver=r8152, 5000M

The dongle is plugged into the internal USB hub in my Dell U2715H screen, which is in turn plugged into the TB16 (latest firmware 1.0.0), connected to my XPS 15 9560 (latest BIOS 1.11.0, Linux 4.18.7-200.fc28.x86_64 at the moment).

I've also seen someone mentioning that (some) USB3 ports on the TB16 are in fact Alpine Ridge pass-through. That does not seem to be the case here, all three ports on my TB16 go through the ASMedia host controller:

0e:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller

The r8152 workaround triggers just fine for the integrated NIC in the dock:

usb 4-1.2: reset SuperSpeed Gen 1 USB device number 4 using xhci_hcd
usb 4-1.2: Dell TB16 Dock, disable RX aggregation

Comment 37 Mario Limonciello 2018-09-12 14:38:51 UTC
@Tomas,

It sounds like the topology needs to be looked at then for applying this quirk.

Can you connect the dongle to the USB-C port with C-A adapter?  That is the AR pass through port.

Comment 38 Tomáš Trnka 2018-09-12 15:32:46 UTC
Indeed, I found the mention of the pass-through only applying to the USB-C like a minute after I wrote my previous comment. Sorry for the noise.

I don't have a C-A adapter at hand, but I've tried using the Dell DA200 adapter instead (not exactly the same thing as it's an extra hub, but hopefully it helps anyway). So the topology is:

Dongle -> DA200 (hub) -> USB-C port on the TB16 -> AR host controller

/:  Bus 06.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/2p, 10000M
    |__ Port 2: Dev 2, If 0, Class=Hub, Driver=hub/4p, 5000M
        |__ Port 1: Dev 5, If 0, Class=Vendor Specific Class, Driver=r8152, 5000M
        |__ Port 4: Dev 3, If 0, Class=Vendor Specific Class, Driver=r8152, 5000M

0f:00.0 USB controller: Intel Corporation DSL6540 USB 3.1 Controller [Alpine Ridge]

This setup works fine without any corruption with all offloads on (default).

Comment 39 Kai-Heng Feng 2018-09-13 05:04:18 UTC
IIRC, I tested this scenario, and I didn't observe the issue on external r8152 dongle over the ASMedia xHC host.

The v1 patch I sent was using topology to check, but maintainers didn't like it.

I'll see if I can come up a "better" version of it so maintainers will accept it.

Comment 40 Tore H. Larsen 2019-02-28 14:23:36 UTC
cc

Comment 41 Tore H. Larsen 2019-03-02 13:21:55 UTC
Ref. bug # 1600126

I updated r8152 to v2.11 per https://aur.archlinux.org/packages/r8152-dkms/  makes things more stable. 

# cd /usr/src/r8152-2.11.0
# patch -p1 <./linux-4.20.0-add-guard-fix.patch

# more /usr/src/r8152-2.11.0/dkms.conf 
PACKAGE_NAME="r8152"
PACKAGE_VERSION="2.11.0"
BUILT_MODULE_NAME[0]="r8152"
DEST_MODULE_LOCATION[0]="/kernel/drivers/net/usb"
AUTOINSTALL="yes"

# ll /var/lib/dkms/r8152/2.11.0/source
lrwxrwxrwx. 1 root root 21 Mar  1 15:22 /var/lib/dkms/r8152/2.11.0/source -> /usr/src/r8152-2.11.0

# dracut -f 

At least my kbd is still working after 30 minutes. A record on kernels above 4.18.18-300.fc29.

Comment 42 Timur Kristóf 2020-03-02 12:11:42 UTC
The same issue still happens to me on kernel 5.5.6-201.fc31.x86_64
Hardware is a Dell XPS 13 9370 with a Lenovo Thunderbolt 3 dock. My dmesg is full of these messages:


[12696.189484] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12702.333456] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12707.965422] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12713.085385] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12718.205360] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12724.349321] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12729.981295] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12735.101256] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12740.221235] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12746.365199] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12751.997171] r8152 6-1:1.0 enp10s0u1: Tx timeout
[12757.117155] r8152 6-1:1.0 enp10s0u1: Tx timeout

Comment 43 Didier 2020-06-04 14:39:35 UTC
This seems to help for me (Dell XPS13 2-in-1 7390 , kernel 5.6.15-300.fc32.x86_64) when switching (exact chain of events undetermined) between Dell DA300 (r8152 : Tx status -71) and Dell WD19TB ThunderBolt docking adapters : 

https://askubuntu.com/questions/1081128/usb-3-0-ethernet-adapter-not-working-ubuntu-18-04

# echo 0bda:8153:k > /sys/module/usbcore/parameters/quirks

Comment 44 Jeremy Akers 2020-06-19 17:07:23 UTC
Seeing a similar issue on a Dell XPS 9300 (2020) with Linux 5.4:

[  110.467608] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.467613] xhci_hcd 0000:08:00.0: Looking for event-dma 000000086900cfd0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.478406] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.478412] xhci_hcd 0000:08:00.0: Looking for event-dma 000000086900cfe0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.479937] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.479942] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06000 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.482654] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.482660] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06010 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.499173] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.499178] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06020 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.505613] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.505618] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06030 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.505676] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.505678] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06040 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.505764] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.505766] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06050 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.507398] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.507405] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06060 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.509353] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.509359] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06070 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.510017] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.510021] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06080 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.516799] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.516806] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06090 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.516930] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.516934] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060a0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.521145] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.521151] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060b0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.546425] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.546431] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060c0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.546773] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.546778] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060d0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.546945] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.546948] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060e0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.547252] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.547256] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c060f0 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.547358] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.547360] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06100 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0
[  110.550637] xhci_hcd 0000:08:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 4 comp_code 13
[  110.550643] xhci_hcd 0000:08:00.0: Looking for event-dma 0000000861c06110 trb-start 000000086900cfb0 trb-end 000000086900cfb0 seg-start 000000086900c000 seg-end 000000086900cff0

I've tried 5 different USB/Thunderbolt Ethernet adapters. Same issue on all of them.

Comment 45 Arcadiy Ivanov 2020-08-10 02:33:21 UTC
There is Dell TB19 firmware available that is installable via fwupdmgr on Linux: https://www.dell.com/support/home/en-bm/drivers/driversdetails?driverid=cwcf9&oscode=rhl80&productcode=dell-wd19tb-dock

Comment 46 Arcadiy Ivanov 2020-08-10 02:34:02 UTC
Install via: sudo fwupdmgr install ~/Downloads/WD19FirmwareUpdateLinux_01.00.14.cab

Comment 47 Alex Grönholm 2020-08-10 07:54:23 UTC
Thanks for the info (I own a WD19TB dock too) but that hardly helps with the TB16 problem. The WD19 series docks have working USB controllers, unlike TB16.

Comment 48 Arcadiy Ivanov 2020-08-11 02:17:30 UTC
(In reply to Alex Grönholm from comment #47)
> Thanks for the info (I own a WD19TB dock too) but that hardly helps with the
> TB16 problem. The WD19 series docks have working USB controllers, unlike
> TB16.

It was a long night and TB16 looked like WD19 to me :) 

That said, I experience exactly the same issues with Dell Precision 5540 + WD19 as in comment42 and comment44.


Note You need to log in before you can comment on or make changes to this bug.