Bug 720476

Summary: ehci_hcd 0000:00:1d.0: fatal error; HC died; cleaning up
Product: [Fedora] Fedora Reporter: Sergio Basto <sergio>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED CURRENTRELEASE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 15CC: aquini, gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-07-14 16:15:15 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Sergio Basto 2011-07-11 18:24:01 UTC
Description of problem:

recently, After suspend and resume, sometimes when plugging any usb device like mouse, my USB stack seems to collapse. All USB devices including the mouse & keyboard stop working, the 
syslog says:
Jul 11 18:34:34 segulix kernel: [136231.404389] usb 2-1.7: new full speed USB device using ehci_hcd and address 13
Jul 11 18:42:15 segulix kernel: [136692.027844] usb 2-1.3: new high speed USB device using ehci_hcd and address 14
Jul 11 18:42:15 segulix kernel: [136692.106587] ehci_hcd 0000:00:1d.0: fatal error
Jul 11 18:42:15 segulix kernel: [136692.110476] ehci_hcd 0000:00:1d.0: HC died; cleaning up
Jul 11 18:42:15 segulix kernel: [136692.113383] ehci_hcd 0000:00:1d.0: force halt; handshake ffffc90005092024 00004000 00004000 -> -110
Jul 11 18:42:15 segulix kernel: [136692.113389] ehci_hcd 0000:00:1d.0: HC died; cleaning up


Version-Release number of selected component (if applicable): 

After Kernel-2.6.38.6-27, I think it starts at 2.6.38.7-30.fc15.x86_64,
I am running 2.6.38.8-32.fc15.x86_64

How reproducible:
After resume computer of a suspend , plug use mouse or 3G pen 


  
Actual results:
Jul 11 18:34:33 segulix kernel: [136225.417672] Suspending console(s) (use no_console_suspend to debug)
Jul 11 18:34:33 segulix kernel: [136225.418370] sd 0:0:0:0: [sda] Synchronizing SCSI cache
Jul 11 18:34:33 segulix kernel: [136225.436251] sdhci-pci 0000:03:00.1: PCI INT B disabled
Jul 11 18:34:33 segulix kernel: [136225.436335] pci 0000:00:1f.3: PCI INT C disabled
Jul 11 18:34:33 segulix kernel: [136225.450317] ehci_hcd 0000:00:1d.0: PCI INT A disabled
Jul 11 18:34:33 segulix kernel: [136225.452397] sd 0:0:0:0: [sda] Stopping disk
Jul 11 18:34:33 segulix kernel: [136225.497236] ehci_hcd 0000:00:1a.0: PCI INT A disabled
Jul 11 18:34:33 segulix kernel: [136225.542467] e1000e 0000:00:19.0: PCI INT A disabled
Jul 11 18:34:33 segulix kernel: [136225.542481] e1000e 0000:00:19.0: wake-up capability enabled by ACPI
Jul 11 18:34:33 segulix kernel: [136225.727030] dell-wmi: Received unknown WMI event (0x11)
Jul 11 18:34:33 segulix kernel: [136225.740322] HDA Intel 0000:00:1b.0: PCI INT A disabled
--
Jul 11 18:34:33 segulix kernel: [136226.829973] CPU3 is up
Jul 11 18:34:33 segulix kernel: [136226.833274] ACPI: Waking up from system sleep state S3
Jul 11 18:34:33 segulix kernel: [136226.990895] PM: early resume of devices complete after 2.108 msecs
Jul 11 18:34:33 segulix kernel: [136226.991203] e1000e 0000:00:19.0: wake-up capability disabled by ACPI
Jul 11 18:34:33 segulix kernel: [136226.991257] ehci_hcd 0000:00:1a.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 11 18:34:33 segulix kernel: [136226.991277] HDA Intel 0000:00:1b.0: PCI INT A -> GSI 22 (level, low) -> IRQ 22
Jul 11 18:34:33 segulix kernel: [136226.991370] ehci_hcd 0000:00:1d.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
Jul 11 18:34:33 segulix kernel: [136226.991566] pci 0000:00:1f.3: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Jul 11 18:34:33 segulix kernel: [136226.991597] iwlagn 0000:02:00.0: RF_KILL bit toggled to disable radio.
Jul 11 18:34:33 segulix kernel: [136226.991605] sdhci-pci 0000:03:00.1: PCI INT B -> GSI 19 (level, low) -> IRQ 19
Jul 11 18:34:33 segulix kernel: [136226.992017] sd 0:0:0:0: [sda] Starting disk
Jul 11 18:34:33 segulix kernel: [136227.043972] firewire_core: skipped bus generations, destroying all nodes
Jul 11 18:34:33 segulix kernel: [136227.222476] usb 1-1.4: reset high speed USB device using ehci_hcd and address 3
Jul 11 18:34:33 segulix kernel: [136227.296270] ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Jul 11 18:34:33 segulix kernel: [136227.298111] ata2.00: configured for UDMA/100
Jul 11 18:34:33 segulix kernel: [136227.300207] ata5: SATA link down (SStatus 0 SControl 300)
Jul 11 18:34:33 segulix kernel: [136227.302248] ata6: SATA link down (SStatus 0 SControl 300)
--
Jul 11 18:34:34 segulix NetworkManager[916]: <info> (wlan0): now managed
Jul 11 18:34:34 segulix NetworkManager[916]: <info> (wlan0): device state change: unmanaged -> unavailable (reason 'managed') [10 20 2]
Jul 11 18:34:34 segulix NetworkManager[916]: <info> (wlan0): bringing up device.
Jul 11 18:34:34 segulix NetworkManager[916]: <info> (wlan0): deactivating device (reason: 2).
Jul 11 18:34:34 segulix kernel: [136231.404389] usb 2-1.7: new full speed USB device using ehci_hcd and address 13
Jul 11 18:34:34 segulix kernel: [136231.451702] hub 2-1:1.0: unable to enumerate USB device on port 7
Jul 11 18:34:35 segulix kernel: [136232.689101] dell-wmi: Received unknown WMI event (0x11)
Jul 11 18:42:15 segulix kernel: [136692.027844] usb 2-1.3: new high speed USB device using ehci_hcd and address 14
Jul 11 18:42:15 segulix kernel: [136692.106587] ehci_hcd 0000:00:1d.0: fatal error
Jul 11 18:42:15 segulix kernel: [136692.110476] ehci_hcd 0000:00:1d.0: HC died; cleaning up
Jul 11 18:42:15 segulix kernel: [136692.113383] ehci_hcd 0000:00:1d.0: force halt; handshake ffffc90005092024 00004000 00004000 -> -110
Jul 11 18:42:15 segulix kernel: [136692.113389] ehci_hcd 0000:00:1d.0: HC died; cleaning up
Jul 11 18:42:15 segulix kernel: [136692.117420] usb 2-1.3: device descriptor read/all, error -108
Jul 11 18:42:15 segulix kernel: [136692.117427] hub 2-1:1.0: cannot disable port 3 (err = -19)
Jul 11 18:42:15 segulix kernel: [136692.117441] hub 2-1:1.0: cannot reset port 3 (err = -19)
Jul 11 18:42:15 segulix kernel: [136692.117447] hub 2-1:1.0: cannot disable port 3 (err = -19)

Expected results:
ehci_hcd doesn't die 

Additional info:

Seems that happens just in two lastest updates of kernel in Fedora 15.

Comment 1 Sergio Basto 2011-07-11 18:25:15 UTC
A similar report here:
https://lkml.org/lkml/2011/4/24/97

Comment 2 Sergio Basto 2011-07-14 16:15:15 UTC
This is fixed on: kernel 2.6.38.8-35.fc15.x86_64 

I think for Changelog I think it was : 
* Wed Jul 06 2011 Chuck Ebbert <cebbert> 2.6.38.8-35 - Revert SCSI/block patches from 2.6.38.6 that caused more problems