Bug 569074
Summary: | nouveau NULL pointer after resume | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Mike Snitzer <msnitzer> | ||||||
Component: | kernel | Assignee: | Ben Skeggs <bskeggs> | ||||||
Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||||
Severity: | medium | Docs Contact: | |||||||
Priority: | low | ||||||||
Version: | 13 | CC: | airlied, anton, dougsland, gansalmon, itamar, jglisse, jonathan, kernel-maint | ||||||
Target Milestone: | --- | ||||||||
Target Release: | --- | ||||||||
Hardware: | All | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | kernel-2.6.34.6-47.fc13 | Doc Type: | Bug Fix | ||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | |||||||||
: | 601002 (view as bug list) | Environment: | |||||||
Last Closed: | 2010-08-30 18:23:42 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Mike Snitzer
2010-02-27 22:35:34 UTC
Created attachment 396803 [details]
dmesg from y550 that shows NULL pointer after resume
Created attachment 416601 [details]
kernel log showing same NULL pointer with 2.6.33.4-95.fc13.x86_64
Still happens with latest F13 kernel: 2.6.33.4-95.fc13.x86_64
Happens with 2.6.33.5-112.fc13.x86_64 too: BUG: unable to handle kernel NULL pointer dereference at 0000000000000028 IP: [<ffffffffa006f71e>] ttm_bo_pci_offset+0x5d/0x7d [ttm] ... Other bug reports for this same issue: https://bugs.freedesktop.org/show_bug.cgi?id=26521 https://bugs.freedesktop.org/show_bug.cgi?id=27574 https://bugzilla.kernel.org/show_bug.cgi?id=15120 Cc'ing Jerome given his recent ttm rework that is now upstream in 2.6.35. The following commits remove ttm_bo_pci_offset entirely: http://git.kernel.org/linus/0c321c7962718 http://git.kernel.org/linus/82c5da6bf8b55 I'll see if I can test these bits (via rawhide or applying by hand) relative to this bug. (In reply to comment #5) > Cc'ing Jerome given his recent ttm rework that is now upstream in 2.6.35. The > following commits remove ttm_bo_pci_offset entirely: > http://git.kernel.org/linus/0c321c7962718 > http://git.kernel.org/linus/82c5da6bf8b55 > > I'll see if I can test these bits (via rawhide or applying by hand) relative to > this bug. Good news is rawhide's 2.6.34-20.fc14.x86_64 (which I understand has 2.6.35-rc1's drm) no longer hits a NULL pointer in TTM. Bad news is the display still stays blank after resume. Here are the details from the log: kernel: PM: Syncing filesystems ... done. kernel: Freezing user space processes ... (elapsed 0.01 seconds) done. kernel: Freezing remaining freezable tasks ... (elapsed 0.01 seconds) done. kernel: Suspending console(s) (use no_console_suspend to debug) kernel: sd 0:0:0:0: [sda] Synchronizing SCSI cache kernel: sd 0:0:0:0: [sda] Stopping disk kernel: [drm] nouveau 0000:01:00.0: Disabling fbcon acceleration... kernel: [drm] nouveau 0000:01:00.0: Unpinning framebuffer(s)... kernel: [drm] nouveau 0000:01:00.0: Evicting buffers... kernel: pci 0000:00:1f.3: PCI INT C disabled kernel: ehci_hcd 0000:00:1d.7: PCI INT A disabled kernel: uhci_hcd 0000:00:1d.2: PCI INT C disabled kernel: uhci_hcd 0000:00:1d.1: PCI INT B disabled kernel: uhci_hcd 0000:00:1d.0: PCI INT A disabled kernel: ehci_hcd 0000:00:1a.7: PCI INT C disabled kernel: uhci_hcd 0000:00:1a.2: PCI INT C disabled kernel: uhci_hcd 0000:00:1a.1: PCI INT B disabled kernel: uhci_hcd 0000:00:1a.0: PCI INT A disabled kernel: [drm] nouveau 0000:01:00.0: Idling channels... kernel: [drm] nouveau 0000:01:00.0: Suspending GPU objects... kernel: HDA Intel 0000:00:1b.0: PCI INT A disabled kernel: HDA Intel 0000:00:1b.0: power state changed by ACPI to D3 kernel: HDA Intel 0000:01:00.1: PCI INT A disabled kernel: [drm] nouveau 0000:01:00.0: And we're gone! kernel: nouveau 0000:01:00.0: PCI INT A disabled kernel: nouveau 0000:01:00.0: power state changed by ACPI to D3 kernel: PM: suspend of devices complete after 854.583 msecs kernel: PM: late suspend of devices complete after 22.012 msecs ... kernel: nouveau 0000:01:00.0: power state changed by ACPI to D0 kernel: nouveau 0000:01:00.0: power state changed by ACPI to D0 kernel: nouveau 0000:01:00.0: power state changed by ACPI to D0 kernel: nouveau 0000:01:00.0: power state changed by ACPI to D0 kernel: nouveau 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16 kernel: [drm] nouveau 0000:01:00.0: POSTing device... kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table 0 at offset 0xD359 kernel: sd 0:0:0:0: [sda] Starting disk kernel: [drm] nouveau 0000:01:00.0: 0xD62F: Failed parsing init table opcode: INIT_ZM_I2C_BYTE -6 kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table 1 at offset 0xD8A4 kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table 2 at offset 0xE3D3 kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table 3 at offset 0xE408 kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table 4 at offset 0xE59C kernel: [drm] nouveau 0000:01:00.0: Parsing VBIOS init table at offset 0xE601 kernel: [drm] nouveau 0000:01:00.0: 0xBC89: parsing output script 0 kernel: [drm] nouveau 0000:01:00.0: 0xC078: parsing output script 0 kernel: [drm] nouveau 0000:01:00.0: Reinitialising engines... kernel: [drm] nouveau 0000:01:00.0: Restoring GPU objects... kernel: usb 2-3: reset high speed USB device using ehci_hcd and address 2 kernel: ata5: SATA link down (SStatus 0 SControl 300) kernel: ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300) kernel: ata2.00: configured for UDMA/133 kernel: [drm] nouveau 0000:01:00.0: Restoring mode... kernel: [drm] nouveau 0000:01:00.0: 0xBC8D: parsing output script 1 kernel: [drm] nouveau 0000:01:00.0: 0xBB00: parsing clock script 0 kernel: [drm] nouveau 0000:01:00.0: 0xBC84: parsing clock script 1 kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300) kernel: ata1.00: configured for UDMA/133 kernel: PM: resume of devices complete after 3247.071 msecs kernel: Restarting tasks ... done. kernel: video LNXVIDEO:00: Restoring backlight state (In reply to comment #6) > (In reply to comment #5) > > Bad news is the display still stays blank after resume. Here are the details > from the log: > > kernel: [drm] nouveau 0000:01:00.0: 0xD62F: Failed parsing init table opcode: > INIT_ZM_I2C_BYTE -6 This is likely the reason why. I'd say there's more setup for your card in that init table that we're skipping because INIT_ZM_I2C_BYTE fails. Can you file a new bug report against rawhide to track this issue please. It'd be great if you could include your dmesg output after a suspend/resume with "drm.debug=14 log_buf_len=1M" appended to your boot options, as well as a vbios image (I may want vbtracetool traces later too, but we'll discuss that when you open a new bug). Thanks! (In reply to comment #6) > (In reply to comment #5) > > Cc'ing Jerome given his recent ttm rework that is now upstream in 2.6.35. The > > following commits remove ttm_bo_pci_offset entirely: > > http://git.kernel.org/linus/0c321c7962718 > > http://git.kernel.org/linus/82c5da6bf8b55 > > > > I'll see if I can test these bits (via rawhide or applying by hand) relative to > > this bug. > > Good news is rawhide's 2.6.34-20.fc14.x86_64 (which I understand has > 2.6.35-rc1's drm) no longer hits a NULL pointer in TTM. Unfortunately, it seems a comparable NULL pointer still exists w/ 2.6.34-20.fc14.x86_64, see: https://bugzilla.redhat.com/show_bug.cgi?id=601002#c4 But it appears harder to hit (rather than always hitting it on resume with f13's kernel(s): 1 out of 3 resumes hit it with rawhide's 2.6.34-20.fc14.x86_64). Mike, can you give the kernel at http://koji.fedoraproject.org/koji/buildinfo?buildID=183346 a try and see if there's any change? kernel-2.6.34.2-34.fc13 has been submitted as an update for Fedora 13. http://admin.fedoraproject.org/updates/kernel-2.6.34.2-34.fc13 kernel-2.6.34.2-34.fc13 has been pushed to the Fedora 13 testing repository. If problems still persist, please make note of it in this bug report. If you want to test the update, you can install it with su -c 'yum --enablerepo=updates-testing update kernel'. You can provide feedback for this update here: http://admin.fedoraproject.org/updates/kernel-2.6.34.2-34.fc13 kernel-2.6.34.3-37.fc13 has been submitted as an update for Fedora 13. http://admin.fedoraproject.org/updates/kernel-2.6.34.3-37.fc13 kernel-2.6.34.3-37.fc13 has been pushed to the Fedora 13 testing repository. If problems still persist, please make note of it in this bug report. If you want to test the update, you can install it with su -c 'yum --enablerepo=updates-testing update kernel'. You can provide feedback for this update here: http://admin.fedoraproject.org/updates/kernel-2.6.34.3-37.fc13 2.6.34 kernel has been withdrawn. kernel-2.6.34.6-47.fc13 has been submitted as an update for Fedora 13. https://admin.fedoraproject.org/updates/kernel-2.6.34.6-47.fc13 kernel-2.6.34.6-47.fc13 has been pushed to the Fedora 13 stable repository. If problems still persist, please make note of it in this bug report. |