Bug 728204

Summary: nvidia driver Oopsed after resuming from suspend in nv04_dfp_update_fp_control
Product: [Fedora] Fedora Reporter: Tobias Mueller <fedora-bugs>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED WONTFIX QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 14CC: bskeggs, gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda
Target Milestone: ---Keywords: Reopened
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-08-16 21:28:05 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Tobias Mueller 2011-08-04 11:27:51 UTC
Description of problem:
I suspended my machine and resumed it. The monitors went for a lovely striped white backgroup with an occasional red or blue stripe in between. I couldn't switch VTs. But I could SSH into the machine.

I tried to pm-suspend again but it then froze to death.

It's the standard F14 kernel:
Name        : kernel
Arch        : x86_64
Version     : 2.6.35.13
Release     : 92.fc14


[266030.572731] [drm] nouveau 0000:01:00.0: 0xC313: Parsing digital output script table
[266030.620074] firewire_sbp2: fw1.0: logged in to LUN 0000 (0 retries)
[266030.623054] [drm] nouveau 0000:01:00.0: Setting dpms mode 0 on tmds encoder (output 2)
[266030.623073] BUG: unable to handle kernel NULL pointer dereference at 0000000000000138
[266030.623078] IP: [<ffffffffa00c8f34>] nv04_dfp_update_fp_control+0x2c/0x15a [nouveau]
[266030.623099] PGD 216202067 PUD 2161a2067 PMD 0 
[266030.623103] Oops: 0000 [#1] SMP 
[266030.623106] last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:05:03.0/fw1/fw1.0/uevent
[266030.623109] CPU 0 
[266030.623110] Modules linked in: tcp_lp fuse ebtable_nat ebtables ipt_MASQUERADE iptable_nat nf_nat bridge stp llc sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf nf_conntrack_tftp nf_conntrack_netbios_ns ip6t_REJECT ip6t_ipv6header nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 ext2 uinput snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc iTCO_wdt asus_atk0110 iTCO_vendor_support i2c_i801 joydev atl1e microcode xfs exportfs cryptd aes_x86_64 aes_generic xts gf128mul dm_crypt firewire_sbp2 pata_acpi ata_generic usb_storage firewire_ohci firewire_core pata_jmicron crc_itu_t nouveau ttm drm_kms_helper drm i2c_algo_bit video output i2c_core [last unloaded: scsi_wait_scan]
[266030.623154] 
[266030.623157] Pid: 1960, comm: Xorg Tainted: G        W   2.6.35.13-92.fc14.x86_64 #1 P5QL-E/P5QL-E
[266030.623160] RIP: 0010:[<ffffffffa00c8f34>]  [<ffffffffa00c8f34>] nv04_dfp_update_fp_control+0x2c/0x15a [nouveau]
[266030.623175] RSP: 0018:ffff8802161e79a8  EFLAGS: 00010246
[266030.623177] RAX: 0000000000000000 RBX: ffff88022200a000 RCX: 000000000000acc9
[266030.623180] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff88022200a000
[266030.623182] RBP: ffff8802161e79e8 R08: 0000000000000002 R09: 00000000fffffffe
[266030.623184] R10: ffff8802961e7917 R11: 0000000000000000 R12: ffff880222fa8800
[266030.623186] R13: ffff88022200a000 R14: ffff880222080000 R15: ffff880223033800
[266030.623189] FS:  00007fd057fe0860(0000) GS:ffff880002000000(0000) knlGS:0000000000000000
[266030.623192] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[266030.623194] CR2: 0000000000000138 CR3: 0000000216281000 CR4: 00000000000006f0
[266030.623196] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[266030.623198] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[266030.623201] Process Xorg (pid: 1960, threadinfo ffff8802161e6000, task ffff880225afdd00)
[266030.623203] Stack:
[266030.623204]  ffff8802161e79b8 ffff880222090e88 ffff8802161e7a08 ffff88022200a000
[266030.623208] <0> 0000000000000000 0000000000000540 ffff880222080000 ffff880223033800
[266030.623211] <0> ffff8802161e7a08 ffffffffa00c9248 ffff88022200a000 ffff880222fa8800
[266030.623215] Call Trace:
[266030.623231]  [<ffffffffa00c9248>] nv04_tmds_dpms+0x5f/0x64 [nouveau]
[266030.623246]  [<ffffffffa00c8925>] nv04_dfp_commit+0xf6/0x16b [nouveau]
[266030.623253]  [<ffffffffa006797d>] drm_crtc_helper_set_mode+0x2cc/0x356 [drm_kms_helper]
[266030.623259]  [<ffffffffa00683d8>] drm_crtc_helper_set_config+0x5a6/0x74e [drm_kms_helper]
[266030.623266]  [<ffffffff8103c1bd>] ? should_resched+0xe/0x2e
[266030.623270]  [<ffffffff81469601>] ? io_schedule+0x1c/0x63
[266030.623284]  [<ffffffffa0037528>] drm_mode_setcrtc+0x2cb/0x301 [drm]
[266030.623299]  [<ffffffffa00cbce3>] ? nv_crtc_gamma_set+0x62/0x64 [nouveau]
[266030.623308]  [<ffffffffa002c2b8>] drm_ioctl+0x28b/0x389 [drm]
[266030.623318]  [<ffffffffa003725d>] ? drm_mode_setcrtc+0x0/0x301 [drm]
[266030.623323]  [<ffffffff810e9a81>] ? pmd_offset+0x19/0x40
[266030.623327]  [<ffffffff81124877>] vfs_ioctl+0x36/0xa7
[266030.623330]  [<ffffffff811251d8>] do_vfs_ioctl+0x468/0x49b
[266030.623332]  [<ffffffff81125261>] sys_ioctl+0x56/0x79
[266030.623336]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
[266030.623338] Code: 48 89 e5 41 57 41 56 41 55 41 54 53 48 83 ec 18 0f 1f 44 00 00 4c 8b 27 85 f6 49 89 fd 4d 8b b4 24 98 03 00 00 75 73 48 8b 47 30 <8b> 98 38 01 00 00 48 63 d3 48 69 d2 40 05 00 00 49 8d 8c 16 a0 
[266030.623366] RIP  [<ffffffffa00c8f34>] nv04_dfp_update_fp_control+0x2c/0x15a [nouveau]
[266030.623381]  RSP <ffff8802161e79a8>
[266030.623382] CR2: 0000000000000138
[266030.623385] ---[ end trace 9e919dc4ec9bba4f ]---

Comment 1 Dave Jones 2011-08-04 17:46:39 UTC
this is in nvidia's code, we can't fix that.

It might be getting confused because you also have nouveau loaded, but no idea really.

Comment 2 Dave Jones 2011-08-04 17:47:54 UTC
wait, duh, this IS nouveau. My bad.

Comment 3 Ben Skeggs 2011-08-04 23:16:22 UTC
Any chance you could grab the latest F15 kernel from koji and see if this still happens there?

Comment 4 Tobias Mueller 2011-08-04 23:30:24 UTC
Well. I don't think I can reproduce it at all. But I surely can install a new kernel package if you give me instructions on how to do so properly.

Comment 5 Fedora End Of Life 2012-08-16 21:28:07 UTC
This message is a notice that Fedora 14 is now at end of life. Fedora 
has stopped maintaining and issuing updates for Fedora 14. It is 
Fedora's policy to close all bug reports from releases that are no 
longer maintained.  At this time, all open bugs with a Fedora 'version'
of '14' have been closed as WONTFIX.

(Please note: Our normal process is to give advanced warning of this 
occurring, but we forgot to do that. A thousand apologies.)

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, feel free to reopen 
this bug and simply change the 'version' to a later Fedora version.

Bug Reporter: Thank you for reporting this issue and we are sorry that 
we were unable to fix it before Fedora 14 reached end of life. If you 
would still like to see this bug fixed and are able to reproduce it 
against a later version of Fedora, you are encouraged to click on 
"Clone This Bug" (top right of this page) and open it against that 
version of Fedora.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events.  Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

The process we are following is described here: 
http://fedoraproject.org/wiki/BugZappers/HouseKeeping