RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1463368 - [GM206] DRM: EVO timeout
Summary: [GM206] DRM: EVO timeout
Keywords:
Status: CLOSED WORKSFORME
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: xorg-x11-drv-nouveau
Version: 7.4
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: rc
: ---
Assignee: Ben Skeggs
QA Contact: Desktop QE
Mark Flitter
URL:
Whiteboard:
: 1260740 1373523 1373527 (view as bug list)
Depends On:
Blocks: 1260740
TreeView+ depends on / blocked
 
Reported: 2017-06-20 16:16 UTC by Tomas Pelka
Modified: 2017-11-14 12:35 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Issues with Nouveau when using multiple displays on nVidia GM20x hardware Some display combinations connected to nVidia GM20x series devices are subject to many errors, observable in `dmesg` and including the message: `DRM: EVO timeout`. This problem is resolved upstream and a backport is under investigation.
Clone Of:
Environment:
Last Closed: 2017-08-02 16:52:46 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
dmesg (705.34 KB, text/plain)
2017-06-20 16:16 UTC, Tomas Pelka
no flags Details
dmesg 2 (68.33 KB, text/plain)
2017-08-02 15:15 UTC, Tomas Hudziec
no flags Details

Description Tomas Pelka 2017-06-20 16:16:43 UTC
Created attachment 1289740 [details]
dmesg

Description of problem:
KMS seems to be broken on GM206, see attached dmesg 

Version-Release number of selected component (if applicable):
kernel-3.10.0-680.el7.x86_64
xorg-x11-server-Xorg-1.19.3-7.el7.x86_64
xorg-x11-drv-nouveau-1.0.13-2.el7.x86_64

How reproducible:
100%

Steps to Reproduce:
1. boot machine with GM206
2.
3.

Actual results:
kernel stuck on lot of messages, see dmesg

Expected results:
machine should enter gdm and further

Additional info:
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GM206 [GeForce GTX 960] [10de:1401] (rev a1)

Comment 1 Tomas Pelka 2017-06-20 16:26:02 UTC
Seems that the machine at least boot when all outputs except DVI are disconnected.

Dmesh output looks the same, but at least it boots into gdm.

Comment 2 Ben Skeggs 2017-06-21 00:36:48 UTC
I *believe* (hard to say for sure with the trimmed log), that this is an issue that effects certain combinations of displays on GM20x and higher GPUs.  NVIDIA basically split something that was a single hardware concept into two different ones, and added a crossbar between them.

Nouveau in 7.4 doesn't deal with the routing and keeps the two identity-mapped, which can lead to overlapping use of the same hardware blocks with some display combinations.

I recently reworked the display part of Nouveau to handle this, which is a *very* invasive change, and only made it for kernel 4.12.  I suspect, at this late stage, we may have to document this issue as known and delay until 7.5?

Comment 3 Tomas Pelka 2017-06-21 07:15:50 UTC
Sounds fair Ben, thanks.

It might be related to bz1463497.

Comment 8 Tomas Pelka 2017-07-27 12:26:14 UTC
Tomas would you please check this particular card on the machine with updated bios/firmware.

Thanks
-Tom

Comment 9 Tomas Pelka 2017-07-27 12:26:49 UTC
*** Bug 1373527 has been marked as a duplicate of this bug. ***

Comment 10 Tomas Pelka 2017-07-27 12:26:51 UTC
*** Bug 1373523 has been marked as a duplicate of this bug. ***

Comment 11 Tomas Pelka 2017-07-27 12:27:13 UTC
*** Bug 1260740 has been marked as a duplicate of this bug. ***

Comment 12 Tomas Hudziec 2017-08-02 15:15:51 UTC
Created attachment 1308295 [details]
dmesg 2

Machine enters gdm and it is possible to login.
Tested with following outputs used:
- only DVI
- 1xDP
- 2xDP
Dmesg look all right.

kernel-3.10.0-693.el7.x86_64
xorg-x11-server-Xorg-1.19.3-11.el7.x86_64
xorg-x11-drv-nouveau-1.0.13-3.el7.x86_64

Comment 13 Tomas Pelka 2017-08-02 16:52:46 UTC
Seems this is consequence of bios update that Tomas did on our old test machine, closing.


Note You need to log in before you can comment on or make changes to this bug.