Bug 1474686

Summary: amdgpu fails to load firmware on RX470
Product: [Fedora] Fedora Reporter: Bernhard Schuster <redhat.bugzilla>
Component: xorg-x11-drv-amdgpuAssignee: Christopher Atherton <athertoncj>
Status: CLOSED CANTFIX QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 26CC: athertoncj, germano.massullo, ml052804
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-08-03 06:08:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Bernhard Schuster 2017-07-25 07:20:37 UTC
Description of problem:
amdgpu fails to

Version-Release number of selected component (if applicable):
#dnf info xorg-x11-drv-amdgpu
Name         : xorg-x11-drv-amdgpu
Version      : 1.3.0
Release      : 1.fc26

#uname -a
Linux compute.ahoi.io 4.11.11-300.fc26.x86_64 #1 SMP Mon Jul 17 16:32:11 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

How reproducible:
Always

Steps to Reproduce:
1. upgrade from f25 to f26
2. boot
3. gnome-shell "Something went wrong" screen greets me


Actual results:

amgpu fails to load firmware:


Jul 25 09:00:47 compute.ahoi.io kernel: [drm] BIOS signature incorrect 0 0
Jul 25 09:00:47 compute.ahoi.io kernel: amdgpu 0000:01:00.0: Invalid PCI ROM header signature: expecting 0xaa55, got 0xffff
Jul 25 09:00:47 compute.ahoi.io kernel: ATOM BIOS: E347
Jul 25 09:00:47 compute.ahoi.io kernel: [drm] GPU post is not needed
Jul 25 09:00:47 compute.ahoi.io kernel: amdgpu 0000:01:00.0: Direct firmware load for amdgpu/polaris10_mc.bin failed with error -2
Jul 25 09:00:47 compute.ahoi.io kernel: mc: Failed to load firmware "amdgpu/polaris10_mc.bin"
Jul 25 09:00:47 compute.ahoi.io kernel: [drm:gmc_v8_0_sw_init [amdgpu]] *ERROR* Failed to load mc firmware!
Jul 25 09:00:47 compute.ahoi.io kernel: [drm:amdgpu_device_init [amdgpu]] *ERROR* sw_init of IP block <gmc_v8_0> failed -2
Jul 25 09:00:47 compute.ahoi.io kernel: amdgpu 0000:01:00.0: amdgpu_init failed
Jul 25 09:00:47 compute.ahoi.io kernel: amdgpu 0000:01:00.0: Fatal error during GPU init
Jul 25 09:00:47 compute.ahoi.io kernel: [drm] amdgpu: finishing device.
Jul 25 09:00:47 compute.ahoi.io kernel: [TTM] Memory type 2 has not been initialized




# lsmod            
Module                  Size  Used by
xt_CHECKSUM            16384  1
ipt_MASQUERADE         16384  3
nf_nat_masquerade_ipv4    16384  1 ipt_MASQUERADE
tun                    28672  1
nf_conntrack_netbios_ns    16384  1
nf_conntrack_broadcast    16384  1 nf_conntrack_netbios_ns
xt_CT                  16384  1
ip6t_rpfilter          16384  1
ip6t_REJECT            16384  2
nf_reject_ipv6         16384  1 ip6t_REJECT
xt_conntrack           16384  20
ip_set                 36864  0
nfnetlink              16384  1 ip_set
ebtable_nat            16384  1
ebtable_broute         16384  1
bridge                143360  1 ebtable_broute
stp                    16384  1 bridge
llc                    16384  2 bridge,stp
ip6table_nat           16384  1
nf_conntrack_ipv6      20480  11
nf_defrag_ipv6         36864  1 nf_conntrack_ipv6
nf_nat_ipv6            16384  1 ip6table_nat
ip6table_mangle        16384  1
ip6table_raw           16384  1
ip6table_security      16384  1
iptable_nat            16384  1
nf_conntrack_ipv4      16384  15
nf_defrag_ipv4         16384  1 nf_conntrack_ipv4
nf_nat_ipv4            16384  1 iptable_nat
nf_nat                 28672  3 nf_nat_ipv6,nf_nat_masquerade_ipv4,nf_nat_ipv4
nf_conntrack          131072  11 nf_conntrack_ipv6,nf_conntrack_ipv4,ipt_MASQUERADE,nf_conntrack_broadcast,nf_conntrack_netbios_ns,xt_CT,nf_nat_ipv6,nf_nat_masquerade_ipv4,xt_conntrack,nf_nat_ipv4,nf_nat
libcrc32c              16384  2 nf_conntrack,nf_nat
iptable_mangle         16384  1
iptable_raw            16384  1
iptable_security       16384  1
ebtable_filter         16384  1
ebtables               32768  3 ebtable_filter,ebtable_nat,ebtable_broute
ip6table_filter        16384  1
ip6_tables             28672  5 ip6table_mangle,ip6table_filter,ip6table_security,ip6table_raw,ip6table_nat
sunrpc                331776  1
vfat                   20480  1
fat                    69632  1 vfat
intel_rapl             20480  0
eeepc_wmi              16384  0
asus_wmi               28672  1 eeepc_wmi
x86_pkg_temp_thermal    16384  0
iTCO_wdt               16384  0
sparse_keymap          16384  1 asus_wmi
intel_powerclamp       16384  0
rfkill                 28672  2 asus_wmi
iTCO_vendor_support    16384  1 iTCO_wdt
coretemp               16384  0
mxm_wmi                16384  0
kvm_intel             200704  0
kvm                   598016  1 kvm_intel
snd_hda_codec_realtek    90112  1
snd_hda_codec_generic    73728  1 snd_hda_codec_realtek
snd_hda_codec_hdmi     49152  1
irqbypass              16384  1 kvm
intel_cstate           20480  0
snd_hda_intel          40960  2
intel_uncore          118784  0
snd_hda_codec         126976  4 snd_hda_intel,snd_hda_codec_hdmi,snd_hda_codec_generic,snd_hda_codec_realtek
intel_rapl_perf        16384  0
snd_hda_core           81920  5 snd_hda_intel,snd_hda_codec,snd_hda_codec_hdmi,snd_hda_codec_generic,snd_hda_codec_realtek
snd_hwdep              16384  1 snd_hda_codec
snd_seq                65536  0
snd_seq_device         16384  1 snd_seq
snd_pcm               102400  4 snd_hda_intel,snd_hda_codec,snd_hda_core,snd_hda_codec_hdmi
joydev                 20480  0
snd_timer              32768  2 snd_seq,snd_pcm
snd                    77824  14 snd_hda_intel,snd_hwdep,snd_seq,snd_hda_codec,snd_timer,snd_hda_codec_hdmi,snd_hda_codec_generic,snd_seq_device,snd_hda_codec_realtek,snd_pcm
mei_me                 40960  0
i2c_i801               24576  0
soundcore              16384  1 snd
mei                   102400  1 mei_me
lpc_ich                24576  0
video                  40960  1 asus_wmi
tpm_infineon           20480  0
tpm_tis                16384  0
tpm_tis_core           20480  1 tpm_tis
tpm                    49152  3 tpm_infineon,tpm_tis,tpm_tis_core
shpchp                 36864  0
soc_button_array       16384  0
wmi                    20480  2 asus_wmi,mxm_wmi
btrfs                1081344  2
xor                    24576  1 btrfs
raid6_pq              114688  1 btrfs
dm_crypt               28672  3
amdgpu               1159168  0
crct10dif_pclmul       16384  0
crc32_pclmul           16384  0
i2c_algo_bit           16384  1 amdgpu
crc32c_intel           24576  2
drm_kms_helper        151552  1 amdgpu
ghash_clmulni_intel    16384  0
e1000e                245760  0
ttm                    94208  1 amdgpu
drm                   348160  3 amdgpu,ttm,drm_kms_helper
uas                    24576  0
ptp                    20480  1 e1000e
usb_storage            69632  1 uas
pps_core               16384  1 ptp
hid_cherry             16384  0



Expected results:
Normal bootup

Additional info:

radeon is blacklisted to prevent any interference, there is no Xorg.conf.d/* which configures anything beyond keyboard.
Xorg.0.log does not even recognize amdgpu as a possible output.

Comment 1 Bernhard Schuster 2017-07-25 07:45:56 UTC
lspci line:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/580] (rev cf)

Comment 2 Bernhard Schuster 2017-08-03 06:08:33 UTC
A fresh install fixed this, could have been the fact that amdgpu-pro libs were installed which are not compatible with newer kernels anymore.