rx6900xt_navi2_amdgpu_5.10.20_ppc64le_4kpages

a guest
Mar 9th, 2021
  1. [ 69.457441] amdgpu: Topology: Add CPU node
  2. [ 69.458707] amdgpu 0001:03:00.0: enabling device (0140 -> 0142)
  3. [ 69.458717] [drm] initializing kernel modesetting (SIENNA_CICHLID 0x1002:0x73BF 0x1DA2:0xE438 0xC0).
  4. [ 69.458720] amdgpu 0001:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
  5. [ 69.458732] [drm] register mmio base: 0x80000000
  6. [ 69.458733] [drm] register mmio size: 1048576
  7. [ 69.458735] [drm] PCI I/O BAR is not found.
  8. [ 69.458744] [drm] PCIE atomic ops is not supported
  9. [ 69.461020] [drm] add ip block number 0 <nv_common>
  10. [ 69.461022] [drm] add ip block number 1 <gmc_v10_0>
  11. [ 69.461023] [drm] add ip block number 2 <navi10_ih>
  12. [ 69.461025] [drm] add ip block number 3 <psp>
  13. [ 69.461026] [drm] add ip block number 4 <smu>
  14. [ 69.461028] [drm] add ip block number 5 <gfx_v10_0>
  15. [ 69.461029] [drm] add ip block number 6 <sdma_v5_2>
  16. [ 69.461031] [drm] add ip block number 7 <vcn_v3_0>
  17. [ 69.461032] [drm] add ip block number 8 <jpeg_v3_0>
  18. [ 69.492308] amdgpu 0001:03:00.0: amdgpu: Fetched VBIOS from ROM BAR
  19. [ 69.492311] amdgpu: ATOM BIOS: 113-E438XTX-UO2
  20. [ 69.492324] [drm] VCN(0) decode is enabled in VM mode
  21. [ 69.492325] [drm] VCN(1) decode is enabled in VM mode
  22. [ 69.492327] [drm] VCN(0) encode is enabled in VM mode
  23. [ 69.492328] [drm] VCN(1) encode is enabled in VM mode
  24. [ 69.492330] [drm] JPEG decode is enabled in VM mode
  25. [ 69.492336] [drm] GPU posting now...
  26. [ 69.492367] amdgpu 0001:03:00.0: amdgpu: HBM ECC is not presented.
  27. [ 69.492370] amdgpu 0001:03:00.0: amdgpu: SRAM ECC is not presented.
  28. [ 69.492374] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
  29. [ 69.492401] amdgpu 0001:03:00.0: BAR 2: releasing [mem 0x6004010000000-0x60040101fffff 64bit pref]
  30. [ 69.492404] amdgpu 0001:03:00.0: BAR 0: releasing [mem 0x6004000000000-0x600400fffffff 64bit pref]
  31. [ 69.492432] pci 0001:02:00.0: BAR 15: releasing [mem 0x6004000000000-0x600403fffffff 64bit pref]
  32. [ 69.492435] pci 0001:01:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  33. [ 69.492438] pci 0001:00:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  34. [ 69.492447] pci 0001:00:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  35. [ 69.492451] pci 0001:01:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  36. [ 69.492454] pci 0001:02:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  37. [ 69.492458] amdgpu 0001:03:00.0: BAR 0: assigned [mem 0x6004000000000-0x60043ffffffff 64bit pref]
  38. [ 69.492467] amdgpu 0001:03:00.0: BAR 2: assigned [mem 0x6004400000000-0x60044001fffff 64bit pref]
  39. [ 69.492477] pci 0001:00:00.0: PCI bridge to [bus 01-03]
  40. [ 69.492482] pci 0001:00:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  41. [ 69.492485] pci 0001:00:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  42. [ 69.492490] pci 0001:01:00.0: PCI bridge to [bus 02-03]
  43. [ 69.492495] pci 0001:01:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  44. [ 69.492499] pci 0001:01:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  45. [ 69.492504] pci 0001:02:00.0: PCI bridge to [bus 03]
  46. [ 69.492509] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  47. [ 69.492512] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  48. [ 69.492523] amdgpu 0001:03:00.0: amdgpu: VRAM: 16368M 0x0000008000000000 - 0x00000083FEFFFFFF (16368M used)
  49. [ 69.492526] amdgpu 0001:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
  50. [ 69.492529] [drm] Detected VRAM RAM=16368M, BAR=16384M
  51. [ 69.492531] [drm] RAM width 256bits GDDR6
  52. [ 69.492572] [drm] amdgpu: 16368M of VRAM memory ready
  53. [ 69.492577] [drm] amdgpu: 16368M of GTT memory ready.
  54. [ 69.492585] [drm] GART: num cpu pages 131072, num gpu pages 131072
  55. [ 69.499431] [drm] PCIE GART of 512M enabled (table at 0x0000008000000000).
  56. [ 69.500569] EEH: Recovering PHB#1-PE#0
  57. [ 69.500574] EEH: PE location: UOPWR.D100020-Node0-SLOT1 PCIE 4.0 X16, PHB location: N/A
  58. [ 69.500576] EEH: Frozen PHB#1-PE#0 detected
  59. [ 69.500578] EEH: Call Trace:
  60. [ 69.500583] EEH: [00000000d9e7d323] __eeh_send_failure_event+0x7c/0x160
  61. [ 69.500588] EEH: [00000000d61ba426] eeh_dev_check_failure.part.0+0x254/0x5e0
  62. [ 69.500693] EEH: [0000000061d1df81] amdgpu_device_rreg+0x180/0x210 [amdgpu]
  63. [ 69.500803] EEH: [00000000ed1fb3ed] gfxhub_v2_1_set_fault_enable_default+0x68/0x150 [amdgpu]
  64. [ 69.500913] EEH: [000000001cce1aab] gmc_v10_0_hw_init+0x198/0x290 [amdgpu]
  65. [ 69.501014] EEH: [0000000009744e54] amdgpu_device_init+0x1a74/0x1fc0 [amdgpu]
  66. [ 69.501110] EEH: [000000005aac3e93] amdgpu_driver_load_kms+0x30/0x520 [amdgpu]
  67. [ 69.501204] EEH: [0000000044cf3143] amdgpu_pci_probe+0x18c/0x340 [amdgpu]
  68. [ 69.501208] EEH: [00000000827393ff] local_pci_probe+0x68/0x110
  69. [ 69.501211] EEH: [00000000e5937af3] work_for_cpu_fn+0x38/0x60
  70. [ 69.501214] EEH: [0000000027a7f486] process_one_work+0x300/0x5d0
  71. [ 69.501217] EEH: [0000000041c5aee3] worker_thread+0x360/0x780
  72. [ 69.501219] EEH: [00000000787f3030] kthread+0x1e4/0x1f0
  73. [ 69.501222] EEH: [0000000021927c95] ret_from_kernel_thread+0x5c/0x6c
  74. [ 69.501224] EEH: This PCI device has failed 1 times in the last hour and will be permanently disabled after 5 failures.
  75. [ 69.501225] EEH: Notify device drivers to shutdown
  76. [ 69.501228] EEH: Beginning: 'error_detected(IO frozen)'
  77. [ 69.516456] [drm] use_doorbell being set to: [true]
  78. [ 69.516536] [drm] use_doorbell being set to: [true]
  79. [ 69.516639] [drm] use_doorbell being set to: [true]
  80. [ 69.516739] [drm] use_doorbell being set to: [true]
  81. [ 69.518119] [drm] Found VCN firmware Version ENC: 1.3 DEC: 2 VEP: 0 Revision: 17
  82. [ 69.518135] [drm] PSP loading VCN firmware
  83. [ 69.784609] [drm:psp_hw_start [amdgpu]] *ERROR* PSP create ring failed!
  84. [ 69.784671] [drm:psp_hw_init [amdgpu]] *ERROR* PSP firmware loading failed
  85. [ 69.784725] [drm:amdgpu_device_fw_loading [amdgpu]] *ERROR* hw_init of IP block <psp> failed -22
  86. [ 69.784727] amdgpu 0001:03:00.0: amdgpu: amdgpu_device_ip_init failed
  87. [ 69.784738] amdgpu 0001:03:00.0: amdgpu: Fatal error during GPU init
  88. [ 69.785890] amdgpu: probe of 0001:03:00.0 failed with error -22
  89. [ 69.785920] PCI 0001:03:00.0#0000: EEH: no driver
  90. [ 69.785923] PCI 0001:03:00.1#0000: EEH: driver not EEH aware
  91. [ 69.785926] EEH: Finished:'error_detected(IO frozen)' with aggregate recovery state:'none'
  92. [ 69.785931] EEH: Collect temporary log
  93. [ 69.785972] EEH: of node=0001:03:00.0
  94. [ 69.785976] EEH: PCI device/vendor: 73bf1002
  95. [ 69.785979] EEH: PCI cmd/status register: 00100542
  96. [ 69.785980] EEH: PCI-E capabilities and status follow:
  97. [ 69.785991] EEH: PCI-E 00: 0012a010 00008fa1 00002930 00440d04
  98. [ 69.786000] EEH: PCI-E 10: 11040040 00000000 00000000 00000000
  99. [ 69.786002] EEH: PCI-E 20: 00000000
  100. [ 69.786003] EEH: PCI-E AER capability register set follows:
  101. [ 69.786014] EEH: PCI-E AER 00: 20020001 00000000 00000000 00462030
  102. [ 69.786023] EEH: PCI-E AER 10: 00000000 00002000 000001f4 40008001
  103. [ 69.786033] EEH: PCI-E AER 20: 0000000f 8007f000 00000000 00000000
  104. [ 69.786036] EEH: PCI-E AER 30: 00000000 00000000
  105. [ 69.786039] EEH: of node=0001:03:00.1
  106. [ 69.786042] EEH: PCI device/vendor: ab281002
  107. [ 69.786045] EEH: PCI cmd/status register: 00100546
  108. [ 69.786046] EEH: PCI-E capabilities and status follow:
  109. [ 69.786057] EEH: PCI-E 00: 0012a010 00008fa1 00002930 00440d04
  110. [ 69.786065] EEH: PCI-E 10: 11040040 00000000 00000000 00000000
  111. [ 69.786067] EEH: PCI-E 20: 00000000
  112. [ 69.786070] EEH: PCI-E AER capability register set follows:
  113. [ 69.786080] EEH: PCI-E AER 00: 2a020001 00000000 00000000 00462030
  114. [ 69.786089] EEH: PCI-E AER 10: 00000000 00002000 000001e0 00000000
  115. [ 69.786097] EEH: PCI-E AER 20: 00000000 00000000 00000000 00000000
  116. [ 69.786101] EEH: PCI-E AER 30: 00000000 00000000
  117. [ 69.786103] PHB4 PHB#1 Diag-data (Version: 1)
  118. [ 69.786105] brdgCtl: 00000002
  119. [ 69.786107] RootSts: 00000020 00402000 a0440008 00100107 00004000
  120. [ 69.786109] RootErrSts: 00000024 00000000 00000000
  121. [ 69.786110] sourceId: 03000000
  122. [ 69.786112] PhbSts: 0000001c00000000 0000001c00000000
  123. [ 69.786114] Lem: 0000000004000000 0000000000000000 0000000004000000
  124. [ 69.786116] PhbErr: 0000080000000000 0000080000000000 2148000098000240 a008400000000000
  125. [ 69.786120] RxeArbErr: 0000000000000020 0000000000000020 4000030000000000 0000000000000000
  126. [ 69.786122] PcieDlp: 0000000000000000 0000000000000000 7000000000000000
  127. [ 69.786126] PE[000] A/B: 8720002503000000 8000000000000000
  128. [ 69.786128] EEH: Reset with hotplug activity
  129. [ 69.930197] snd_hda_intel 0001:03:00.1: CORB reset timeout#2, CORBRP = 65535
  130. [ 70.400246] snd_hda_intel 0001:03:00.1: CORB reset timeout#2, CORBRP = 65535
  131. [ 70.825252] snd_hda_codec_hdmi hdaudioC0D0: Unable to sync register 0x2f0d00. -5
  132. [ 70.825264] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  133. [ 70.825275] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  134. [ 70.825283] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  135. [ 70.825291] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  136. [ 70.825299] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  137. [ 70.825307] snd_hda_codec_hdmi hdaudioC0D0: HDMI ATI/AMD: no speaker allocation for ELD
  138. [ 71.335457] pci 0001:03:00.1: Removing from iommu group 1
  139. [ 71.335661] pci 0001:03:00.0: Removing from iommu group 1
  140. [ 73.513323] EEH: Sleep 5s ahead of complete hotplug
  141. [ 78.547139] pci 0001:03:00.0: [1002:73bf] type 00 class 0x030000
  142. [ 78.547163] pci 0001:03:00.0: reg 0x10: [mem 0x6004000000000-0x600400fffffff 64bit pref]
  143. [ 78.547175] pci 0001:03:00.0: reg 0x18: [mem 0x6004010000000-0x60040101fffff 64bit pref]
  144. [ 78.547184] pci 0001:03:00.0: reg 0x20: [io 0x0000-0x00ff]
  145. [ 78.547191] pci 0001:03:00.0: reg 0x24: [mem 0x600c080000000-0x600c0800fffff]
  146. [ 78.547199] pci 0001:03:00.0: reg 0x30: [mem 0x00000000-0x0001ffff pref]
  147. [ 78.547330] pci 0001:03:00.0: PME# supported from D1 D2 D3hot D3cold
  148. [ 78.547423] pci 0001:03:00.0: 63.012 Gb/s available PCIe bandwidth, limited by 16.0 GT/s PCIe x4 link at 0001:00:00.0 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
  149. [ 78.547495] pci 0001:03:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
  150. [ 78.547991] pci 0001:03:00.1: [1002:ab28] type 00 class 0x040300
  151. [ 78.548006] pci 0001:03:00.1: reg 0x10: [mem 0x600c080120000-0x600c080123fff]
  152. [ 78.548118] pci 0001:03:00.1: PME# supported from D1 D2 D3hot D3cold
  153. [ 78.548638] pci 0001:02:00.0: ASPM: current common clock configuration is inconsistent, reconfiguring
  154. [ 78.548679] pci 0001:02:00.0: BAR 13: no space for [io size 0x1000]
  155. [ 78.548681] pci 0001:02:00.0: BAR 13: failed to assign [io size 0x1000]
  156. [ 78.548686] pci 0001:03:00.0: BAR 0: assigned [mem 0x6004000000000-0x600400fffffff 64bit pref]
  157. [ 78.548696] pci 0001:03:00.0: BAR 2: assigned [mem 0x6004010000000-0x60040101fffff 64bit pref]
  158. [ 78.548706] pci 0001:03:00.0: BAR 5: assigned [mem 0x600c080000000-0x600c0800fffff]
  159. [ 78.548711] pci 0001:03:00.0: BAR 6: assigned [mem 0x600c080100000-0x600c08011ffff pref]
  160. [ 78.548713] pci 0001:03:00.1: BAR 0: assigned [mem 0x600c080120000-0x600c080123fff]
  161. [ 78.548718] pci 0001:03:00.0: BAR 4: no space for [io size 0x0100]
  162. [ 78.548720] pci 0001:03:00.0: BAR 4: failed to assign [io size 0x0100]
  163. [ 78.548724] pci 0001:02:00.0: PCI bridge to [bus 03]
  164. [ 78.548728] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  165. [ 78.548732] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  166. [ 78.548736] PCI: No. 2 try to assign unassigned res
  167. [ 78.548740] pci 0001:02:00.0: BAR 13: no space for [io size 0x1000]
  168. [ 78.548743] pci 0001:02:00.0: BAR 13: failed to assign [io size 0x1000]
  169. [ 78.548745] pci 0001:03:00.0: BAR 4: no space for [io size 0x0100]
  170. [ 78.548748] pci 0001:03:00.0: BAR 4: failed to assign [io size 0x0100]
  171. [ 78.548750] pci 0001:02:00.0: PCI bridge to [bus 03]
  172. [ 78.548755] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  173. [ 78.548758] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  174. [ 78.548770] pci 0001:03:00.0: Added to existing PE#0
  175. [ 78.548776] pci 0001:03:00.0: Adding to iommu group 1
  176. [ 78.548914] amdgpu 0001:03:00.0: enabling device (0140 -> 0142)
  177. [ 78.548921] [drm] initializing kernel modesetting (SIENNA_CICHLID 0x1002:0x73BF 0x1DA2:0xE438 0xC0).
  178. [ 78.548925] amdgpu 0001:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
  179. [ 78.548937] [drm] register mmio base: 0x80000000
  180. [ 78.548939] [drm] register mmio size: 1048576
  181. [ 78.548940] [drm] PCI I/O BAR is not found.
  182. [ 78.548947] [drm] PCIE atomic ops is not supported
  183. [ 78.551169] [drm] add ip block number 0 <nv_common>
  184. [ 78.551171] [drm] add ip block number 1 <gmc_v10_0>
  185. [ 78.551173] [drm] add ip block number 2 <navi10_ih>
  186. [ 78.551174] [drm] add ip block number 3 <psp>
  187. [ 78.551176] [drm] add ip block number 4 <smu>
  188. [ 78.551178] [drm] add ip block number 5 <gfx_v10_0>
  189. [ 78.551180] [drm] add ip block number 6 <sdma_v5_2>
  190. [ 78.551181] [drm] add ip block number 7 <vcn_v3_0>
  191. [ 78.551183] [drm] add ip block number 8 <jpeg_v3_0>
  192. [ 78.582437] amdgpu 0001:03:00.0: amdgpu: Fetched VBIOS from ROM BAR
  193. [ 78.582440] amdgpu: ATOM BIOS: 113-E438XTX-UO2
  194. [ 78.582453] [drm] VCN(0) decode is enabled in VM mode
  195. [ 78.582455] [drm] VCN(1) decode is enabled in VM mode
  196. [ 78.582456] [drm] VCN(0) encode is enabled in VM mode
  197. [ 78.582458] [drm] VCN(1) encode is enabled in VM mode
  198. [ 78.582459] [drm] JPEG decode is enabled in VM mode
  199. [ 78.582489] amdgpu 0001:03:00.0: amdgpu: HBM ECC is not presented.
  200. [ 78.582491] amdgpu 0001:03:00.0: amdgpu: SRAM ECC is not presented.
  201. [ 78.582497] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
  202. [ 78.582522] amdgpu 0001:03:00.0: BAR 2: releasing [mem 0x6004010000000-0x60040101fffff 64bit pref]
  203. [ 78.582525] amdgpu 0001:03:00.0: BAR 0: releasing [mem 0x6004000000000-0x600400fffffff 64bit pref]
  204. [ 78.582552] pci 0001:02:00.0: BAR 15: releasing [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  205. [ 78.582555] pci 0001:01:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  206. [ 78.582558] pci 0001:00:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  207. [ 78.582565] pci 0001:00:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  208. [ 78.582568] pci 0001:01:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  209. [ 78.582571] pci 0001:02:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  210. [ 78.582574] amdgpu 0001:03:00.0: BAR 0: assigned [mem 0x6004000000000-0x60043ffffffff 64bit pref]
  211. [ 78.582584] amdgpu 0001:03:00.0: BAR 2: assigned [mem 0x6004400000000-0x60044001fffff 64bit pref]
  212. [ 78.582593] pci 0001:00:00.0: PCI bridge to [bus 01-03]
  213. [ 78.582597] pci 0001:00:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  214. [ 78.582601] pci 0001:00:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  215. [ 78.582606] pci 0001:01:00.0: PCI bridge to [bus 02-03]
  216. [ 78.582611] pci 0001:01:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  217. [ 78.582615] pci 0001:01:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  218. [ 78.582620] pci 0001:02:00.0: PCI bridge to [bus 03]
  219. [ 78.582624] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  220. [ 78.582628] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  221. [ 78.582639] amdgpu 0001:03:00.0: amdgpu: VRAM: 16368M 0x0000008000000000 - 0x00000083FEFFFFFF (16368M used)
  222. [ 78.582642] amdgpu 0001:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
  223. [ 78.582645] [drm] Detected VRAM RAM=16368M, BAR=16384M
  224. [ 78.582647] [drm] RAM width 256bits GDDR6
  225. [ 78.582826] [drm] amdgpu: 16368M of VRAM memory ready
  226. [ 78.582831] [drm] amdgpu: 16368M of GTT memory ready.
  227. [ 78.582839] [drm] GART: num cpu pages 131072, num gpu pages 131072
  228. [ 78.589574] [drm] PCIE GART of 512M enabled (table at 0x0000008000000000).
  229. [ 78.596296] [drm] use_doorbell being set to: [true]
  230. [ 78.596663] [drm] use_doorbell being set to: [true]
  231. [ 78.597025] [drm] use_doorbell being set to: [true]
  232. [ 78.597450] [drm] use_doorbell being set to: [true]
  233. [ 78.597861] [drm] Found VCN firmware Version ENC: 1.3 DEC: 2 VEP: 0 Revision: 17
  234. [ 78.597869] [drm] PSP loading VCN firmware
  235. [ 78.853223] [drm:psp_hw_start [amdgpu]] *ERROR* PSP create ring failed!
  236. [ 78.853269] [drm:psp_hw_init [amdgpu]] *ERROR* PSP firmware loading failed
  237. [ 78.853306] [drm:amdgpu_device_fw_loading [amdgpu]] *ERROR* hw_init of IP block <psp> failed -22
  238. [ 78.853309] amdgpu 0001:03:00.0: amdgpu: amdgpu_device_ip_init failed
  239. [ 78.853319] amdgpu 0001:03:00.0: amdgpu: Fatal error during GPU init
  240. [ 78.853350] amdgpu: probe of 0001:03:00.0 failed with error -22
  241. [ 78.853354] pci 0001:03:00.1: Added to existing PE#0
  242. [ 78.853359] pci 0001:03:00.1: Adding to iommu group 1
  243. [ 78.853444] pci 0001:03:00.1: D0 power state depends on 0001:03:00.0
  244. [ 78.853479] snd_hda_intel 0001:03:00.1: enabling device (0140 -> 0142)
  245. [ 78.853484] snd_hda_intel 0001:03:00.1: Force to snoop mode by module option
  246. [ 78.853504] EEH: Notify device driver to resume
  247. [ 78.853506] EEH: Beginning: 'resume'
  248. [ 78.853508] PCI 0001:03:00.0#0000: EEH: no driver
  249. [ 78.853509] PCI 0001:03:00.1#0000: EEH: driver not EEH aware
  250. [ 78.853510] EEH: Finished:'resume'
  251. [ 78.853511] EEH: Recovery successful.
  252. [ 78.853514] EEH: Recovering PHB#1-PE#0
  253. [ 78.853516] EEH: PE location: UOPWR.D100020-Node0-SLOT1 PCIE 4.0 X16, PHB location: N/A
  254. [ 78.853517] EEH: Frozen PHB#1-PE#0 detected
  255. [ 78.853518] EEH: Call Trace:
  256. [ 78.853522] EEH: [00000000d9e7d323] __eeh_send_failure_event+0x7c/0x160
  257. [ 78.853524] EEH: [00000000d61ba426] eeh_dev_check_failure.part.0+0x254/0x5e0
  258. [ 78.853561] EEH: [0000000061d1df81] amdgpu_device_rreg+0x180/0x210 [amdgpu]
  259. [ 78.853606] EEH: [00000000ed1fb3ed] gfxhub_v2_1_set_fault_enable_default+0x68/0x150 [amdgpu]
  260. [ 78.853651] EEH: [000000001cce1aab] gmc_v10_0_hw_init+0x198/0x290 [amdgpu]
  261. [ 78.853688] EEH: [0000000009744e54] amdgpu_device_init+0x1a74/0x1fc0 [amdgpu]
  262. [ 78.853725] EEH: [000000005aac3e93] amdgpu_driver_load_kms+0x30/0x520 [amdgpu]
  263. [ 78.853762] EEH: [0000000044cf3143] amdgpu_pci_probe+0x18c/0x340 [amdgpu]
  264. [ 78.853764] EEH: [00000000827393ff] local_pci_probe+0x68/0x110
  265. [ 78.853766] EEH: [00000000e5937af3] work_for_cpu_fn+0x38/0x60
  266. [ 78.853768] EEH: [0000000027a7f486] process_one_work+0x300/0x5d0
  267. [ 78.853769] EEH: [0000000041c5aee3] worker_thread+0x360/0x780
  268. [ 78.853770] EEH: [00000000787f3030] kthread+0x1e4/0x1f0
  269. [ 78.853772] EEH: [0000000021927c95] ret_from_kernel_thread+0x5c/0x6c
  270. [ 78.853773] EEH: This PCI device has failed 2 times in the last hour and will be permanently disabled after 5 failures.
  271. [ 78.853774] EEH: Notify device drivers to shutdown
  272. [ 78.853775] EEH: Beginning: 'error_detected(IO frozen)'
  273. [ 78.853777] PCI 0001:03:00.0#0000: EEH: no driver
  274. [ 78.853778] PCI 0001:03:00.1#0000: EEH: driver not EEH aware
  275. [ 78.853779] EEH: Finished:'error_detected(IO frozen)' with aggregate recovery state:'none'
  276. [ 78.853782] EEH: Collect temporary log
  277. [ 78.853812] EEH: of node=0001:03:00.0
  278. [ 78.853814] EEH: PCI device/vendor: 73bf1002
  279. [ 78.853816] EEH: PCI cmd/status register: 00100542
  280. [ 78.853817] EEH: PCI-E capabilities and status follow:
  281. [ 78.853824] EEH: PCI-E 00: 0012a010 00008fa1 00002930 00440d04
  282. [ 78.853830] EEH: PCI-E 10: 11040040 00000000 00000000 00000000
  283. [ 78.853831] EEH: PCI-E 20: 00000000
  284. [ 78.853832] EEH: PCI-E AER capability register set follows:
  285. [ 78.853839] EEH: PCI-E AER 00: 20020001 00000000 00000000 00462030
  286. [ 78.853845] EEH: PCI-E AER 10: 00000000 00002000 000001f4 40008001
  287. [ 78.853851] EEH: PCI-E AER 20: 0000000f 8007f000 00000000 00000000
  288. [ 78.853853] EEH: PCI-E AER 30: 00000000 00000000
  289. [ 78.853854] EEH: of node=0001:03:00.1
  290. [ 78.853856] EEH: PCI device/vendor: ab281002
  291. [ 78.853858] EEH: PCI cmd/status register: 00100142
  292. [ 78.853859] EEH: PCI-E capabilities and status follow:
  293. [ 78.853866] EEH: PCI-E 00: 0012a010 00008fa1 00002930 00440d04
  294. [ 78.853871] EEH: PCI-E 10: 11040040 00000000 00000000 00000000
  295. [ 78.853872] EEH: PCI-E 20: 00000000
  296. [ 78.853873] EEH: PCI-E AER capability register set follows:
  297. [ 78.853880] EEH: PCI-E AER 00: 2a020001 00000000 00000000 00462030
  298. [ 78.853886] EEH: PCI-E AER 10: 00000000 00002000 000001e0 00000000
  299. [ 78.853891] EEH: PCI-E AER 20: 00000000 00000000 00000000 00000000
  300. [ 78.853894] EEH: PCI-E AER 30: 00000000 00000000
  301. [ 78.853895] PHB4 PHB#1 Diag-data (Version: 1)
  302. [ 78.853896] brdgCtl: 00000002
  303. [ 78.853897] RootSts: 00000020 00402000 a0440008 00100107 00004000
  304. [ 78.853898] RootErrSts: 00000024 00000000 00000000
  305. [ 78.853899] sourceId: 03000000
  306. [ 78.853900] PhbSts: 0000001c00000000 0000001c00000000
  307. [ 78.853901] Lem: 0000000004000000 0000000000000000 0000000004000000
  308. [ 78.853903] PhbErr: 0000080000000000 0000080000000000 2148000098000240 a008400000000000
  309. [ 78.853904] RxeArbErr: 0000000000000020 0000000000000020 4000030000000000 0000000000000000
  310. [ 78.853905] PcieDlp: 0000000000000000 0000000000000000 7000000000000000
  311. [ 78.853906] PE[000] A/B: 8720002503000000 8000000000000000
  312. [ 78.853908] EEH: Reset with hotplug activity
  313. [ 78.853919] Attempt to iounmap early bolted mapping at 0x0000000000000000
  314. [ 78.853983] pci 0001:03:00.1: Removing from iommu group 1
  315. [ 78.854055] pci 0001:03:00.0: Removing from iommu group 1
  316. [ 80.954155] EEH: Sleep 5s ahead of complete hotplug
  317. [ 85.987779] ------------[ cut here ]------------
  318. [ 85.987788] WARNING: CPU: 0 PID: 177 at arch/powerpc/kernel/eeh_pe.c:438 eeh_pe_tree_remove+0xb8/0x260
  319. [ 85.987789] Modules linked in: amdgpu mfd_core gpu_sched xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nf_conntrack_tftp tun bridge stp llc nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nf_tables nfnetlink rfkill ip6table_filter ip6_tables iptable_filter sunrpc snd_hda_codec_hdmi snd_hda_intel snd_usb_audio snd_intel_dspcfg snd_hda_codec at24 regmap_i2c snd_hda_core snd_usbmidi_lib snd_rawmidi snd_hwdep snd_seq joydev snd_seq_device crct10dif_vpmsum snd_pcm mc ofpart ipmi_powernv ipmi_devintf powernv_flash ipmi_msghandler mtd snd_timer rtc_opal opal_prd snd i2c_opal soundcore zram ip_tables ast drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper syscopyarea
  320. [ 85.987888] sysfillrect sysimgblt fb_sys_fops cec drm vmx_crypto crc32c_vpmsum tg3 i2c_core drm_panel_orientation_quirks nvme nvme_core fuse
  321. [ 85.987907] CPU: 0 PID: 177 Comm: eehd Not tainted 5.10.21-200.4kpagesize.fc33.ppc64le #1
  322. [ 85.987909] NIP: c00000000004b778 LR: c00000000004b710 CTR: c00000000004ce90
  323. [ 85.987912] REGS: c00000000d14f840 TRAP: 0700 Not tainted (5.10.21-200.4kpagesize.fc33.ppc64le)
  324. [ 85.987913] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28002842 XER: 00000000
  325. [ 85.987926] CFAR: c00000000004b7b0 IRQMASK: 0
  326. GPR00: c00000000004cee8 c00000000d14fad0 c000000002310900 0000000000000001
  327. GPR04: c000000003ec94b0 c000000003ec94b0 0000000028008844 0000000000000100
  328. GPR08: c00000000d7d4068 0000000000000000 0000000000000008 0000000000000000
  329. GPR12: c00000000004ce90 c0000000024f1000 c0000000001a3be8 c00000000d04fcc0
  330. GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  331. GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000045
  332. GPR24: 0000000000000002 0000000000000000 0000000000000000 c00000000d7a1800
  333. GPR28: 5deadbeef0000100 5deadbeef0000122 c00000000d7d0000 c00000000d7d4000
  334. [ 85.987975] NIP [c00000000004b778] eeh_pe_tree_remove+0xb8/0x260
  335. [ 85.987977] LR [c00000000004b710] eeh_pe_tree_remove+0x50/0x260
  336. [ 85.987979] Call Trace:
  337. [ 85.987982] [c00000000d14fad0] [0000000000000027] 0x27 (unreliable)
  338. [ 85.987987] [c00000000d14fb50] [c00000000004cee8] eeh_pe_detach_dev+0x58/0xc0
  339. [ 85.987990] [c00000000d14fb80] [c00000000004afbc] eeh_pe_traverse+0x6c/0xf0
  340. [ 85.987994] [c00000000d14fbc0] [c00000000004fb54] eeh_reset_device+0x21c/0x2c8
  341. [ 85.987998] [c00000000d14fc70] [c00000000004ebd0] eeh_handle_normal_event+0x7e0/0xa40
  342. [ 85.988001] [c00000000d14fd50] [c00000000004fd18] eeh_event_handler+0x118/0x1a0
  343. [ 85.988005] [c00000000d14fdb0] [c0000000001a3dc4] kthread+0x1e4/0x1f0
  344. [ 85.988009] [c00000000d14fe20] [c00000000000d4f0] ret_from_kernel_thread+0x5c/0x6c
  345. [ 85.988011] Instruction dump:
  346. [ 85.988013] 67bdf000 639c0100 63bd0122 fb9e0070 fbbe0078 e95f0002 ebdf0038 71490002
  347. [ 85.988023] 41820038 480000c4 2c290000 40820008 <0fe00000> e93f0068 7c294040 418200dc
  348. [ 85.988033] ---[ end trace c7c7bf27e0e1201f ]---
  349. [ 85.988035] ------------[ cut here ]------------
  350. [ 85.988039] WARNING: CPU: 0 PID: 177 at arch/powerpc/kernel/eeh_pe.c:438 eeh_pe_tree_remove+0xb8/0x260
  351. [ 85.988040] Modules linked in: amdgpu mfd_core gpu_sched xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nf_conntrack_tftp tun bridge stp llc nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nf_tables nfnetlink rfkill ip6table_filter ip6_tables iptable_filter sunrpc snd_hda_codec_hdmi snd_hda_intel snd_usb_audio snd_intel_dspcfg snd_hda_codec at24 regmap_i2c snd_hda_core snd_usbmidi_lib snd_rawmidi snd_hwdep snd_seq joydev snd_seq_device crct10dif_vpmsum snd_pcm mc ofpart ipmi_powernv ipmi_devintf powernv_flash ipmi_msghandler mtd snd_timer rtc_opal opal_prd snd i2c_opal soundcore zram ip_tables ast drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper syscopyarea
  352. [ 85.988131] sysfillrect sysimgblt fb_sys_fops cec drm vmx_crypto crc32c_vpmsum tg3 i2c_core drm_panel_orientation_quirks nvme nvme_core fuse
  353. [ 85.988148] CPU: 0 PID: 177 Comm: eehd Tainted: G W 5.10.21-200.4kpagesize.fc33.ppc64le #1
  354. [ 85.988150] NIP: c00000000004b778 LR: c00000000004b710 CTR: c00000000004ce90
  355. [ 85.988152] REGS: c00000000d14f840 TRAP: 0700 Tainted: G W (5.10.21-200.4kpagesize.fc33.ppc64le)
  356. [ 85.988153] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28002842 XER: 00000000
  357. [ 85.988166] CFAR: c00000000004b7b0 IRQMASK: 0
  358. GPR00: c00000000004cee8 c00000000d14fad0 c000000002310900 0000000000000001
  359. GPR04: c000000003ec9e70 c000000003ec9e70 0000000028008844 0000000000000100
  360. GPR08: c00000000d7d4068 0000000000000000 0000000000000008 0000000000000000
  361. GPR12: c00000000004ce90 c0000000024f1000 c0000000001a3be8 c00000000d04fcc0
  362. GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  363. GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000045
  364. GPR24: 0000000000000002 0000000000000000 0000000000000000 c00000000d7a1800
  365. GPR28: 5deadbeef0000100 5deadbeef0000122 c00000000d7d0000 c00000000d7d4000
  366. [ 85.988213] NIP [c00000000004b778] eeh_pe_tree_remove+0xb8/0x260
  367. [ 85.988216] LR [c00000000004b710] eeh_pe_tree_remove+0x50/0x260
  368. [ 85.988217] Call Trace:
  369. [ 85.988219] [c00000000d14fad0] [0000000000000027] 0x27 (unreliable)
  370. [ 85.988223] [c00000000d14fb50] [c00000000004cee8] eeh_pe_detach_dev+0x58/0xc0
  371. [ 85.988227] [c00000000d14fb80] [c00000000004afbc] eeh_pe_traverse+0x6c/0xf0
  372. [ 85.988230] [c00000000d14fbc0] [c00000000004fb54] eeh_reset_device+0x21c/0x2c8
  373. [ 85.988234] [c00000000d14fc70] [c00000000004ebd0] eeh_handle_normal_event+0x7e0/0xa40
  374. [ 85.988237] [c00000000d14fd50] [c00000000004fd18] eeh_event_handler+0x118/0x1a0
  375. [ 85.988240] [c00000000d14fdb0] [c0000000001a3dc4] kthread+0x1e4/0x1f0
  376. [ 85.988244] [c00000000d14fe20] [c00000000000d4f0] ret_from_kernel_thread+0x5c/0x6c
  377. [ 85.988246] Instruction dump:
  378. [ 85.988248] 67bdf000 639c0100 63bd0122 fb9e0070 fbbe0078 e95f0002 ebdf0038 71490002
  379. [ 85.988258] 41820038 480000c4 2c290000 40820008 <0fe00000> e93f0068 7c294040 418200dc
  380. [ 85.988268] ---[ end trace c7c7bf27e0e12020 ]---
  381. [ 85.988318] pci 0001:03:00.0: [1002:73bf] type 00 class 0x030000
  382. [ 85.988340] pci 0001:03:00.0: reg 0x10: [mem 0x6004000000000-0x600400fffffff 64bit pref]
  383. [ 85.988352] pci 0001:03:00.0: reg 0x18: [mem 0x6004010000000-0x60040101fffff 64bit pref]
  384. [ 85.988359] pci 0001:03:00.0: reg 0x20: [io 0x0000-0x00ff]
  385. [ 85.988367] pci 0001:03:00.0: reg 0x24: [mem 0x600c080000000-0x600c0800fffff]
  386. [ 85.988375] pci 0001:03:00.0: reg 0x30: [mem 0x00000000-0x0001ffff pref]
  387. [ 85.988505] pci 0001:03:00.0: PME# supported from D1 D2 D3hot D3cold
  388. [ 85.988598] pci 0001:03:00.0: 63.012 Gb/s available PCIe bandwidth, limited by 16.0 GT/s PCIe x4 link at 0001:00:00.0 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
  389. [ 85.988667] pci 0001:03:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
  390. [ 85.989164] pci 0001:03:00.1: [1002:ab28] type 00 class 0x040300
  391. [ 85.989178] pci 0001:03:00.1: reg 0x10: [mem 0x600c080120000-0x600c080123fff]
  392. [ 85.989290] pci 0001:03:00.1: PME# supported from D1 D2 D3hot D3cold
  393. [ 85.989808] pci 0001:02:00.0: ASPM: current common clock configuration is inconsistent, reconfiguring
  394. [ 85.989849] pci 0001:02:00.0: BAR 13: no space for [io size 0x1000]
  395. [ 85.989851] pci 0001:02:00.0: BAR 13: failed to assign [io size 0x1000]
  396. [ 85.989856] pci 0001:03:00.0: BAR 0: assigned [mem 0x6004000000000-0x600400fffffff 64bit pref]
  397. [ 85.989866] pci 0001:03:00.0: BAR 2: assigned [mem 0x6004010000000-0x60040101fffff 64bit pref]
  398. [ 85.989875] pci 0001:03:00.0: BAR 5: assigned [mem 0x600c080000000-0x600c0800fffff]
  399. [ 85.989880] pci 0001:03:00.0: BAR 6: assigned [mem 0x600c080100000-0x600c08011ffff pref]
  400. [ 85.989883] pci 0001:03:00.1: BAR 0: assigned [mem 0x600c080120000-0x600c080123fff]
  401. [ 85.989887] pci 0001:03:00.0: BAR 4: no space for [io size 0x0100]
  402. [ 85.989890] pci 0001:03:00.0: BAR 4: failed to assign [io size 0x0100]
  403. [ 85.989893] pci 0001:02:00.0: PCI bridge to [bus 03]
  404. [ 85.989898] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  405. [ 85.989902] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  406. [ 85.989906] PCI: No. 2 try to assign unassigned res
  407. [ 85.989910] pci 0001:02:00.0: BAR 13: no space for [io size 0x1000]
  408. [ 85.989912] pci 0001:02:00.0: BAR 13: failed to assign [io size 0x1000]
  409. [ 85.989915] pci 0001:03:00.0: BAR 4: no space for [io size 0x0100]
  410. [ 85.989917] pci 0001:03:00.0: BAR 4: failed to assign [io size 0x0100]
  411. [ 85.989920] pci 0001:02:00.0: PCI bridge to [bus 03]
  412. [ 85.989925] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  413. [ 85.989928] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  414. [ 85.989940] pci 0001:03:00.0: Added to existing PE#0
  415. [ 85.989946] pci 0001:03:00.0: Adding to iommu group 1
  416. [ 85.990081] amdgpu 0001:03:00.0: enabling device (0140 -> 0142)
  417. [ 85.990088] [drm] initializing kernel modesetting (SIENNA_CICHLID 0x1002:0x73BF 0x1DA2:0xE438 0xC0).
  418. [ 85.990092] amdgpu 0001:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
  419. [ 85.990104] [drm] register mmio base: 0x80000000
  420. [ 85.990105] [drm] register mmio size: 1048576
  421. [ 85.990107] [drm] PCI I/O BAR is not found.
  422. [ 85.990113] [drm] PCIE atomic ops is not supported
  423. [ 85.992344] [drm] add ip block number 0 <nv_common>
  424. [ 85.992346] [drm] add ip block number 1 <gmc_v10_0>
  425. [ 85.992347] [drm] add ip block number 2 <navi10_ih>
  426. [ 85.992349] [drm] add ip block number 3 <psp>
  427. [ 85.992351] [drm] add ip block number 4 <smu>
  428. [ 85.992353] [drm] add ip block number 5 <gfx_v10_0>
  429. [ 85.992354] [drm] add ip block number 6 <sdma_v5_2>
  430. [ 85.992356] [drm] add ip block number 7 <vcn_v3_0>
  431. [ 85.992357] [drm] add ip block number 8 <jpeg_v3_0>
  432. [ 86.023918] amdgpu 0001:03:00.0: amdgpu: Fetched VBIOS from ROM BAR
  433. [ 86.023926] amdgpu: ATOM BIOS: 113-E438XTX-UO2
  434. [ 86.023949] [drm] VCN(0) decode is enabled in VM mode
  435. [ 86.023952] [drm] VCN(1) decode is enabled in VM mode
  436. [ 86.023955] [drm] VCN(0) encode is enabled in VM mode
  437. [ 86.023958] [drm] VCN(1) encode is enabled in VM mode
  438. [ 86.023962] [drm] JPEG decode is enabled in VM mode
  439. [ 86.024021] amdgpu 0001:03:00.0: amdgpu: HBM ECC is not presented.
  440. [ 86.024024] amdgpu 0001:03:00.0: amdgpu: SRAM ECC is not presented.
  441. [ 86.024033] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
  442. [ 86.024071] amdgpu 0001:03:00.0: BAR 2: releasing [mem 0x6004010000000-0x60040101fffff 64bit pref]
  443. [ 86.024075] amdgpu 0001:03:00.0: BAR 0: releasing [mem 0x6004000000000-0x600400fffffff 64bit pref]
  444. [ 86.024112] pci 0001:02:00.0: BAR 15: releasing [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  445. [ 86.024116] pci 0001:01:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  446. [ 86.024120] pci 0001:00:00.0: BAR 15: releasing [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  447. [ 86.024132] pci 0001:00:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  448. [ 86.024137] pci 0001:01:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  449. [ 86.024142] pci 0001:02:00.0: BAR 15: assigned [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  450. [ 86.024147] amdgpu 0001:03:00.0: BAR 0: assigned [mem 0x6004000000000-0x60043ffffffff 64bit pref]
  451. [ 86.024160] amdgpu 0001:03:00.0: BAR 2: assigned [mem 0x6004400000000-0x60044001fffff 64bit pref]
  452. [ 86.024174] pci 0001:00:00.0: PCI bridge to [bus 01-03]
  453. [ 86.024180] pci 0001:00:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  454. [ 86.024185] pci 0001:00:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  455. [ 86.024192] pci 0001:01:00.0: PCI bridge to [bus 02-03]
  456. [ 86.024200] pci 0001:01:00.0: bridge window [mem 0x600c080000000-0x600c0ffefffff]
  457. [ 86.024205] pci 0001:01:00.0: bridge window [mem 0x6004000000000-0x6007f7ff0ffff 64bit pref]
  458. [ 86.024213] pci 0001:02:00.0: PCI bridge to [bus 03]
  459. [ 86.024219] pci 0001:02:00.0: bridge window [mem 0x600c080000000-0x600c0807fffff]
  460. [ 86.024225] pci 0001:02:00.0: bridge window [mem 0x6004000000000-0x60045ffffffff 64bit pref]
  461. [ 86.024240] amdgpu 0001:03:00.0: amdgpu: VRAM: 16368M 0x0000008000000000 - 0x00000083FEFFFFFF (16368M used)
  462. [ 86.024244] amdgpu 0001:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
  463. [ 86.024248] [drm] Detected VRAM RAM=16368M, BAR=16384M
  464. [ 86.024251] [drm] RAM width 256bits GDDR6
  465. [ 86.024256] list_add corruption. prev->next should be next (c00800000067e970), but was 0000000000000000. (prev=c0000000685455b8).
  466. [ 86.024282] ------------[ cut here ]------------
  467. [ 86.024284] kernel BUG at lib/list_debug.c:26!
  468. [ 86.024291] Oops: Exception in kernel mode, sig: 5 [#1]
  469. [ 86.024296] LE PAGE_SIZE=4K MMU=Radix SMP NR_CPUS=2048 NUMA PowerNV
  470. [ 86.024300] Modules linked in: amdgpu mfd_core gpu_sched xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nf_conntrack_tftp tun bridge stp llc nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nf_tables nfnetlink rfkill ip6table_filter ip6_tables iptable_filter sunrpc snd_hda_codec_hdmi snd_hda_intel snd_usb_audio snd_intel_dspcfg snd_hda_codec at24 regmap_i2c snd_hda_core snd_usbmidi_lib snd_rawmidi snd_hwdep snd_seq joydev snd_seq_device crct10dif_vpmsum snd_pcm mc ofpart ipmi_powernv ipmi_devintf powernv_flash ipmi_msghandler mtd snd_timer rtc_opal opal_prd snd i2c_opal soundcore zram ip_tables ast drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper syscopyarea
  471. [ 86.024426] sysfillrect sysimgblt fb_sys_fops cec drm vmx_crypto crc32c_vpmsum tg3 i2c_core drm_panel_orientation_quirks nvme nvme_core fuse
  472. [ 86.024454] CPU: 0 PID: 189 Comm: kworker/0:2 Tainted: G W 5.10.21-200.4kpagesize.fc33.ppc64le #1
  473. [ 86.024461] Workqueue: events work_for_cpu_fn
  474. [ 86.024466] NIP: c000000000a4a424 LR: c000000000a4a420 CTR: 0000000000000000
  475. [ 86.024470] REGS: c00000000e0fb380 TRAP: 0700 Tainted: G W (5.10.21-200.4kpagesize.fc33.ppc64le)
  476. [ 86.024474] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28002444 XER: 20040000
  477. [ 86.024492] CFAR: c000000000216098 IRQMASK: 0
  478. GPR00: c000000000a4a420 c00000000e0fb610 c000000002310900 0000000000000075
  479. GPR04: ffffffffffffffea c000000002099a88 0000000000000001 0000000000000027
  480. GPR08: c000000ffc6dcf90 ffffffffffffffd8 0000000000000023 3030303038303063
  481. GPR12: 0000000000002000 c0000000024f1000 c00000000d14f7b0 c0000000686e5b78
  482. GPR16: c0000000686e5b80 c0000000686e5b70 c0000000686f6d90 c0000000686e5b90
  483. GPR20: c0000000686e5b98 c0000000686e5b88 0000000000000001 c00800000067e970
  484. GPR24: c0080000034ae4c0 0000000000000000 c00000000cf66c58 c0000000686e55d0
  485. GPR28: c00800000067d998 c0000000685455b8 c00800000067e920 c0000000686e55b8
  486. [ 86.024564] NIP [c000000000a4a424] __list_add_valid+0xb4/0xc0
  487. [ 86.024569] LR [c000000000a4a420] __list_add_valid+0xb0/0xc0
  488. [ 86.024572] Call Trace:
  489. [ 86.024577] [c00000000e0fb610] [c000000000a4a420] __list_add_valid+0xb0/0xc0 (unreliable)
  490. [ 86.024592] [c00000000e0fb670] [c00800000066bf80] ttm_bo_device_init+0x158/0x2d0 [ttm]
  491. [ 86.024728] [c00000000e0fb720] [c008000002ef4214] amdgpu_ttm_init+0xcc/0x620 [amdgpu]
  492. [ 86.024874] [c00000000e0fb830] [c0080000033326d0] amdgpu_bo_init+0x80/0xa0 [amdgpu]
  493. [ 86.025020] [c00000000e0fb8a0] [c008000002f9e750] gmc_v10_0_sw_init+0x338/0x480 [amdgpu]
  494. [ 86.025158] [c00000000e0fb940] [c008000002edb3f8] amdgpu_device_init+0x1670/0x1fc0 [amdgpu]
  495. [ 86.025294] [c00000000e0fba90] [c008000002edf108] amdgpu_driver_load_kms+0x30/0x520 [amdgpu]
  496. [ 86.025431] [c00000000e0fbb10] [c008000002ed2a84] amdgpu_pci_probe+0x18c/0x340 [amdgpu]
  497. [ 86.025439] [c00000000e0fbbb0] [c000000000b2d978] local_pci_probe+0x68/0x110
  498. [ 86.025446] [c00000000e0fbc30] [c000000000192ac8] work_for_cpu_fn+0x38/0x60
  499. [ 86.025453] [c00000000e0fbc60] [c000000000197c40] process_one_work+0x300/0x5d0
  500. [ 86.025459] [c00000000e0fbd00] [c000000000198270] worker_thread+0x360/0x780
  501. [ 86.025465] [c00000000e0fbdb0] [c0000000001a3dc4] kthread+0x1e4/0x1f0
  502. [ 86.025472] [c00000000e0fbe20] [c00000000000d4f0] ret_from_kernel_thread+0x5c/0x6c
  503. [ 86.025476] Instruction dump:
  504. [ 86.025480] f8010070 4b7cbc59 60000000 0fe00000 7c0802a6 3c62ff34 7d465378 7d244b78
  505. [ 86.025494] 38638bd0 f8010070 4b7cbc35 60000000 <0fe00000> 60000000 60420000 3c4c018c
  506. [ 86.025512] ---[ end trace c7c7bf27e0e12021 ]---
  507.  