[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1101207: amdgpu: blank screen at the end of the boot process, with no video output at all!.



Hello,

On Mon, Mar 24, 2025 at 07:12:28PM -0300, Matías Teplitzky wrote:
> *** Reporter, please consider answering these questions, where appropriate ***
> 
> I'm reporting this bug from the machine having this problem. As the subject states, all of a sudden, I'm getting a blank screen at the end of the boot process,
> with no video output at all!. I've no GUI and video output remains unresponsive when trying to change and use any TTY. However, I was able to connect through SSH,
> enabling it by blindly typing login creds and propper commands. Thus, I was able to gather some information.
> 
> >From kernel messages extracted with "sudo journalctl -b -g amdgpu":
> 
> -----
> 
> Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu kernel modesetting enabled.
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: Virtual CRAT table created for CPU
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: Topology: Add CPU node
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: enabling device (0006 -> 0007)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Fetched VBIOS from VFCT
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: ATOM BIOS: 113-RAVEN2-117
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: vgaarb: deactivate vga console
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Trusted Memory Zone (TMZ) feature enabled
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: VRAM: 2048M 0x000000F400000000 - 0x000000F47FFFFFFF (2048M used)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
> Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu: 2048M of VRAM memory ready
> Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu: 6960M of GTT memory ready.
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: hwmgr_sw_init smu backed is smu10_smu
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: reserve 0x400000 from 0xf47fc00000 for PSP TMR
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_CNTL(0x29)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF300F)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_GPM_MEM(0x2A)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF000F)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_SRM_MEM(0x2B)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF000F)
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
> Mar 24 12:19:14 debian-desktop kernel: snd_hda_intel 0000:04:00.1: bound 0000:04:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
> Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: Allocated 3969056 bytes on gart
> Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: Total number of KFD nodes to be created: 1
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: Virtual CRAT table created for GPU
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: Topology: Add dGPU node [0x15d8:0x1002]
> Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: added device 1002:15d8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: SE 1, SH per SE 1, CU per SH 3, active_cu_number 3
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 8
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: pp_dpm_get_sclk_od was not implemented.
> Mar 24 12:19:14 debian-desktop kernel: amdgpu: pp_dpm_get_mclk_od was not implemented.
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Runtime PM not available
> Mar 24 12:19:14 debian-desktop kernel: [drm] Initialized amdgpu 3.61.0 for 0000:04:00.0 on minor 0
> Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: [drm] Cannot find any crtc or sizes
> 
> -----
> 
> In particular, this line caught my attention: "amdgpu 0000:04:00.0: [drm] Cannot find any crtc or sizes". Soon,
> I found that booting with "nomodeset" as a kernel argument, I was able to get video output but unable to load a GUI
> (I'm using wayland with sway). I understand that this is due to disabling KMS kernel driver.
> 
> After doing some digging on the Internet, I found that booting with "amd_iommu=off" should work and it did!. Note that
> information gathered below, is the result from booting with that kernel paramater as a workaorund.  
> 
> 
>    * What led up to the situation?
> 
> I don't know, really!. If I recall correctly, I've been using the "linux-image-6.12.17-amd64" kernel for maybe a week
> and this problem didn't occur. Once faced with this problem, I tried doing an "apt upgrade" and tried with the updated
> kernel version "linux-image-6.12.19-amd64", but it didn't work. This problem persisted with several reboots and a power off and
> on again cycle. I also tried booting with older kernel versions, but didn't help either.   

I wonder if you did a *real* cold boot between your tests. Sometimes not
all components are properly reset on a warm restart. So ideally unplug
power for a moment between tests. I guess the grub boot menu is shown
before Linux is started?

Can you provide a full boot log for a boot without amd_iommu=off?

Best regards
Uwe

Attachment: signature.asc
Description: PGP signature


Reply to: