Hello, On Mon, Mar 24, 2025 at 07:12:28PM -0300, Matías Teplitzky wrote: > *** Reporter, please consider answering these questions, where appropriate *** > > I'm reporting this bug from the machine having this problem. As the subject states, all of a sudden, I'm getting a blank screen at the end of the boot process, > with no video output at all!. I've no GUI and video output remains unresponsive when trying to change and use any TTY. However, I was able to connect through SSH, > enabling it by blindly typing login creds and propper commands. Thus, I was able to gather some information. > > >From kernel messages extracted with "sudo journalctl -b -g amdgpu": > > ----- > > Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu kernel modesetting enabled. > Mar 24 12:19:14 debian-desktop kernel: amdgpu: Virtual CRAT table created for CPU > Mar 24 12:19:14 debian-desktop kernel: amdgpu: Topology: Add CPU node > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: enabling device (0006 -> 0007) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Fetched VBIOS from VFCT > Mar 24 12:19:14 debian-desktop kernel: amdgpu: ATOM BIOS: 113-RAVEN2-117 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: vgaarb: deactivate vga console > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Trusted Memory Zone (TMZ) feature enabled > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: VRAM: 2048M 0x000000F400000000 - 0x000000F47FFFFFFF (2048M used) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF > Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu: 2048M of VRAM memory ready > Mar 24 12:19:14 debian-desktop kernel: [drm] amdgpu: 6960M of GTT memory ready. > Mar 24 12:19:14 debian-desktop kernel: amdgpu: hwmgr_sw_init smu backed is smu10_smu > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: reserve 0x400000 from 0xf47fc00000 for PSP TMR > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_CNTL(0x29) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF300F) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_GPM_MEM(0x2A) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF000F) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: failed to load ucode RLC_RESTORE_LIST_SRM_MEM(0x2B) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF000F) > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available > Mar 24 12:19:14 debian-desktop kernel: snd_hda_intel 0000:04:00.1: bound 0000:04:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu]) > Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: Allocated 3969056 bytes on gart > Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: Total number of KFD nodes to be created: 1 > Mar 24 12:19:14 debian-desktop kernel: amdgpu: Virtual CRAT table created for GPU > Mar 24 12:19:14 debian-desktop kernel: amdgpu: Topology: Add dGPU node [0x15d8:0x1002] > Mar 24 12:19:14 debian-desktop kernel: kfd kfd: amdgpu: added device 1002:15d8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: SE 1, SH per SE 1, CU per SH 3, active_cu_number 3 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 8 > Mar 24 12:19:14 debian-desktop kernel: amdgpu: pp_dpm_get_sclk_od was not implemented. > Mar 24 12:19:14 debian-desktop kernel: amdgpu: pp_dpm_get_mclk_od was not implemented. > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: amdgpu: Runtime PM not available > Mar 24 12:19:14 debian-desktop kernel: [drm] Initialized amdgpu 3.61.0 for 0000:04:00.0 on minor 0 > Mar 24 12:19:14 debian-desktop kernel: amdgpu 0000:04:00.0: [drm] Cannot find any crtc or sizes > > ----- > > In particular, this line caught my attention: "amdgpu 0000:04:00.0: [drm] Cannot find any crtc or sizes". Soon, > I found that booting with "nomodeset" as a kernel argument, I was able to get video output but unable to load a GUI > (I'm using wayland with sway). I understand that this is due to disabling KMS kernel driver. > > After doing some digging on the Internet, I found that booting with "amd_iommu=off" should work and it did!. Note that > information gathered below, is the result from booting with that kernel paramater as a workaorund. > > > * What led up to the situation? > > I don't know, really!. If I recall correctly, I've been using the "linux-image-6.12.17-amd64" kernel for maybe a week > and this problem didn't occur. Once faced with this problem, I tried doing an "apt upgrade" and tried with the updated > kernel version "linux-image-6.12.19-amd64", but it didn't work. This problem persisted with several reboots and a power off and > on again cycle. I also tried booting with older kernel versions, but didn't help either. I wonder if you did a *real* cold boot between your tests. Sometimes not all components are properly reset on a warm restart. So ideally unplug power for a moment between tests. I guess the grub boot menu is shown before Linux is started? Can you provide a full boot log for a boot without amd_iommu=off? Best regards Uwe
Attachment:
signature.asc
Description: PGP signature