
Bug#1104732: linux-image-6.1.0-34-amd64: amdgpu error, kernel cpu loop, kernel log spam



Thank you for your time looking into this Salvatore.

> I assume you cannot more specifically say when you saw the problem
> first appearing in the 6.1.y series?

Sorry, no. From memory it was around then, but I cannot find any logs that firmly point to a particular version.

> Would you be able to test newer stable series as well (ideally 6.12.y
> as will be shipped in trixie or mainline kernel?)

I may need some instructions on building a kernel, applying kernel patches, and switching to an alternative kernel on Debian. I am familiar with git and with building software in general, but not with the kernel or C applications.
 
> I have searched the upstream issues in
> https://gitlab.freedesktop.org/drm/amd/-/issues and did not find
> something directly matching your report. Could you please report it
> upstream and report the upstream issue back here so we can link
> those?
>
> Since you own the hardware, that would help speed up the
> debugging by having you directly interacting with upstream on the
> matter. Can you do that?

I have filed the bug upstream at https://gitlab.freedesktop.org/drm/amd/-/issues/4212 and will follow up with them there, unless I need some Debian-specific help.
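
Since the repeated `Head pipe not found` message in the quoted logs quickly buries the initial trace, here is a minimal sketch of how consecutive duplicates could be collapsed when reading a saved log. It assumes syslog-style lines in the same format as the quoted logs; the function name `collapse_repeats` and the `PREFIX` pattern are hypothetical helpers for illustration, not part of any existing tool.

```python
import re
from typing import Iterable, Iterator

# Assumed prefix format, taken from the quoted logs, e.g.
# "2025-05-05T10:07:38.129475+01:00 rhino kernel: [257606.785216] "
PREFIX = re.compile(r"^\S+ \S+ kernel: \[\s*\d+\.\d+\] ")


def collapse_repeats(lines: Iterable[str]) -> Iterator[str]:
    """Yield each line once; summarise consecutive lines whose message
    (the text after the timestamp prefix) is identical."""
    previous = None
    repeats = 0
    for line in lines:
        line = line.rstrip("\n")
        message = PREFIX.sub("", line)
        if message == previous:
            # Same message as the line before: count it, do not print it.
            repeats += 1
            continue
        if repeats:
            yield f"    (previous message repeated {repeats} more time(s))"
        yield line
        previous = message
        repeats = 0
    if repeats:
        yield f"    (previous message repeated {repeats} more time(s))"
```

Applied to an open log file (for example `for out in collapse_repeats(open(path)): print(out)`, where `path` is wherever the kernel log was saved), this keeps the `cut here` traces intact and reduces the flood to a single line plus a count.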

Kind regards,
MG.

On Monday, 5 May 2025 at 15:23, Salvatore Bonaccorso <carnil@debian.org> wrote:

> Control: tags -1 + moreinfo
> 
> Hi
> 
> thanks for your report.
> 
> On Mon, May 05, 2025 at 12:35:36PM +0100, mg wrote:
> 
> > Package: src:linux
> > Version: 6.1.135-1
> > Severity: important
> > X-Debbugs-Cc: mg-public-addr@protonmail.com
> > 
> > Dear Maintainer,
> > 
> > The system logs an error, followed by 100% CPU usage on one core (by the kernel, I believe), which results in the message
> > 
> > `[drm:dc_add_plane_to_context [amdgpu]] *ERROR* Head pipe not found for stream_state 00000000b7629c18 !`
> > 
> > logged endlessly to the kernel log, as fast as the CPU can process it.
> > 
> > The trigger for this issue is unclear to me, as it will not happen on every boot of the system, and can take hours, days or
> > weeks to appear after a reboot.
> > 
> > Other system functions appear to work as normal, or are not degraded in a way I have noticed, as long as the logs are rotated.
> > 
> > Rebooting the system is my current approach when this error happens, and buys time until it occurs again. This system has run
> > without issue for >60 days on an affected kernel version, so I suspect there is no guarantee this bug will always appear.
> > 
> > The system is run as a mostly headless server and does not hibernate, sleep or suspend. It is connected via an HDMI cable to a TV
> > that turns on and off throughout the day; this is one of the few inputs relevant to the amdgpu driver that I suspect could be a
> > trigger. The cable is connected to the motherboard HDMI port, using the iGPU of a Ryzen 2200G.
> > 
> > This system is running openmediavault (installed from that project's install media), but I am reporting it here as I suspect it does not make
> > modifications to the kernel and core Debian system.
> > 
> > The last time this was known to be stable for me was on the 5.10 kernels under bullseye, and on both bullseye and bookworm under
> > the 6.x kernel this issue has appeared.
> > 
> > I have captured two instances of this on separate dates, included below. The last line is the one that repeats indefinitely from
> > that point onwards. It is difficult to capture, as the logs will quickly either fill up the hard drive or get rotated
> > out, which means that it has been hard to observe anything other than the final message in the logs! It has recurred
> > around 6-10 times in total over a 12-month period.
> > 
> > Please note that the last kernel log included by `reportbug` is from a fresh reboot of the system where this issue has not
> > occurred yet, and may be of no use; otherwise it would only capture the spammed log message!
> > 
> > Logs from first time I caught the issue:
> > 
> > 2024-07-19T20:17:35.445701+01:00 rhino kernel: [72604.746570] ------------[ cut here ]------------
> > 2024-07-19T20:17:35.445717+01:00 rhino kernel: [72604.746574] WARNING: CPU: 1 PID: 56 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc.c:3074 dc_update_planes_and_stream+0x342/0x870 [amdgpu]
> > 2024-07-19T20:17:35.445720+01:00 rhino kernel: [72604.747029] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace sunrpc quota_v2 quota_tree binfmt_misc intel_rapl_msr nls_ascii nls_cp437 intel_rapl_common vfat snd_hda_codec_realtek fat stv6111 btusb snd_hda_codec_generic btrtl lnbh25 edac_mce_amd ledtrig_audio btbcm btintel snd_hda_codec_hdmi kvm_amd amdgpu snd_hda_intel btmtk stv0910 iwlmvm kvm snd_intel_dspcfg snd_intel_sdw_acpi bluetooth irqbypass snd_hda_codec mac80211 gpu_sched drm_buddy jitterentropy_rng ghash_clmulni_intel snd_hda_core libarc4 drm_display_helper sha256_ssse3 sha512_ssse3 snd_hwdep sha1_ssse3 sha512_generic cec snd_pcm drbg iwlwifi rc_core ansi_cprng drm_ttm_helper snd_timer aesni_intel ttm snd ddbridge crypto_simd cryptd soundcore
> > 2024-07-19T20:17:35.445722+01:00 rhino kernel: [72604.747114] drm_kms_helper ecdh_generic dvb_core ecc mc rapl cfg80211 ccp wmi_bmof pcspkr sp5100_tco k10temp sg rfkill evdev acpi_cpufreq button wireguard libchacha20poly1305 chacha_x86_64 poly1305_x86_64 curve25519_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel softdog watchdog nct6775 drm nct6775_core hwmon_vid dm_mod fuse loop efi_pstore configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic md_mod sd_mod t10_pi crc64_rocksoft crc64 crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common xhci_pci ahci libahci crc32_pclmul crc32c_intel xhci_hcd libata igb usbcore scsi_mod i2c_algo_bit dca i2c_piix4 scsi_common usb_common video wmi gpio_amdpt gpio_generic
> > 2024-07-19T20:17:35.445724+01:00 rhino kernel: [72604.747206] CPU: 1 PID: 56 Comm: kworker/1:1H Not tainted 6.1.0-23-amd64 #1 Debian 6.1.99-1
> > 2024-07-19T20:17:35.445725+01:00 rhino kernel: [72604.747212] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Gaming-ITX/ac, BIOS P3.40 07/17/2019
> > 2024-07-19T20:17:35.445737+01:00 rhino kernel: [72604.747215] Workqueue: events_highpri dm_irq_work_func [amdgpu]
> > 2024-07-19T20:17:35.445738+01:00 rhino kernel: [72604.747659] RIP: 0010:dc_update_planes_and_stream+0x342/0x870 [amdgpu]
> > 2024-07-19T20:17:35.445740+01:00 rhino kernel: [72604.748095] Code: 48 2b 14 25 28 00 00 00 0f 85 38 05 00 00 48 83 c4 50 5b 5d 41 5c 41 5d 41 5e 41 5f e9 57 4e 9a de 45 85 ed 0f 84 51 fe ff ff <0f> 0b 31 c0 eb ca 8b 93 50 06 00 00 83 fa 01 0f 84 68 fe ff ff 48
> > 2024-07-19T20:17:35.445741+01:00 rhino kernel: [72604.748099] RSP: 0018:ffffbeb1c040f870 EFLAGS: 00010202
> > 2024-07-19T20:17:35.445742+01:00 rhino kernel: [72604.748103] RAX: 0000000000000000 RBX: ffff962e90544000 RCX: 0000000000000000
> > 2024-07-19T20:17:35.445744+01:00 rhino kernel: [72604.748106] RDX: 0000000000000000 RSI: ffff962eb3a00000 RDI: ffff962e90544000
> > 2024-07-19T20:17:35.445745+01:00 rhino kernel: [72604.748108] RBP: ffffbeb1c040fc68 R08: 0000000000000000 R09: 0000000000000004
> > 2024-07-19T20:17:35.445746+01:00 rhino kernel: [72604.748110] R10: 0000000000000002 R11: 0000000000000001 R12: ffff962e87c60000
> > 2024-07-19T20:17:35.445747+01:00 rhino kernel: [72604.748112] R13: 0000000000000001 R14: ffff962e90544000 R15: ffff962eefbe0200
> > 2024-07-19T20:17:35.445748+01:00 rhino kernel: [72604.748115] FS: 0000000000000000(0000) GS:ffff962f96a40000(0000) knlGS:0000000000000000
> > 2024-07-19T20:17:35.445749+01:00 rhino kernel: [72604.748118] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > 2024-07-19T20:17:35.445750+01:00 rhino kernel: [72604.748121] CR2: 00007ffcd7bfc0f8 CR3: 00000001baea8000 CR4: 00000000003506e0
> > 2024-07-19T20:17:35.445750+01:00 rhino kernel: [72604.748124] Call Trace:
> > 2024-07-19T20:17:35.445751+01:00 rhino kernel: [72604.748128] <TASK>
> > 2024-07-19T20:17:35.445753+01:00 rhino kernel: [72604.748132] ? __warn+0x7d/0xc0
> > 2024-07-19T20:17:35.445754+01:00 rhino kernel: [72604.748139] ? dc_update_planes_and_stream+0x342/0x870 [amdgpu]
> > 2024-07-19T20:17:35.445755+01:00 rhino kernel: [72604.748573] ? report_bug+0xe2/0x150
> > 2024-07-19T20:17:35.445755+01:00 rhino kernel: [72604.748578] ? handle_bug+0x41/0x70
> > 2024-07-19T20:17:35.445756+01:00 rhino kernel: [72604.748583] ? exc_invalid_op+0x13/0x60
> > 2024-07-19T20:17:35.445757+01:00 rhino kernel: [72604.748588] ? asm_exc_invalid_op+0x16/0x20
> > 2024-07-19T20:17:35.445758+01:00 rhino kernel: [72604.748594] ? dc_update_planes_and_stream+0x342/0x870 [amdgpu]
> > 2024-07-19T20:17:35.445759+01:00 rhino kernel: [72604.749024] ? __mutex_remove_waiter+0x15/0x60
> > 2024-07-19T20:17:35.445760+01:00 rhino kernel: [72604.749031] amdgpu_dm_atomic_commit_tail+0x19e4/0x36f0 [amdgpu]
> > 2024-07-19T20:17:35.445761+01:00 rhino kernel: [72604.749471] ? mode_support_and_system_configuration+0x40f1/0x4c40 [amdgpu]
> > 2024-07-19T20:17:35.445769+01:00 rhino kernel: [72604.749946] commit_tail+0x94/0x130 [drm_kms_helper]
> > 2024-07-19T20:17:35.445770+01:00 rhino kernel: [72604.749976] drm_atomic_helper_commit+0x112/0x140 [drm_kms_helper]
> > 2024-07-19T20:17:35.449642+01:00 rhino kernel: [72604.750003] drm_atomic_commit+0x96/0xc0 [drm]
> > 2024-07-19T20:17:35.449646+01:00 rhino kernel: [72604.750063] ? drm_plane_get_damage_clips.cold+0x1c/0x1c [drm]
> > 2024-07-19T20:17:35.449647+01:00 rhino kernel: [72604.750117] drm_client_modeset_commit_atomic+0x206/0x250 [drm]
> > 2024-07-19T20:17:35.449648+01:00 rhino kernel: [72604.750173] drm_client_modeset_commit_locked+0x56/0x160 [drm]
> > 2024-07-19T20:17:35.449649+01:00 rhino kernel: [72604.750226] ? drm_connector_list_iter_end+0x38/0x50 [drm]
> > 2024-07-19T20:17:35.449650+01:00 rhino kernel: [72604.750283] drm_client_modeset_commit+0x21/0x40 [drm]
> > 2024-07-19T20:17:35.449651+01:00 rhino kernel: [72604.750336] drm_fb_helper_set_par+0x9e/0xe0 [drm_kms_helper]
> > 2024-07-19T20:17:35.449662+01:00 rhino kernel: [72604.750363] drm_fb_helper_hotplug_event+0xc1/0xe0 [drm_kms_helper]
> > 2024-07-19T20:17:35.449664+01:00 rhino kernel: [72604.750389] drm_client_dev_hotplug+0x64/0xb0 [drm]
> > 2024-07-19T20:17:35.449665+01:00 rhino kernel: [72604.750442] handle_hpd_irq_helper+0x159/0x170 [amdgpu]
> > 2024-07-19T20:17:35.449666+01:00 rhino kernel: [72604.750881] process_one_work+0x1c7/0x380
> > 2024-07-19T20:17:35.449667+01:00 rhino kernel: [72604.750889] worker_thread+0x4d/0x380
> > 2024-07-19T20:17:35.449668+01:00 rhino kernel: [72604.750896] ? rescuer_thread+0x3a0/0x3a0
> > 2024-07-19T20:17:35.449669+01:00 rhino kernel: [72604.750901] kthread+0xda/0x100
> > 2024-07-19T20:17:35.449670+01:00 rhino kernel: [72604.750906] ? kthread_complete_and_exit+0x20/0x20
> > 2024-07-19T20:17:35.449671+01:00 rhino kernel: [72604.750911] ret_from_fork+0x22/0x30
> > 2024-07-19T20:17:35.449672+01:00 rhino kernel: [72604.750920] </TASK>
> > 2024-07-19T20:17:35.449673+01:00 rhino kernel: [72604.750921] ---[ end trace 0000000000000000 ]---
> > 2024-07-19T20:17:35.562060+01:00 rhino kernel: [72604.861995] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 00000000b7629c18 !
> > 2024-07-19T20:17:35.562077+01:00 rhino kernel: [72604.862471] ------------[ cut here ]------------
> > 2024-07-19T20:17:35.562078+01:00 rhino kernel: [72604.862472] amdgpu 0000:0b:00.0: Dirty helper failed: ret=-22
> > 2024-07-19T20:17:35.562079+01:00 rhino kernel: [72604.862510] WARNING: CPU: 1 PID: 421866 at drivers/gpu/drm/drm_fb_helper.c:477 drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2024-07-19T20:17:35.562081+01:00 rhino kernel: [72604.862540] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace sunrpc quota_v2 quota_tree binfmt_misc intel_rapl_msr nls_ascii nls_cp437 intel_rapl_common vfat snd_hda_codec_realtek fat stv6111 btusb snd_hda_codec_generic btrtl lnbh25 edac_mce_amd ledtrig_audio btbcm btintel snd_hda_codec_hdmi kvm_amd amdgpu snd_hda_intel btmtk stv0910 iwlmvm kvm snd_intel_dspcfg snd_intel_sdw_acpi bluetooth irqbypass snd_hda_codec mac80211 gpu_sched drm_buddy jitterentropy_rng ghash_clmulni_intel snd_hda_core libarc4 drm_display_helper sha256_ssse3 sha512_ssse3 snd_hwdep sha1_ssse3 sha512_generic cec snd_pcm drbg iwlwifi rc_core ansi_cprng drm_ttm_helper snd_timer aesni_intel ttm snd ddbridge crypto_simd cryptd soundcore
> > 2024-07-19T20:17:35.562082+01:00 rhino kernel: [72604.862620] drm_kms_helper ecdh_generic dvb_core ecc mc rapl cfg80211 ccp wmi_bmof pcspkr sp5100_tco k10temp sg rfkill evdev acpi_cpufreq button wireguard libchacha20poly1305 chacha_x86_64 poly1305_x86_64 curve25519_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel softdog watchdog nct6775 drm nct6775_core hwmon_vid dm_mod fuse loop efi_pstore configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic md_mod sd_mod t10_pi crc64_rocksoft crc64 crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common xhci_pci ahci libahci crc32_pclmul crc32c_intel xhci_hcd libata igb usbcore scsi_mod i2c_algo_bit dca i2c_piix4 scsi_common usb_common video wmi gpio_amdpt gpio_generic
> > 2024-07-19T20:17:35.562084+01:00 rhino kernel: [72604.862705] CPU: 1 PID: 421866 Comm: kworker/1:3 Tainted: G W 6.1.0-23-amd64 #1 Debian 6.1.99-1
> > 2024-07-19T20:17:35.562085+01:00 rhino kernel: [72604.862710] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Gaming-ITX/ac, BIOS P3.40 07/17/2019
> > 2024-07-19T20:17:35.562087+01:00 rhino kernel: [72604.862712] Workqueue: events drm_fb_helper_damage_work [drm_kms_helper]
> > 2024-07-19T20:17:35.562089+01:00 rhino kernel: [72604.862739] RIP: 0010:drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2024-07-19T20:17:35.562101+01:00 rhino kernel: [72604.862765] Code: 01 48 8b 78 08 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 6c cf 1c df 89 e9 4c 89 e2 48 c7 c7 b8 76 d2 c0 48 89 c6 e8 c8 84 b8 de <0f> 0b e9 ec fe ff ff 0f b6 44 24 38 4d 8b 67 90 31 f6 0f b7 54 24
> > 2024-07-19T20:17:35.562103+01:00 rhino kernel: [72604.862768] RSP: 0018:ffffbeb1d3a1fe18 EFLAGS: 00010286
> > 2024-07-19T20:17:35.562104+01:00 rhino kernel: [72604.862771] RAX: 0000000000000000 RBX: ffff962e836e30cc RCX: 0000000000000027
> > 2024-07-19T20:17:35.562105+01:00 rhino kernel: [72604.862774] RDX: ffff962f96a603a8 RSI: 0000000000000001 RDI: ffff962f96a603a0
> > 2024-07-19T20:17:35.562106+01:00 rhino kernel: [72604.862776] RBP: 00000000ffffffea R08: 0000000000000000 R09: ffffbeb1d3a1fc90
> > 2024-07-19T20:17:35.562107+01:00 rhino kernel: [72604.862778] R10: 0000000000000003 R11: ffff962f9f2c18a8 R12: ffff962e80ebfa10
> > 2024-07-19T20:17:35.562108+01:00 rhino kernel: [72604.862780] R13: ffffbeb1c2f6e1a0 R14: ffffbeb1cf2b21a0 R15: ffff962e836e30d0
> > 2024-07-19T20:17:35.562109+01:00 rhino kernel: [72604.862782] FS: 0000000000000000(0000) GS:ffff962f96a40000(0000) knlGS:0000000000000000
> > 2024-07-19T20:17:35.562110+01:00 rhino kernel: [72604.862785] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > 2024-07-19T20:17:35.562111+01:00 rhino kernel: [72604.862787] CR2: 00007ffcd7bfc0f8 CR3: 00000001baea8000 CR4: 00000000003506e0
> > 2024-07-19T20:17:35.562112+01:00 rhino kernel: [72604.862790] Call Trace:
> > 2024-07-19T20:17:35.562113+01:00 rhino kernel: [72604.862793] <TASK>
> > 2024-07-19T20:17:35.562114+01:00 rhino kernel: [72604.862796] ? __warn+0x7d/0xc0
> > 2024-07-19T20:17:35.562114+01:00 rhino kernel: [72604.862801] ? drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2024-07-19T20:17:35.562116+01:00 rhino kernel: [72604.862828] ? report_bug+0xe2/0x150
> > 2024-07-19T20:17:35.562117+01:00 rhino kernel: [72604.862834] ? handle_bug+0x41/0x70
> > 2024-07-19T20:17:35.562118+01:00 rhino kernel: [72604.862839] ? exc_invalid_op+0x13/0x60
> > 2024-07-19T20:17:35.562119+01:00 rhino kernel: [72604.862842] ? asm_exc_invalid_op+0x16/0x20
> > 2024-07-19T20:17:35.562120+01:00 rhino kernel: [72604.862848] ? drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2024-07-19T20:17:35.562121+01:00 rhino kernel: [72604.862874] ? drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2024-07-19T20:17:35.562122+01:00 rhino kernel: [72604.862900] process_one_work+0x1c7/0x380
> > 2024-07-19T20:17:35.562123+01:00 rhino kernel: [72604.862907] worker_thread+0x4d/0x380
> > 2024-07-19T20:17:35.562124+01:00 rhino kernel: [72604.862913] ? rescuer_thread+0x3a0/0x3a0
> > 2024-07-19T20:17:35.562125+01:00 rhino kernel: [72604.862918] kthread+0xda/0x100
> > 2024-07-19T20:17:35.562126+01:00 rhino kernel: [72604.862922] ? kthread_complete_and_exit+0x20/0x20
> > 2024-07-19T20:17:35.562127+01:00 rhino kernel: [72604.862927] ret_from_fork+0x22/0x30
> > 2024-07-19T20:17:35.562128+01:00 rhino kernel: [72604.862936] </TASK>
> > 2024-07-19T20:17:35.562129+01:00 rhino kernel: [72604.862937] ---[ end trace 0000000000000000 ]---
> > 2024-07-19T20:17:35.562130+01:00 rhino kernel: [72604.863021] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 00000000b7629c18 !
> > 2024-07-19T20:17:35.562131+01:00 rhino kernel: [72604.863567] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 00000000b7629c18 !
> > 
> > Logs from second time I caught the issue:
> > 
> > 2025-05-05T10:07:38.013337+01:00 rhino kernel: [257606.669421] ------------[ cut here ]------------
> > 2025-05-05T10:07:38.013356+01:00 rhino kernel: [257606.669426] WARNING: CPU: 1 PID: 140 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc.c:3075 dc_update_planes_and_stream+0x342/0x880 [amdgpu]
> > 2025-05-05T10:07:38.013360+01:00 rhino kernel: [257606.669871] Modules linked in: nf_conntrack_netlink xt_nat xt_tcpudp veth xt_conntrack bridge stp llc xt_set ip_set xt_addrtype xfrm_user xfrm_algo nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace quota_v2 quota_tree sunrpc binfmt_misc nls_ascii nls_cp437 vfat fat intel_rapl_msr intel_rapl_common edac_mce_amd amdgpu kvm_amd stv6111 btusb lnbh25 btrtl btbcm kvm iwlmvm btintel irqbypass stv0910 snd_hda_codec_realtek btmtk gpu_sched snd_hda_codec_generic mac80211 drm_buddy snd_hda_codec_hdmi ghash_clmulni_intel ledtrig_audio bluetooth drm_display_helper sha256_ssse3 snd_hda_intel libarc4 snd_intel_dspcfg sha1_ssse3 cec snd_intel_sdw_acpi rc_core jitterentropy_rng snd_hda_codec drm_ttm_helper snd_hda_core sha512_ssse3 iwlwifi snd_hwdep ttm sha512_generic aesni_intel snd_pcm ddbridge ctr crypto_simd snd_timer snd drm_kms_helper drbg cryptd soundcore
> > 2025-05-05T10:07:38.013477+01:00 rhino kernel: [257606.669958] dvb_core ansi_cprng cfg80211 ecdh_generic ecc mc wmi_bmof rapl k10temp ccp sp5100_tco pcspkr rfkill evdev sg acpi_cpufreq button wireguard libchacha20poly1305 chacha_x86_64 poly1305_x86_64 curve25519_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel softdog watchdog nct6775 nct6775_core hwmon_vid drm fuse dm_mod efi_pstore configfs loop zram zsmalloc ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic md_mod sd_mod t10_pi crc64_rocksoft crc64 crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common crc32_pclmul crc32c_intel ahci xhci_pci libahci i2c_piix4 xhci_hcd libata igb usbcore scsi_mod i2c_algo_bit dca usb_common scsi_common video wmi gpio_amdpt gpio_generic
> > 2025-05-05T10:07:38.013479+01:00 rhino kernel: [257606.670049] CPU: 1 PID: 140 Comm: kworker/1:1H Not tainted 6.1.0-34-amd64 #1 Debian 6.1.135-1
> > 2025-05-05T10:07:38.013481+01:00 rhino kernel: [257606.670055] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Gaming-ITX/ac, BIOS P3.40 07/17/2019
> > 2025-05-05T10:07:38.013482+01:00 rhino kernel: [257606.670057] Workqueue: events_highpri dm_irq_work_func [amdgpu]
> > 2025-05-05T10:07:38.013483+01:00 rhino kernel: [257606.670492] RIP: 0010:dc_update_planes_and_stream+0x342/0x880 [amdgpu]
> > 2025-05-05T10:07:38.013484+01:00 rhino kernel: [257606.670920] Code: 48 2b 14 25 28 00 00 00 0f 85 4a 05 00 00 48 83 c4 58 5b 5d 41 5c 41 5d 41 5e 41 5f e9 87 23 ef ca 45 85 ed 0f 84 51 fe ff ff <0f> 0b 31 c0 eb ca 8b 93 50 06 00 00 83 fa 01 0f 84 68 fe ff ff 48
> > 2025-05-05T10:07:38.013485+01:00 rhino kernel: [257606.670923] RSP: 0018:ffffb931c05a7800 EFLAGS: 00010202
> > 2025-05-05T10:07:38.013498+01:00 rhino kernel: [257606.670927] RAX: 0000000000000000 RBX: ffff8a8dc8627800 RCX: 0000000000000000
> > 2025-05-05T10:07:38.013499+01:00 rhino kernel: [257606.670931] RDX: 0000000000000000 RSI: ffff8a8dce640000 RDI: ffff8a8dc8627800
> > 2025-05-05T10:07:38.013500+01:00 rhino kernel: [257606.670933] RBP: ffffb931c05a7c68 R08: 0000000000000000 R09: 0000000000000004
> > 2025-05-05T10:07:38.013511+01:00 rhino kernel: [257606.670935] R10: 0000000000000002 R11: 0000000000000001 R12: ffff8a8dc72e0000
> > 2025-05-05T10:07:38.013512+01:00 rhino kernel: [257606.670937] R13: 0000000000000001 R14: ffff8a8dc8627800 R15: ffff8a8d10458000
> > 2025-05-05T10:07:38.013513+01:00 rhino kernel: [257606.670940] FS: 0000000000000000(0000) GS:ffff8a8ed6a40000(0000) knlGS:0000000000000000
> > 2025-05-05T10:07:38.013514+01:00 rhino kernel: [257606.670943] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > 2025-05-05T10:07:38.013515+01:00 rhino kernel: [257606.670945] CR2: 00007f0207068ff8 CR3: 00000001dacbe000 CR4: 00000000003506e0
> > 2025-05-05T10:07:38.013516+01:00 rhino kernel: [257606.670949] Call Trace:
> > 2025-05-05T10:07:38.013518+01:00 rhino kernel: [257606.670953] <TASK>
> > 2025-05-05T10:07:38.013519+01:00 rhino kernel: [257606.670955] ? __mutex_remove_waiter+0x15/0x60
> > 2025-05-05T10:07:38.013520+01:00 rhino kernel: [257606.670967] amdgpu_dm_atomic_commit_tail+0x1975/0x3700 [amdgpu]
> > 2025-05-05T10:07:38.013529+01:00 rhino kernel: [257606.671418] commit_tail+0x94/0x130 [drm_kms_helper]
> > 2025-05-05T10:07:38.013530+01:00 rhino kernel: [257606.671447] drm_atomic_helper_commit+0x112/0x140 [drm_kms_helper]
> > 2025-05-05T10:07:38.013531+01:00 rhino kernel: [257606.671474] drm_atomic_commit+0x96/0xc0 [drm]
> > 2025-05-05T10:07:38.013543+01:00 rhino kernel: [257606.671533] ? drm_plane_get_damage_clips.cold+0x1c/0x1c [drm]
> > 2025-05-05T10:07:38.013544+01:00 rhino kernel: [257606.671586] drm_client_modeset_commit_atomic+0x206/0x250 [drm]
> > 2025-05-05T10:07:38.013545+01:00 rhino kernel: [257606.671642] drm_client_modeset_commit_locked+0x56/0x160 [drm]
> > 2025-05-05T10:07:38.013546+01:00 rhino kernel: [257606.671694] ? drm_connector_list_iter_end+0x38/0x50 [drm]
> > 2025-05-05T10:07:38.013547+01:00 rhino kernel: [257606.671751] drm_client_modeset_commit+0x21/0x40 [drm]
> > 2025-05-05T10:07:38.013548+01:00 rhino kernel: [257606.671803] drm_fb_helper_set_par+0x9e/0xe0 [drm_kms_helper]
> > 2025-05-05T10:07:38.013549+01:00 rhino kernel: [257606.671829] drm_fb_helper_hotplug_event+0xc1/0xe0 [drm_kms_helper]
> > 2025-05-05T10:07:38.013550+01:00 rhino kernel: [257606.671855] drm_client_dev_hotplug+0x64/0xb0 [drm]
> > 2025-05-05T10:07:38.013551+01:00 rhino kernel: [257606.671907] handle_hpd_irq_helper+0x159/0x170 [amdgpu]
> > 2025-05-05T10:07:38.013551+01:00 rhino kernel: [257606.672352] process_one_work+0x1c7/0x380
> > 2025-05-05T10:07:38.013564+01:00 rhino kernel: [257606.672358] worker_thread+0x4d/0x380
> > 2025-05-05T10:07:38.013565+01:00 rhino kernel: [257606.672363] ? rescuer_thread+0x3a0/0x3a0
> > 2025-05-05T10:07:38.013566+01:00 rhino kernel: [257606.672366] kthread+0xda/0x100
> > 2025-05-05T10:07:38.013567+01:00 rhino kernel: [257606.672372] ? kthread_complete_and_exit+0x20/0x20
> > 2025-05-05T10:07:38.013568+01:00 rhino kernel: [257606.672377] ret_from_fork+0x22/0x30
> > 2025-05-05T10:07:38.013569+01:00 rhino kernel: [257606.672387] </TASK>
> > 2025-05-05T10:07:38.013570+01:00 rhino kernel: [257606.672389] ---[ end trace 0000000000000000 ]---
> > 2025-05-05T10:07:38.129410+01:00 rhino kernel: [257606.784267] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129428+01:00 rhino kernel: [257606.784751] ------------[ cut here ]------------
> > 2025-05-05T10:07:38.129429+01:00 rhino kernel: [257606.784752] amdgpu 0000:0b:00.0: Dirty helper failed: ret=-22
> > 2025-05-05T10:07:38.129430+01:00 rhino kernel: [257606.784794] WARNING: CPU: 1 PID: 2162593 at drivers/gpu/drm/drm_fb_helper.c:477 drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2025-05-05T10:07:38.129447+01:00 rhino kernel: [257606.784824] Modules linked in: nf_conntrack_netlink xt_nat xt_tcpudp veth xt_conntrack bridge stp llc xt_set ip_set xt_addrtype xfrm_user xfrm_algo nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables nfnetlink overlay cpufreq_powersave cpufreq_ondemand cpufreq_conservative cpufreq_userspace quota_v2 quota_tree sunrpc binfmt_misc nls_ascii nls_cp437 vfat fat intel_rapl_msr intel_rapl_common edac_mce_amd amdgpu kvm_amd stv6111 btusb lnbh25 btrtl btbcm kvm iwlmvm btintel irqbypass stv0910 snd_hda_codec_realtek btmtk gpu_sched snd_hda_codec_generic mac80211 drm_buddy snd_hda_codec_hdmi ghash_clmulni_intel ledtrig_audio bluetooth drm_display_helper sha256_ssse3 snd_hda_intel libarc4 snd_intel_dspcfg sha1_ssse3 cec snd_intel_sdw_acpi rc_core jitterentropy_rng snd_hda_codec drm_ttm_helper snd_hda_core sha512_ssse3 iwlwifi snd_hwdep ttm sha512_generic aesni_intel snd_pcm ddbridge ctr crypto_simd snd_timer snd drm_kms_helper drbg cryptd soundcore
> > 2025-05-05T10:07:38.129450+01:00 rhino kernel: [257606.784904] dvb_core ansi_cprng cfg80211 ecdh_generic ecc mc wmi_bmof rapl k10temp ccp sp5100_tco pcspkr rfkill evdev sg acpi_cpufreq button wireguard libchacha20poly1305 chacha_x86_64 poly1305_x86_64 curve25519_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel softdog watchdog nct6775 nct6775_core hwmon_vid drm fuse dm_mod efi_pstore configfs loop zram zsmalloc ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic md_mod sd_mod t10_pi crc64_rocksoft crc64 crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common crc32_pclmul crc32c_intel ahci xhci_pci libahci i2c_piix4 xhci_hcd libata igb usbcore scsi_mod i2c_algo_bit dca usb_common scsi_common video wmi gpio_amdpt gpio_generic
> > 2025-05-05T10:07:38.129451+01:00 rhino kernel: [257606.784995] CPU: 1 PID: 2162593 Comm: kworker/1:2 Tainted: G W 6.1.0-34-amd64 #1 Debian 6.1.135-1
> > 2025-05-05T10:07:38.129453+01:00 rhino kernel: [257606.785001] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B450 Gaming-ITX/ac, BIOS P3.40 07/17/2019
> > 2025-05-05T10:07:38.129454+01:00 rhino kernel: [257606.785003] Workqueue: events drm_fb_helper_damage_work [drm_kms_helper]
> > 2025-05-05T10:07:38.129455+01:00 rhino kernel: [257606.785030] RIP: 0010:drm_fb_helper_damage_work+0x208/0x3a0 [drm_kms_helper]
> > 2025-05-05T10:07:38.129456+01:00 rhino kernel: [257606.785056] Code: 01 48 8b 78 08 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 9c 4a 6c cb 89 e9 4c 89 e2 48 c7 c7 b8 36 03 c1 48 89 c6 e8 38 d1 07 cb <0f> 0b e9 ec fe ff ff 0f b6 44 24 38 4d 8b 67 90 31 f6 0f b7 54 24
> > 2025-05-05T10:07:38.129457+01:00 rhino kernel: [257606.785059] RSP: 0018:ffffb931d88fbe18 EFLAGS: 00010286
> > 2025-05-05T10:07:38.129458+01:00 rhino kernel: [257606.785063] RAX: 0000000000000000 RBX: ffff8a8dcaa840cc RCX: 0000000000000027
> > 2025-05-05T10:07:38.129459+01:00 rhino kernel: [257606.785065] RDX: ffff8a8ed6a603e8 RSI: 0000000000000001 RDI: ffff8a8ed6a603e0
> > 2025-05-05T10:07:38.129460+01:00 rhino kernel: [257606.785068] RBP: 00000000ffffffea R08: 0000000000000000 R09: ffffb931d88fbc90
> > 2025-05-05T10:07:38.129461+01:00 rhino kernel: [257606.785070] R10: 0000000000000003 R11: ffff8a8edf2c18a8 R12: ffff8a8dc0eeb800
> > 2025-05-05T10:07:38.129462+01:00 rhino kernel: [257606.785072] R13: ffffb931c348b1a0 R14: ffffb931cf2b21a0 R15: ffff8a8dcaa840d0
> > 2025-05-05T10:07:38.129463+01:00 rhino kernel: [257606.785074] FS: 0000000000000000(0000) GS:ffff8a8ed6a40000(0000) knlGS:0000000000000000
> > 2025-05-05T10:07:38.129464+01:00 rhino kernel: [257606.785077] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > 2025-05-05T10:07:38.129465+01:00 rhino kernel: [257606.785079] CR2: 000055e676413058 CR3: 000000011a33c000 CR4: 00000000003506e0
> > 2025-05-05T10:07:38.129466+01:00 rhino kernel: [257606.785083] Call Trace:
> > 2025-05-05T10:07:38.129467+01:00 rhino kernel: [257606.785086] <TASK>
> > 2025-05-05T10:07:38.129468+01:00 rhino kernel: [257606.785092] process_one_work+0x1c7/0x380
> > 2025-05-05T10:07:38.129469+01:00 rhino kernel: [257606.785100] worker_thread+0x4d/0x380
> > 2025-05-05T10:07:38.129469+01:00 rhino kernel: [257606.785104] ? rescuer_thread+0x3a0/0x3a0
> > 2025-05-05T10:07:38.129470+01:00 rhino kernel: [257606.785108] kthread+0xda/0x100
> > 2025-05-05T10:07:38.129471+01:00 rhino kernel: [257606.785113] ? kthread_complete_and_exit+0x20/0x20
> > 2025-05-05T10:07:38.129472+01:00 rhino kernel: [257606.785118] ret_from_fork+0x22/0x30
> > 2025-05-05T10:07:38.129473+01:00 rhino kernel: [257606.785128] </TASK>
> > 2025-05-05T10:07:38.129474+01:00 rhino kernel: [257606.785129] ---[ end trace 0000000000000000 ]---
> > 2025-05-05T10:07:38.129475+01:00 rhino kernel: [257606.785216] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129477+01:00 rhino kernel: [257606.785757] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129478+01:00 rhino kernel: [257606.786302] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129509+01:00 rhino kernel: [257606.786856] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129510+01:00 rhino kernel: [257606.787419] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.129511+01:00 rhino kernel: [257606.787992] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133116+01:00 rhino kernel: [257606.788635] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133129+01:00 rhino kernel: [257606.789244] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133130+01:00 rhino kernel: [257606.789849] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133131+01:00 rhino kernel: [257606.790462] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133131+01:00 rhino kernel: [257606.791092] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.133132+01:00 rhino kernel: [257606.791734] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137187+01:00 rhino kernel: [257606.792417] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137196+01:00 rhino kernel: [257606.793078] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137198+01:00 rhino kernel: [257606.793749] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137200+01:00 rhino kernel: [257606.794429] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137200+01:00 rhino kernel: [257606.795118] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.137201+01:00 rhino kernel: [257606.795816] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.141123+01:00 rhino kernel: [257606.796553] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.141134+01:00 rhino kernel: [257606.797285] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.141135+01:00 rhino kernel: [257606.798020] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> > 2025-05-05T10:07:38.141136+01:00 rhino kernel: [257606.798765] [drm:dc_add_plane_to_context [amdgpu]] ERROR Head pipe not found for stream_state 0000000099fd0230 !
> 
> 
> I assume you cannot more specifically say when you saw the problem
> first appearing in the 6.1.y series?
> 
> Would you be able to test newer stable series as well (ideally 6.12.y
> as will be shipped in trixie or mainline kernel?)
> 
> I have searched the upstream issues in
> https://gitlab.freedesktop.org/drm/amd/-/issues and did not find
> something directly matching your report. Could you please report it
> upstream and report the upstream issue back here so we can link
> those?
> 
> Since you own the hardware, that would help speed up the
> debugging by having you directly interacting with upstream on the
> matter. Can you do that?
> 
> Regards,
> Salvatore

