Bug#969160: linux-image-5.7.0-2-amd64: amdgpu_dm_atomic_commit_tail issue freezes X



Package: src:linux
Version: 5.7.10-1
Severity: important

amdgpu crashes, leaving X unresponsive. This appears to be a race condition that is already fixed upstream in kernels 5.8.0, 5.7.13 and 5.4.56:
https://bugzilla.kernel.org/show_bug.cgi?id=207383
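
To check whether a given kernel already carries the fix, the version can be compared against the fixed releases named above. The following is only a minimal sketch: `parse_upstream` and `has_fix` are hypothetical helpers, not part of any Debian or kernel tooling, and they assume a plain numeric upstream version such as 5.7.10 (no "~rc" suffixes). For series other than 5.4 and 5.7 that predate 5.8 the function returns False simply because this report names no fixed release for them.

```python
# Hypothetical helper; the fixed releases are the ones named in this report.
FIXED_STABLE = {(5, 4): 56, (5, 7): 13}   # series -> first fixed point release
FIRST_FIXED_MAINLINE = (5, 8)             # 5.8.0 and later carry the fix


def parse_upstream(version):
    """Turn '5.7.10-1' or '5.7.10' into the tuple (5, 7, 10)."""
    upstream = version.split("-", 1)[0]    # drop a Debian revision, if any
    parts = [int(p) for p in upstream.split(".")]
    while len(parts) < 3:                  # pad '5.8' to (5, 8, 0)
        parts.append(0)
    return tuple(parts[:3])


def has_fix(version):
    """True if the version is at or past a release named as fixed."""
    major, minor, patch = parse_upstream(version)
    if (major, minor) >= FIRST_FIXED_MAINLINE:
        return True
    needed = FIXED_STABLE.get((major, minor))
    return needed is not None and patch >= needed


if __name__ == "__main__":
    for v in ("5.7.10-1", "5.7.13", "5.4.56", "5.8.0"):
        print(v, has_fix(v))
```

The package version from this report, 5.7.10-1, falls before 5.7.13 and is therefore reported as not fixed.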

It can often be reproduced by waking the monitors after the screensaver has switched them off.

I'm using a multi-monitor setup with this GPU:
0a:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] (rev c4)

-- Package-specific info:
** Version:
Linux version 5.7.0-2-amd64 (debian-kernel@lists.debian.org) (gcc version 9.3.0 (Debian 9.3.0-16), GNU ld (GNU Binutils for Debian) 2.35) #1 SMP Debian 5.7.10-1 (2020-07-26)

** Command line:
BOOT_IMAGE=/vmlinuz-5.7.0-2-amd64 root=UUID=70f13b00-5a54-4309-887a-333da8417dd7 ro acpi_enforce_resources=lax splash

** Tainted: W (512)
 * kernel issued warning

** Kernel log:
Aug 26 17:26:38 localhost kernel: [12803.982534] show_signal_msg: 17 callbacks suppressed
Aug 26 17:26:38 localhost kernel: [12803.982537] kscreenlocker_g[27883]: segfault at 58 ip 00007fddb372a9e9 sp 00007ffc4b806000 error 4 in libQt5Core.so.5.14.2[7fddb34e8000+2ef000]
Aug 26 17:26:38 localhost kernel: [12803.982545] Code: 58 48 39 d0 0f 85 57 02 00 00 b8 01 00 00 00 e9 34 fd ff ff 48 8b 74 24 38 48 8b 46 08 48 85 c0 74 0d 48 8b 46 08 8b 74 24 40 <3b> 70 08 7c 32 48 8d 7c 24 60 e8 48 71 00 00 48 8b 7c 24 08 b8 01
Aug 26 17:26:39 localhost kernel: [12804.947817] ------------[ cut here ]------------
Aug 26 17:26:39 localhost kernel: [12804.947921] WARNING: CPU: 4 PID: 29465 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_hwseq.c:109 dcn20_setup_gsl_group_as_lock+0x7f/0x200 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.947923] Modules linked in: fuse xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_counter nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink tun bridge stp llc binfmt_misc snd_hda_codec_realtek snd_hda_codec_generic amd64_edac_mod edac_mce_amd ledtrig_audio kvm_amd snd_hda_codec_hdmi kvm eeepc_wmi snd_hda_intel ax88179_178a asus_wmi snd_intel_dspcfg usbnet battery sparse_keymap irqbypass sg mii joydev snd_hda_codec rfkill snd_hda_core video pcspkr wmi_bmof snd_hwdep sp5100_tco snd_pcm k10temp watchdog snd_timer ccp snd rng_core soundcore evdev acpi_cpufreq ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nct6775 hwmon_vid r8169 realtek libphy parport_pc ppdev lp parport ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 dm_crypt dm_mod raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid0 multipath
Aug 26 17:26:39 localhost kernel: [12804.947953]  linear radeon raid1 md_mod sr_mod cdrom hid_generic usbhid hid sd_mod amdgpu gpu_sched i2c_algo_bit ttm crc32_pclmul crc32c_intel drm_kms_helper ghash_clmulni_intel cec xhci_pci drm xhci_hcd ahci aesni_intel libahci libata libaes crypto_simd cryptd glue_helper usbcore nvme scsi_mod nvme_core i2c_piix4 t10_pi crc_t10dif crct10dif_generic mfd_core crct10dif_pclmul usb_common crct10dif_common wmi button
Aug 26 17:26:39 localhost kernel: [12804.947972] CPU: 4 PID: 29465 Comm: kworker/u64:5 Not tainted 5.7.0-2-amd64 #1 Debian 5.7.10-1
Aug 26 17:26:39 localhost kernel: [12804.947974] Hardware name: System manufacturer System Product Name/TUF GAMING X570-PLUS, BIOS 1407 04/01/2020
Aug 26 17:26:39 localhost kernel: [12804.947984] Workqueue: events_unbound commit_work [drm_kms_helper]
Aug 26 17:26:39 localhost kernel: [12804.948076] RIP: 0010:dcn20_setup_gsl_group_as_lock+0x7f/0x200 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948078] Code: 84 d2 74 47 84 c0 75 26 48 8b 87 30 03 00 00 0f b6 80 70 02 00 00 a8 01 0f 84 dc 00 00 00 a8 02 74 55 a8 04 0f 84 f7 00 00 00 <0f> 0b 0f 0b 48 8b 44 24 28 65 48 33 04 25 28 00 00 00 0f 85 59 01
Aug 26 17:26:39 localhost kernel: [12804.948079] RSP: 0018:ffffa67a4828fa48 EFLAGS: 00010202
Aug 26 17:26:39 localhost kernel: [12804.948081] RAX: 0000000000000007 RBX: ffff964871ce0b08 RCX: 0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948082] RDX: 0000000000000001 RSI: ffff964871ce0b08 RDI: ffff9649a8d70000
Aug 26 17:26:39 localhost kernel: [12804.948083] RBP: 0000000000000001 R08: ffffa67a4828fa44 R09: 0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948084] R10: 0000000000000000 R11: ffffa67a4828fab4 R12: 0000000000000001
Aug 26 17:26:39 localhost kernel: [12804.948085] R13: ffff9649a8d70000 R14: ffff9648b7980000 R15: ffff964927614400
Aug 26 17:26:39 localhost kernel: [12804.948087] FS:  0000000000000000(0000) GS:ffff9649beb00000(0000) knlGS:0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948088] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 26 17:26:39 localhost kernel: [12804.948089] CR2: 00007fa2dc0289e0 CR3: 0000000792882000 CR4: 0000000000340ee0
Aug 26 17:26:39 localhost kernel: [12804.948091] Call Trace:
Aug 26 17:26:39 localhost kernel: [12804.948181]  dcn20_pipe_control_lock+0x1ac/0x1c0 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948265]  dc_commit_updates_for_stream+0x12de/0x1930 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948339]  ? amdgpu_display_get_crtc_scanoutpos+0x85/0x190 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948441]  amdgpu_dm_atomic_commit_tail+0xb4c/0x1ff0 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948448]  ? load_balance+0x172/0xcf0
Aug 26 17:26:39 localhost kernel: [12804.948452]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948453]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948455]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948456]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948458]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948459]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948461]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948462]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948463]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948465]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948466]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948468]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948469]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948471]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948472]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948473]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948475]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948476]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948478]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948479]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948481]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948482]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948483]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948485]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948486]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948489]  ? _cond_resched+0x15/0x30
Aug 26 17:26:39 localhost kernel: [12804.948492]  ? wait_for_completion_timeout+0x36/0x100
Aug 26 17:26:39 localhost kernel: [12804.948493]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948495]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.948496]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.948507]  commit_tail+0x94/0x130 [drm_kms_helper]
Aug 26 17:26:39 localhost kernel: [12804.948510]  process_one_work+0x1b4/0x380
Aug 26 17:26:39 localhost kernel: [12804.948513]  worker_thread+0x50/0x3c0
Aug 26 17:26:39 localhost kernel: [12804.948516]  kthread+0xf9/0x130
Aug 26 17:26:39 localhost kernel: [12804.948518]  ? process_one_work+0x380/0x380
Aug 26 17:26:39 localhost kernel: [12804.948520]  ? kthread_park+0x90/0x90
Aug 26 17:26:39 localhost kernel: [12804.948522]  ret_from_fork+0x22/0x40
Aug 26 17:26:39 localhost kernel: [12804.948525] ---[ end trace 45bf1f99143b024d ]---
Aug 26 17:26:39 localhost kernel: [12804.948544] ------------[ cut here ]------------
Aug 26 17:26:39 localhost kernel: [12804.948644] WARNING: CPU: 4 PID: 29465 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_hwseq.c:127 dcn20_setup_gsl_group_as_lock+0x81/0x200 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948645] Modules linked in: fuse xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_counter nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink tun bridge stp llc binfmt_misc snd_hda_codec_realtek snd_hda_codec_generic amd64_edac_mod edac_mce_amd ledtrig_audio kvm_amd snd_hda_codec_hdmi kvm eeepc_wmi snd_hda_intel ax88179_178a asus_wmi snd_intel_dspcfg usbnet battery sparse_keymap irqbypass sg mii joydev snd_hda_codec rfkill snd_hda_core video pcspkr wmi_bmof snd_hwdep sp5100_tco snd_pcm k10temp watchdog snd_timer ccp snd rng_core soundcore evdev acpi_cpufreq ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi nct6775 hwmon_vid r8169 realtek libphy parport_pc ppdev lp parport ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 dm_crypt dm_mod raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid0 multipath
Aug 26 17:26:39 localhost kernel: [12804.948669]  linear radeon raid1 md_mod sr_mod cdrom hid_generic usbhid hid sd_mod amdgpu gpu_sched i2c_algo_bit ttm crc32_pclmul crc32c_intel drm_kms_helper ghash_clmulni_intel cec xhci_pci drm xhci_hcd ahci aesni_intel libahci libata libaes crypto_simd cryptd glue_helper usbcore nvme scsi_mod nvme_core i2c_piix4 t10_pi crc_t10dif crct10dif_generic mfd_core crct10dif_pclmul usb_common crct10dif_common wmi button
Aug 26 17:26:39 localhost kernel: [12804.948684] CPU: 4 PID: 29465 Comm: kworker/u64:5 Tainted: G        W         5.7.0-2-amd64 #1 Debian 5.7.10-1
Aug 26 17:26:39 localhost kernel: [12804.948685] Hardware name: System manufacturer System Product Name/TUF GAMING X570-PLUS, BIOS 1407 04/01/2020
Aug 26 17:26:39 localhost kernel: [12804.948696] Workqueue: events_unbound commit_work [drm_kms_helper]
Aug 26 17:26:39 localhost kernel: [12804.948794] RIP: 0010:dcn20_setup_gsl_group_as_lock+0x81/0x200 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.948796] Code: 74 47 84 c0 75 26 48 8b 87 30 03 00 00 0f b6 80 70 02 00 00 a8 01 0f 84 dc 00 00 00 a8 02 74 55 a8 04 0f 84 f7 00 00 00 0f 0b <0f> 0b 48 8b 44 24 28 65 48 33 04 25 28 00 00 00 0f 85 59 01 00 00
Aug 26 17:26:39 localhost kernel: [12804.948798] RSP: 0018:ffffa67a4828fa48 EFLAGS: 00010202
Aug 26 17:26:39 localhost kernel: [12804.948799] RAX: 0000000000000007 RBX: ffff964871ce0b08 RCX: 0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948800] RDX: 0000000000000001 RSI: ffff964871ce0b08 RDI: ffff9649a8d70000
Aug 26 17:26:39 localhost kernel: [12804.948801] RBP: 0000000000000001 R08: ffffa67a4828fa44 R09: 0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948803] R10: 0000000000000000 R11: ffffa67a4828fab4 R12: 0000000000000001
Aug 26 17:26:39 localhost kernel: [12804.948804] R13: ffff9649a8d70000 R14: ffff9648b7980000 R15: ffff964927614400
Aug 26 17:26:39 localhost kernel: [12804.948805] FS:  0000000000000000(0000) GS:ffff9649beb00000(0000) knlGS:0000000000000000
Aug 26 17:26:39 localhost kernel: [12804.948807] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 26 17:26:39 localhost kernel: [12804.948808] CR2: 00007fa2dc0289e0 CR3: 0000000792882000 CR4: 0000000000340ee0
Aug 26 17:26:39 localhost kernel: [12804.948809] Call Trace:
Aug 26 17:26:39 localhost kernel: [12804.948906]  dcn20_pipe_control_lock+0x1ac/0x1c0 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.949000]  dc_commit_updates_for_stream+0x12de/0x1930 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.949071]  ? amdgpu_display_get_crtc_scanoutpos+0x85/0x190 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.949170]  amdgpu_dm_atomic_commit_tail+0xb4c/0x1ff0 [amdgpu]
Aug 26 17:26:39 localhost kernel: [12804.949175]  ? load_balance+0x172/0xcf0
Aug 26 17:26:39 localhost kernel: [12804.949178]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949179]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949181]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949182]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949184]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949185]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949186]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949188]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949189]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949191]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949192]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949193]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949195]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949196]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949198]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949199]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949201]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949202]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949203]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949205]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949206]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949208]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949209]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949211]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949212]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949214]  ? _cond_resched+0x15/0x30
Aug 26 17:26:39 localhost kernel: [12804.949216]  ? wait_for_completion_timeout+0x36/0x100
Aug 26 17:26:39 localhost kernel: [12804.949218]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949219]  ? __switch_to_asm+0x40/0x70
Aug 26 17:26:39 localhost kernel: [12804.949221]  ? __switch_to_asm+0x34/0x70
Aug 26 17:26:39 localhost kernel: [12804.949231]  commit_tail+0x94/0x130 [drm_kms_helper]
Aug 26 17:26:39 localhost kernel: [12804.949233]  process_one_work+0x1b4/0x380
Aug 26 17:26:39 localhost kernel: [12804.949236]  worker_thread+0x50/0x3c0
Aug 26 17:26:39 localhost kernel: [12804.949239]  kthread+0xf9/0x130
Aug 26 17:26:39 localhost kernel: [12804.949240]  ? process_one_work+0x380/0x380
Aug 26 17:26:39 localhost kernel: [12804.949243]  ? kthread_park+0x90/0x90
Aug 26 17:26:39 localhost kernel: [12804.949245]  ret_from_fork+0x22/0x40
Aug 26 17:26:39 localhost kernel: [12804.949247] ---[ end trace 45bf1f99143b024e ]---

-- System Information:
Debian Release: bullseye/sid
  APT prefers testing
  APT policy: (500, 'testing'), (500, 'stable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 5.7.0-2-amd64 (SMP w/24 CPU threads)
Kernel taint flags: TAINT_WARN
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages linux-image-5.7.0-2-amd64 depends on:
ii  initramfs-tools [linux-initramfs-tool]  0.137
ii  kmod                                    27+20200310-2
ii  linux-base                              4.6

Versions of packages linux-image-5.7.0-2-amd64 recommends:
ii  apparmor             2.13.4-3
ii  firmware-linux-free  20200122-1

Versions of packages linux-image-5.7.0-2-amd64 suggests:
pn  debian-kernel-handbook  <none>
ii  grub-pc                 2.04-8
pn  linux-doc-5.7           <none>

Versions of packages linux-image-5.7.0-2-amd64 is related to:
ii  firmware-amd-graphics     20200619-1
pn  firmware-atheros          <none>
pn  firmware-bnx2             <none>
pn  firmware-bnx2x            <none>
pn  firmware-brcm80211        <none>
pn  firmware-cavium           <none>
pn  firmware-intel-sound      <none>
pn  firmware-intelwimax       <none>
pn  firmware-ipw2x00          <none>
pn  firmware-ivtv             <none>
pn  firmware-iwlwifi          <none>
pn  firmware-libertas         <none>
ii  firmware-linux-nonfree    20200619-1
ii  firmware-misc-nonfree     20200619-1
pn  firmware-myricom          <none>
pn  firmware-netxen           <none>
pn  firmware-qlogic           <none>
ii  firmware-realtek          20200619-1
pn  firmware-samsung          <none>
pn  firmware-siano            <none>
pn  firmware-ti-connectivity  <none>
pn  xen-hypervisor            <none>

-- no debconf information

