Bug#1016548: linux-image-5.19.0-rc6-amd64: Corrupted cursor and broken resume with amdgpu driver with Radeon RX 6600
Package: linux-image-5.19.0-rc6-amd64
Version: 5.19~rc6-1~exp1
Severity: important
X-Debbugs-Cc: george.shuklin@gmail.com
Dear Maintainer,
After upgrade from 5.18.0-3-amd64 to 5.19.0-rc6-amd64 video subsystem start
misbehave:
* Visual artefacts after cursor
* Hangs when X is active
* Unable to show anything after resume on DisplayPort monitor (but working
otherwise, like ssh).
Following kernel trace was observed:
[ 193.941836] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for
fences timed out!
[ 199.061275] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for
fences timed out!
[ 199.071242] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0
timeout, signaled seq=3956, emitted seq=3958
[ 199.071340] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information:
process Xorg pid 2028 thread Xorg:cs0 pid 3062
[ 199.071424] amdgpu 0000:0a:00.0: amdgpu: GPU reset begin!
[ 199.491788] amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]]
*ERROR* ring kiq_2.1.0 test failed (-110)
[ 199.491863] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[ 199.767776] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[ 199.781910] [drm] free PSP TMR buffer
[ 199.827452] CPU: 11 PID: 81 Comm: kworker/u64:7 Not tainted 5.19.0-rc6-amd64
#1 Debian 5.19~rc6-1~exp1
[ 199.827454] Hardware name: System manufacturer System Product Name/PRIME
X570-P, BIOS 3405 02/01/2021
[ 199.827455] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
[ 199.827460] Call Trace:
[ 199.827461] <TASK>
[ 199.827463] dump_stack_lvl+0x45/0x5e
[ 199.827467] amdgpu_do_asic_reset+0x28/0x434 [amdgpu]
[ 199.827576] amdgpu_device_gpu_recover_imp.cold+0x628/0x8fc [amdgpu]
[ 199.827672] amdgpu_job_timedout+0x153/0x190 [amdgpu]
[ 199.827765] drm_sched_job_timedout+0x76/0x110 [gpu_sched]
[ 199.827767] process_one_work+0x1e5/0x3b0
[ 199.827770] worker_thread+0x50/0x3a0
[ 199.827772] ? rescuer_thread+0x390/0x390
[ 199.827773] kthread+0xe8/0x110
[ 199.827774] ? kthread_complete_and_exit+0x20/0x20
[ 199.827775] ret_from_fork+0x22/0x30
[ 199.827778] </TASK>
[ 199.827779] amdgpu 0000:0a:00.0: amdgpu: MODE1 reset
[ 199.827781] amdgpu 0000:0a:00.0: amdgpu: GPU mode1 reset
[ 199.827855] amdgpu 0000:0a:00.0: amdgpu: GPU smu mode1 reset
[ 200.341240] amdgpu 0000:0a:00.0: amdgpu: GPU reset succeeded, trying to
resume
[ 200.341406] [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
[ 200.341430] [drm] VRAM is lost due to GPU reset!
[ 200.341435] [drm] PSP is resuming...
...
[ 200.897511] amdgpu 0000:0a:00.0: amdgpu: GPU reset(2) succeeded!
[ 200.906284] amdgpu_cs_ioctl: 16 callbacks suppressed
[ 200.906286] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
[ 200.906954] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
[ 200.907311] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
Xorg log is clean (no errors observed).
Reverting back to 5.18.0-3-amd64 fix the problem.
GPU details: Radeon 6600.
lspci data:
0a:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi
23 [Radeon RX 6600/6600 XT/6600M] (rev c7) (prog-if 00 [VGA controller])
Subsystem: Micro-Star International Co., Ltd. [MSI] Navi 23 [Radeon RX
6600/6600 XT/6600M]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort+ <TAbort-
<MAbort- >SERR- <PERR- INTx-
Latency: 0, Cache Line Size: 64 bytes
Interrupt: pin A routed to IRQ 95
IOMMU group: 23
Region 0: Memory at 7c00000000 (64-bit, prefetchable) [size=8G]
Region 2: Memory at 7e00000000 (64-bit, prefetchable) [size=256M]
Region 4: I/O ports at e000 [size=256]
Region 5: Memory at fcc00000 (32-bit, non-prefetchable) [size=1M]
Expansion ROM at fcd00000 [disabled] [size=128K]
-- System Information:
Debian Release: bookworm/sid
APT prefers testing
APT policy: (900, 'testing'), (800, 'unstable'), (500, 'oldstable-updates'), (500, 'oldstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386
Kernel: Linux 5.18.0-3-amd64 (SMP w/12 CPU threads; PREEMPT)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US:en
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled
Reply to: