
Bug#1016548: linux-image-5.19.0-rc6-amd64: Corrupted cursor and broken resume with amdgpu driver with Radeon RX 6600



Package: linux-image-5.19.0-rc6-amd64
Version: 5.19~rc6-1~exp1
Severity: important
X-Debbugs-Cc: george.shuklin@gmail.com

Dear Maintainer,

After upgrading from 5.18.0-3-amd64 to 5.19.0-rc6-amd64, the video subsystem
started to misbehave:

* Visual artefacts following the cursor
* Hangs when X is active
* Nothing is shown on the DisplayPort monitor after resume (the system
otherwise keeps working, e.g. over ssh).

The following kernel trace was observed:

[  193.941836] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for
fences timed out!
[  199.061275] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for
fences timed out!
[  199.071242] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0
timeout, signaled seq=3956, emitted seq=3958
[  199.071340] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information:
process Xorg pid 2028 thread Xorg:cs0 pid 3062
[  199.071424] amdgpu 0000:0a:00.0: amdgpu: GPU reset begin!
[  199.491788] amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]]
*ERROR* ring kiq_2.1.0 test failed (-110)
[  199.491863] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[  199.767776] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[  199.781910] [drm] free PSP TMR buffer
[  199.827452] CPU: 11 PID: 81 Comm: kworker/u64:7 Not tainted 5.19.0-rc6-amd64
#1  Debian 5.19~rc6-1~exp1
[  199.827454] Hardware name: System manufacturer System Product Name/PRIME
X570-P, BIOS 3405 02/01/2021
[  199.827455] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
[  199.827460] Call Trace:
[  199.827461]  <TASK>
[  199.827463]  dump_stack_lvl+0x45/0x5e
[  199.827467]  amdgpu_do_asic_reset+0x28/0x434 [amdgpu]
[  199.827576]  amdgpu_device_gpu_recover_imp.cold+0x628/0x8fc [amdgpu]
[  199.827672]  amdgpu_job_timedout+0x153/0x190 [amdgpu]
[  199.827765]  drm_sched_job_timedout+0x76/0x110 [gpu_sched]
[  199.827767]  process_one_work+0x1e5/0x3b0
[  199.827770]  worker_thread+0x50/0x3a0
[  199.827772]  ? rescuer_thread+0x390/0x390
[  199.827773]  kthread+0xe8/0x110
[  199.827774]  ? kthread_complete_and_exit+0x20/0x20
[  199.827775]  ret_from_fork+0x22/0x30
[  199.827778]  </TASK>
[  199.827779] amdgpu 0000:0a:00.0: amdgpu: MODE1 reset
[  199.827781] amdgpu 0000:0a:00.0: amdgpu: GPU mode1 reset
[  199.827855] amdgpu 0000:0a:00.0: amdgpu: GPU smu mode1 reset
[  200.341240] amdgpu 0000:0a:00.0: amdgpu: GPU reset succeeded, trying to
resume
[  200.341406] [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
[  200.341430] [drm] VRAM is lost due to GPU reset!
[  200.341435] [drm] PSP is resuming...
...
[  200.897511] amdgpu 0000:0a:00.0: amdgpu: GPU reset(2) succeeded!
[  200.906284] amdgpu_cs_ioctl: 16 callbacks suppressed
[  200.906286] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
[  200.906954] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
[  200.907311] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize
parser -125!
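
For anyone comparing logs from other machines, here is a minimal sketch that
pulls the ring timeout details out of a saved kernel log. The function name,
the regular expression, and the "pending" figure (emitted minus signaled
sequence number) are my own illustration and not part of the amdgpu driver.
Reading the difference as the number of fences that never signaled is an
assumption. The pattern allows any whitespace before "timeout" so that it also
matches a log that mail wrapping has split across two lines, as above.

```python
import re

# Matches the amdgpu ring timeout message, for example:
#   "ring gfx_0.0.0 timeout, signaled seq=3956, emitted seq=3958"
# \s+ before "timeout" tolerates a line break inserted by mail wrapping.
RING_TIMEOUT = re.compile(
    r"ring (?P<ring>\S+)\s+timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def find_ring_timeouts(log_text):
    """Return (ring, signaled, emitted, pending) tuples found in log_text.

    "pending" is emitted - signaled; treating it as the number of
    outstanding fences is an assumption made for illustration.
    """
    results = []
    for match in RING_TIMEOUT.finditer(log_text):
        signaled = int(match.group("signaled"))
        emitted = int(match.group("emitted"))
        results.append((match.group("ring"), signaled, emitted, emitted - signaled))
    return results


# Sample taken from the trace in this report, including the wrapped line.
sample = (
    "[  199.071242] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0\n"
    "timeout, signaled seq=3956, emitted seq=3958\n"
)
print(find_ring_timeouts(sample))
```

The same function can be run over the full text of a saved dmesg or journal
dump to list every ring timeout it contains.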


Xorg log is clean (no errors observed).

Reverting to 5.18.0-3-amd64 fixes the problem.

GPU details: Radeon RX 6600.

lspci data:

0a:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi
23 [Radeon RX 6600/6600 XT/6600M] (rev c7) (prog-if 00 [VGA controller])
        Subsystem: Micro-Star International Co., Ltd. [MSI] Navi 23 [Radeon RX
6600/6600 XT/6600M]
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort+ <TAbort-
<MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 95
        IOMMU group: 23
        Region 0: Memory at 7c00000000 (64-bit, prefetchable) [size=8G]
        Region 2: Memory at 7e00000000 (64-bit, prefetchable) [size=256M]
        Region 4: I/O ports at e000 [size=256]
        Region 5: Memory at fcc00000 (32-bit, non-prefetchable) [size=1M]
        Expansion ROM at fcd00000 [disabled] [size=128K]



-- System Information:
Debian Release: bookworm/sid
  APT prefers testing
  APT policy: (900, 'testing'), (800, 'unstable'), (500, 'oldstable-updates'), (500, 'oldstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 5.18.0-3-amd64 (SMP w/12 CPU threads; PREEMPT)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US:en
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

