[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#991453: linux-image-5.10.0-8-amd64: Radeon 6800 XT: 100% GPU core usage & 74 Watts when idle



> It's already backported to 5.10, just after 5.10.46 was released:
> https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=l
> inux-5.10.y&id=fea853aca3210c21dfcf07bb82d501b7fd1900a7

Just found out the reverted commit was introduced just before the 5.10.46 tag
was created, which should mean that any version before 5.10.46 should NOT
have this problem. The version uploaded to the Debian archive before that
was 5.10.40-1, which in my case was linux-image-5.10.0-7-amd64.
I no longer had that installed, so added the following line to /e/a/sources.list:
deb [check-valid-until=no] https://snapshot.debian.org/archive/debian/20210529T204006Z/ sid main

Installed linux-image-5.10.0-7-amd64 and rebooted into that ...

On zaterdag 24 juli 2021 14:22:38 CEST Diederik de Haas wrote:
> On zaterdag 24 juli 2021 01:29:55 CEST piorunz wrote:
> > GPU core works at 100% usage at all times, even at idle.
> > 
> > $ cat /sys/class/drm/card0/device/gpu_busy_percent
> > 99
> > 
> > $ sensors
> > (...)
> > amdgpu-pci-0900
> > Adapter: PCI adapter
> > vddgfx:        1.14 V
> > fan1:        1098 RPM  (min =    0 RPM, max = 3000 RPM)
> > edge:         +51.0°C  (crit = +100.0°C, hyst = -273.1°C)
> > 
> >                        (emerg = +105.0°C)
> > 
> > junction:     +55.0°C  (crit = +110.0°C, hyst = -273.1°C)
> > 
> >                        (emerg = +115.0°C)
> > 
> > mem:          +56.0°C  (crit = +100.0°C, hyst = -273.1°C)
> > 
> >                        (emerg = +105.0°C)
> > 
> > power1:       74.00 W  (cap = 272.00 W)
> > 
> > radeontop - 100% GPU usage and full clocks:
> > Graphics pipe 100.00%
> > 1.00G / 1.00G Memory Clock 100.00%
> > 2.47G / 2.58G Shader Clock  95.92%
> 
> I'm getting the same results on my Radeon RX Vega 64.
> $ cat /sys/class/drm/card0/device/gpu_busy_percent
> 99
> $ sensors
> nvme-pci-0100
> Adapter: PCI adapter
> Composite:    +43.9°C  (low  = -273.1°C, high = +72.8°C)
>                        (crit = +75.8°C)
> Sensor 1:     +43.9°C  (low  = -273.1°C, high = +65261.8°C)
> Sensor 2:     +49.9°C  (low  = -273.1°C, high = +65261.8°C)
> 
> amdgpu-pci-0c00
> Adapter: PCI adapter
> vddgfx:        1.09 V
> fan1:        1240 RPM  (min =    0 RPM, max = 3500 RPM)
> edge:         +50.0°C  (crit = +85.0°C, hyst = -273.1°C)
>                        (emerg = +90.0°C)
> junction:     +63.0°C  (crit = +105.0°C, hyst = -273.1°C)
>                        (emerg = +110.0°C)
> mem:          +51.0°C  (crit = +95.0°C, hyst = -273.1°C)
>                        (emerg = +100.0°C)
> power1:       74.00 W  (cap = 260.00 W)
> 
> k10temp-pci-00c3
> Adapter: PCI adapter
> Tctl:         +66.0°C
> Tdie:         +46.0°C
> 
> And also for radeontop.

$ cat /sys/class/drm/card0/device/gpu_busy_percent
0
$ sensors
nvme-pci-0100
Adapter: PCI adapter
Composite:    +32.9°C  (low  = -273.1°C, high = +72.8°C)
                       (crit = +75.8°C)
Sensor 1:     +32.9°C  (low  = -273.1°C, high = +65261.8°C)
Sensor 2:     +41.9°C  (low  = -273.1°C, high = +65261.8°C)

amdgpu-pci-0c00
Adapter: PCI adapter
vddgfx:      750.00 mV 
fan1:        1182 RPM  (min =    0 RPM, max = 3500 RPM)
edge:         +37.0°C  (crit = +85.0°C, hyst = -273.1°C)
                       (emerg = +90.0°C)
junction:     +38.0°C  (crit = +105.0°C, hyst = -273.1°C)
                       (emerg = +110.0°C)
mem:          +39.0°C  (crit = +95.0°C, hyst = -273.1°C)
                       (emerg = +100.0°C)
power1:        7.00 W  (cap = 260.00 W)

k10temp-pci-00c3
Adapter: PCI adapter
Tctl:         +65.6°C  
Tdie:         +45.6°C

# radeontop
Graphics pipe 0.83%
0.17G / 0.94G Memory Clock 17.67%
0.03G / 1.63G Shader Clock  1.81%

So running a kernel != 5.10.46 does not have this problem :)

Attachment: signature.asc
Description: This is a digitally signed message part.


Reply to: