Notes about ROCm on RDNA2

To: debian-ai@lists.debian.org
Subject: Notes about ROCm on RDNA2
From: Cordell Bloor <cgmb-deb@slerp.xyz>
Date: Sun, 10 Jul 2022 18:49:17 -0600
Message-id: <[🔎] ccc12699-62be-e833-b4a4-8bbc437fa644@slerp.xyz>

Hi folks,

I wanted to briefly share some of what that I've learned about usingROCm with RDNA2 GPUs. I'm sure many of you are aware that there's only acouple RDNA2 GPUs that are officially supported by the AMD ROCm projectupstream: the Radeon Pro W6800 and Radeon Pro v620. In practice,however, all Navi 21 GPUs share the same processor id (gfx1030) and willwork just fine despite not being officially supported.

Other Navi 2x GPUs have a different processor id. If I recall correctly,Navi 22 is gfx1031, Navi 23 is gfx1032, Navi 24 is gfx1033, etc. Thus,if you try to use the AMD binaries built for Navi 21 on Navi 22 orabove, no compatible code objects will be found and your program willexit with a fatal error from the HIP runtime.

I'd thought that this incompatibility was fundamental, but it seems Iwas wrong. LLVM treats the gfx1030–gfx1036 targets identically. Itgenerates code objects with different processor ids stamped in themetadata, but the executable code is all the same. In fact, if you telllibhsakmt to report the device as being gfx1030, any of the gfx103x GPUswill be able to load and execute code compiled for gfx1030. This can bedone by setting an environment variable [1]:


    export HSA_OVERRIDE_GFX_VERSION=10.3.0

As far as I can tell, despite having differing ids, all RDNA2 desktopGPUs share the same ISA and can execute the same code. This was apleasant surprise, as it greatly expands the list of hardware that couldbe used with ROCm.


Sincerely,
Cory Bloor

[1]:https://github.com/RadeonOpenCompute/ROCT-Thunk-Interface/blob/rocm-5.2.0/src/topology.c#L1180

Reply to:

Follow-Ups:
- Re: Notes about ROCm on RDNA2
  - From: "xyz20003@gmail.com" <xyz20003@gmail.com>
- Re: Notes about ROCm on RDNA2
  - From: Cordell Bloor <cgmb-deb@slerp.xyz>

Prev by Date: lodepng_0.0~git20220618.b4ed2cd-1~exp1_amd64.changes ACCEPTED into experimental, experimental
Next by Date: Processing of lodepng_0.0~git20220618.b4ed2cd-1_source.changes
Previous by thread: lodepng_0.0~git20220618.b4ed2cd-1~exp1_amd64.changes ACCEPTED into experimental, experimental
Next by thread: Re: Notes about ROCm on RDNA2
Index(es):
- Date
- Thread