Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs

To: debian-ai@lists.debian.org
Subject: Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
From: Cordell Bloor <cgmb@slerp.xyz>
Date: Sat, 28 Jun 2025 11:23:09 -0600
Message-id: <[🔎] 68806b35-265b-4ba0-a08c-47f93ac43112@slerp.xyz>
In-reply-to: <[🔎] 84fad571-ed31-495a-9400-a2ac719b4353@debian.org>
References: <[🔎] 84fad571-ed31-495a-9400-a2ac719b4353@debian.org>

Hi Christian,

There is perhaps an additional complexity with the generic targets thatyou may wish to consider in your design.

The new generic targets have a hidden version number. When you specifythat you wish to build for gfx11-generic, the compiler turns that into acommand to build for gfx11-generic-v0. If we were to imagine that therewere a new gfx11 GPU released that was added to gfx11-generic and thatrequired changes to the gfx11-generic code generation in order tofunction, then LLVM would increment the internal ISA version number [1].Using that newer version of LLVM, a request to build for gfx11-genericwould build for gfx11-generic-v1.

This version number is so that if you attempt to run an oldgfx11-generic-v0 binary on that new and incompatible gfx11 GPU, the HIPRuntime would know that the code object is not compatible and would notload it. This would be resolved by rebuilding the binary forgfx11-generic on the newer compiler, which would output gfx11-generic-v1code objects that the HIP Runtime would recognize as being compatiblewith that new hardware.

In any case, the point of this is that I think the information that youcare about is the compiler's full target name with the version number.This distinction doesn't matter yet, as we're not using generic targetson Debian yet and those are the only targets that have a version number.Also, I don't think LLVM has ever incremented a generic target versionnumber yet. Nevertheless, it's something to consider for the future ifwe're designing for the long term.


On 2025-06-27 00:38, Christian Kastner wrote:

I would like to propose that all binary packages built for AMD GPU ISAs
document those ISAs in an X-ROCm-Built-For field.

For example:

   X-ROCm-Built-For: gfx900 gfx1030 gfx1200 ...

I do mean *all* packages, so including all our reverse dependencies.

It would be nice if this could be consistent across various differenttypes of accelerators. Ultimately, this field basically means that theprogram was built by calling `clang++--offload-arch=<$X-ROCm-Built-For[0]>--offload-arch=<$X-ROCm-Built-For[1]> ....`.


For AMD GPUs, those values are gfx900 gfx1030 gfx1200 ...
For Intel GPUS, those values are bdw, acm_g10, acm_g11, pvc ...
For NVIDIA GPUs, those values are sm_60, sm_70, sm_80, sm_90 ...

I also wonder if other sorts of accelerators might be supported throughthe same mechanism (e.g., NPUs). To compile code for the XDNA NPU, youinvokes clang using --target=aie2-none-unknown-elf.

Would it make sense to have one field that specifies the acceleratorarchitectures for all vendors? Or would it make more sense to have adifferent field for each vendor / accelerator toolchain? e.g.,X-Offload-Arch vs. X-<Vendor>-<Device Type/Runtime/Toolchain>-Arch?

Your approach is basically the latter (modulo minor naming differences).I don't have anything against that. It's just worth making an explicitdecision to take that approach, if that's the plan.

It's debatable whether this should also be added to -dev packages.
I myself don't think this would contribute much, other than extra
maintenance work.

I don't think it makes sense on them anyway, as they don't contain anyGPU code. That also cleanly solves matters for libraries such as rocfft,where the library does not contain any GPU code (because it depends onrun-time compilation) and therefore works on many different GPUs.

We could also use this list to "bridge" back to our CI. Does a package
pass all its tests on the listed ISAs -> otherwise, report a bug.

Although, I suppose this idea implies a somewhat differentinterpretation of the field. You are not saying, "this is the ISA thatthe package was built for" but rather "these are the GPUs that thepackage supports". Those are very different things in the case ofgeneric targets, the SPIR-V target, and run-time compilation. You'llneed to be clear about which you mean.


Sincerely,
Cory Bloor

[1]: Of course, if a new gfx11 GPU did not require any changes to thegfx11-generic code generation to function, the version number would notbe incremented. That's the best-case scenario, because it means that oldbinaries remain compatible with newer hardware and require nothing butan update to the driver / runtime.

Reply to:

Follow-Ups:
- Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
  - From: Christian Kastner <ckk@debian.org>

References:
- RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
  - From: Christian Kastner <ckk@debian.org>

Prev by Date: Re: Subject: Proposal: AI-native Debian Branch — A Strategic Path Toward OS-Level Intelligence
Next by Date: Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
Previous by thread: Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
Next by thread: Re: RFC: Adding an X-ROCm-Built-For field to packages targeting ISAs
Index(es):
- Date
- Thread