[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: ROCm test launcher modifications when there is other non-AMD GPU cards.



Hi Clement,

On 2025-05-12 13:06, LONGEAC Clement wrote:
> We would have to hide NVIDIA in order to test only on AMD Radeon RX
> 6400 . Rather than making a specific command for testing on the AMD or a
> specific card, would it be possible to add an option to the rocm-test-
> launcher /debian/test/opencl command, allowing specific selection of one
> or more cards for the test phases? Should we select the specific card
> we're interested in?

Hm, interesting problem. rocm-test-launcher was designed to be a helper
for tests driven by the autopkgtest command, rather than being invoked
directly. Among other things, the utility checks for GPU presence only
so that it can skip, rather than error out, on a lack thereof, which is
what needs to happen in the official CI (as opposed to [1]).

I saw GPU selection as part of the testbed setup. To this end, packages
'rocm-qemu-support' and 'rocm-podman-support' provide suitable
autopkgtest backends [2, 3].

Given the above, adding this feature to rocm-test-launcher would seem
like a layer violation, so Cory's intuition was right.

However, it also shouldn't be needed. When designing tests, if you want
to run them in your shell, you might as well invoke the tests directly,
ie:

  OCL_ICD_VENDORS=/etc/OpenCL/vendors/amdocl64.icd debian/tests/script*

And in d/tests/control, you would just use

  rocm-test-launcher debian/tests/script*

and you could rely on the fact that our team's CI [1] provides properly
set up testbeds.

Side note: The synopsis of rocm-test-launcher is

   rocm-test-launcher CMD [ARGS]

so if your intention was for debian/tests/script* to expand to multiple
tests, you'll need to invoke rocm-test-launcher for each of them, or
create a small wrapper like this one [4].

On 2025-05-12 17:56, Cordell Bloor wrote:
> You should direct that question towards the pkg-rocm-tools
> maintainer(or author). I've CC'd Christian Kastner

Thanks. In any case, I'm subscribed to the list :)

Best,
Christian

[1]: https://ci.rocm.debian.net

[2]: For example, with the 'qemu+rocm' backend, you could run an
     autopkgtest with the one GPU in PCI slot 09:00.0 like so:

     $ autopkgtest -B <package> -- qemu+rocm --gpu 09:00.0 <img>

[3]: Sadly, after much trying, it seems that the analog for [1] in
     rootless containers, using the 'podman+rocm' backend, is not
     possible due to come cgroupsv2 restriction. However, I still
     have the code for that, and I guess I could ship it for people who
     want to try it in rootful containers.

[4]:
https://sources.debian.org/src/rocrand/5.7.1-6/debian/tests/run-testdir/


Reply to: