[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Inquiry about GSoC 2025 Debian ROCm proposal



Hi Bo-Jun and Xuanteng,

On 2025-03-26 01:13, Xuanteng Huang wrote:
try to package a ROCm component not yet included in Debian.

If you have access to appropriate hardware, I might suggest packaging rocWMMA [1]. It is a library for accelerating mixed-precision matrix multiply-accumulate operations. It is specifically designed to help take advantage of Wave Matrix Multiply Accumulate (WMMA) instructions in RDNA 3/4 GPUs and Matrix Fused-Multiply-Add (MFMA) instructions in CDNA 1/2/3 GPUs.

I reviewed the library a couple years ago but I didn't get any further than creating an empty Salsa repo [2] because there weren't any libraries or applications that used rocWMMA. That seems to have changed, as llama.cpp will now look for it when GGML_HIP_ROCWMMA_FATTN is ON [3].

Sincerely,
Cory Bloor

[1]: https://github.com/ROCm/rocWMMA
[2]: https://salsa.debian.org/rocm-team/rocwmma
[3]: https://github.com/ggml-org/llama.cpp/blob/b4958/ggml/src/ggml-hip/CMakeLists.txt#L42C5-L42C27


Reply to: