Re: Inquiry about GSoC 2025 Debian ROCm proposal

To: Xuanteng Huang <xuanteng.huang@outlook.com>, 黃柏竣 <adopcarry@gmail.com>
Cc: Debian ROCm Team <debian-ai@lists.debian.org>
Subject: Re: Inquiry about GSoC 2025 Debian ROCm proposal
From: Cordell Bloor <cgmb@slerp.xyz>
Date: Wed, 26 Mar 2025 01:55:51 -0600
Message-id: <94b402a6-50cb-4109-83bb-b44c78b220b0@slerp.xyz>
In-reply-to: <CB412E2C-2441-4E0A-9AF1-3C7537CD19D4@outlook.com>
References: <CAEAQhx5PX2xq2j0L12TSarf6z-mXSmi1DSeh906g+ZnWd+j8Wg@mail.gmail.com> <CB412E2C-2441-4E0A-9AF1-3C7537CD19D4@outlook.com>

Hi Bo-Jun and Xuanteng,

On 2025-03-26 01:13, Xuanteng Huang wrote:
> try to package a ROCm component not yet included in Debian.

If you have access to appropriate hardware, I might suggest packaging rocWMMA [1]. It is a library for accelerating mixed-precision matrix multiply-accumulate operations. It is specifically designed to help take advantage of Wave Matrix Multiply Accumulate (WMMA) instructions in RDNA 3/4 GPUs and Matrix Fused-Multiply-Add (MFMA) instructions in CDNA 1/2/3 GPUs.

I reviewed the library a couple of years ago, but I didn't get any further than creating an empty Salsa repo [2] because there weren't any libraries or applications that used rocWMMA. That seems to have changed, as llama.cpp will now look for it when GGML_HIP_ROCWMMA_FATTN is ON [3].


Sincerely,
Cory Bloor

[1]: https://github.com/ROCm/rocWMMA
[2]: https://salsa.debian.org/rocm-team/rocwmma

[3]: https://github.com/ggml-org/llama.cpp/blob/b4958/ggml/src/ggml-hip/CMakeLists.txt#L42C5-L42C27
