Re: Building llama.cpp for AMD GPU using only Debian packages?

To: Cordell Bloor <cgmb@slerp.xyz>
Cc: Petter Reinholdtsen <pere@hungry.com>, debian-ai@lists.debian.org
Subject: Re: Building llama.cpp for AMD GPU using only Debian packages?
From: Jamie Bainbridge <jamie.bainbridge@gmail.com>
Date: Sat, 1 Feb 2025 16:13:15 +1000
Message-id: <[🔎] CAAvyFNirbOGzcfJWk0F6VE1g2nTHHKPmW-n+m9mKEVq2v24Dfg@mail.gmail.com>
In-reply-to: <2a0bb4a5-bec1-4abe-b10a-3905afca8a09@slerp.xyz>
References: <sa6sewtv5km.fsf@hjemme.reinholdtsen.name> <a2e3c24b-6459-41cb-995f-93bb4bdfee97@slerp.xyz> <sa6wmm5s28m.fsf@hjemme.reinholdtsen.name> <sa6r0cds1br.fsf@hjemme.reinholdtsen.name> <af3fa2a8-63f0-44ec-bb0e-5105a3235f6b@slerp.xyz> <a6840953-4656-47ed-96f7-ee7fe2f5b247@slerp.xyz> <sa6tth8p2rk.fsf@hjemme.reinholdtsen.name> <sa6a5izpnvp.fsf@hjemme.reinholdtsen.name> <sa64j96q4gg.fsf@hjemme.reinholdtsen.name> <sa6sep3dh5h.fsf@hjemme.reinholdtsen.name> <CAAvyFNjufDunbeV42O6Or4ty7bxXerbwOF_0STbvXSVkqus95w@mail.gmail.com> <2a0bb4a5-bec1-4abe-b10a-3905afca8a09@slerp.xyz>

On Sat, 1 Feb 2025 at 00:47, Cordell Bloor <cgmb@slerp.xyz> wrote:
> [1]: I'd hoped that with so many friends on the team, I'd be able to
> have some influence on the technical direction of the library.
> Unfortunately, that proved not to be the case. It was a bit of a life
> lesson for me.
>
> Though whether AMD upstream
> accepts that tuning is an open question.

Open source software lives or dies by its community. With an
inattentive upstream as you describe above, it makes me wonder if
effort spent in ROCm is worth it.

ggeranov has also said llama.cpp is just maintaining ROCm support and
future effort will be put into Vulkan inference.

If Vulkan prompt processing could be sped up (maybe like jart's matmul
work? https://justine.lol/matmul/) then there would be little need to
use ROCm for llama.cpp at all.

I am only new here and I don't know how the rest of the ROCm ecosystem
operates, there is surely more than hobbyists running llama.cpp on
desktop GPUs, but if everyone else in this space is working on Vulkan
maybe Debian (and AMD) could leverage Vulkan more instead?

Jamie

Reply to:

Follow-Ups:
- Re: Building llama.cpp for AMD GPU using only Debian packages?
  - From: Cordell Bloor <cgmb@slerp.xyz>

Next by Date: Bug#1094763: libamdhip64-dev: CMake config helpers are installed in the wrong directory
Next by thread: Re: Building llama.cpp for AMD GPU using only Debian packages?
Index(es):
- Date
- Thread