
Re: Bug#1063673: ITP: llama.cpp -- Inference of Meta's LLaMA model (and others) in pure C/C++



On 2025-02-06 09:26, Petter Reinholdtsen wrote:
> I hope so too, but I guess we will soon find out.  My initial draft on
> <URL: https://salsa.debian.org/deeplearning-team/whisper.cpp > will need
> a lot of updates  to bring it in line with this new approach. :)

If it helps: after many iterations I finally reached a satisfying
result, and then deliberately rewrote the history of the llama.cpp repo
so that it (1) starts with a very minimal baseline packaging and then
(2) adds one feature per commit.

In fact, since practically all of the backend configuration and
performance work concerns ggml, which llama.cpp and whisper.cpp share,
you should be able to mirror every commit after the first.
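In plain git terms, "mirror every commit after the first" amounts to
cherry-picking everything past the root commit onto the other repo's
history. A minimal sketch with two throwaway local repos (the repo
names, branch name, and file contents are purely illustrative, not the
actual Salsa repositories):

```shell
#!/bin/sh
set -e

# Illustrative identity so commits work in a clean environment.
export GIT_AUTHOR_NAME="Demo" GIT_AUTHOR_EMAIL="demo@example.org"
export GIT_COMMITTER_NAME="Demo" GIT_COMMITTER_EMAIL="demo@example.org"

tmp=$(mktemp -d)
cd "$tmp"

# "llama" stands in for the llama.cpp packaging repo: a minimal
# baseline commit followed by per-feature commits.
git init -q -b main llama
cd llama
git commit -q --allow-empty -m "baseline: minimal packaging"
echo "openblas" > backend
git add backend
git commit -q -m "enable BLAS backend"
echo "vulkan" >> backend
git add backend
git commit -q -m "enable Vulkan backend"
cd ..

# "whisper" stands in for the whisper.cpp packaging repo,
# which has its own baseline commit.
git init -q -b main whisper
cd whisper
git commit -q --allow-empty -m "baseline: whisper.cpp packaging"

# Replay everything after llama's root commit onto whisper's history.
git remote add llama ../llama
git fetch -q llama
root=$(git rev-list --max-parents=0 llama/main)
git cherry-pick "$root"..llama/main
git log --oneline
```

The same idea works with the real repos by pointing the remote at the
llama.cpp packaging repository; conflicts would only arise where the
two packagings genuinely diverge.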

Happy to help, let me know if I should do anything.

Best,
Christian

