The LLM I used to produce that exact news report was gpt-4o-mini, from
OpenAI. ChatGPT is the name of OpenAI's LLM web interface, and the
underlying model behind it may change. The bulk API calls took roughly
3 minutes.

That said, I have implemented support for basically all commonly seen
LLM inference services: four commercial ones (OpenAI, Anthropic,
Google, xAI) and four self-hosted ones (llamafile, ollama, vllm, plus a
built-in ZMQ backend that is somewhat outdated). Services missing from
that list are also supported as long as they offer an OpenAI-compatible
API; see the sketch after the links below.

For a use case like summarizing a mailing list, a self-hosted service
will be much slower to answer the bulk API calls unless it runs on a
GPU cluster :-) And small LLMs are not necessarily smart enough. The
Open LLM Leaderboard[3] is a good reference for figuring out the best
open-access LLM for self-hosting.

Regarding a "Debian hosted computer with an AMD GPU for LLM inference"
-- that is exactly one of the long-term goals of the Debian Deep
Learning Team (debian-ai@l.d.o). Team members are working on preparing
the ROCm packages and the ROCm build of PyTorch.

If you have a spare GPU and do not mind using software from outside
the Debian archive, I find ollama[1] and llamafile[2] quite handy to
use locally.

[1] https://github.com/ollama/ollama
[2] https://github.com/Mozilla-Ocho/llamafile
[3] https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
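To make "OpenAI compatibility mode" concrete, here is a minimal sketch
(not DebGPT's actual code) of pointing the same client at either the
hosted API or a local server. It assumes the official `openai` Python
package and an ollama instance on its default port; the model names
and the prompt are only examples:

    from openai import OpenAI

    # Hosted service:       OpenAI(api_key="sk-...")
    # Self-hosted backend:  point base_url at any OpenAI-compatible
    #                       endpoint (ollama, vllm, llamafile, ...).
    client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused")

    emails = ["... email body 1 ...", "... email body 2 ..."]  # placeholder
    summaries = []
    for body in emails:
        resp = client.chat.completions.create(
            model="llama3.1:8b",  # or "gpt-4o-mini" against the hosted API
            messages=[{"role": "user",
                       "content": "Summarize this debian-devel email:\n" + body}],
        )
        summaries.append(resp.choices[0].message.content)

The speed gap mentioned above comes from issuing many such requests
concurrently: a hosted service absorbs them in parallel, while a single
local GPU ends up serving them more or less one at a time.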
On 11/9/24 05:19, PICCA Frederic-Emmanuel wrote:
> is it via ChatGPT or a self-hosted LLM?
>
> Can we imagine having a Debian hosted computer with an AMD GPU
> dedicated to this use case?
>
> We should provide these summary letters for most of our mailing
> lists :)
>
> cheers
>
> Fred
>
> ----- On 9 Nov 24, at 14:09, Hector Oron <zumbi@debian.org> wrote:
>> Hello Lumin,
>>
>> On Sat, 9 Nov 2024 at 10:27, DebGPT (<lumin@debian.org>) wrote:
>>> This is an experiment: letting an LLM go through all 369 emails
>>> from debian-devel in October. The command for producing the news
>>> report is included below. Use debgpt's git HEAD if you want to try.
>>
>> This is the first time I see this kind of email. I thought some time
>> ago that this would be a really cool use of AI, to produce a summary
>> of mailing lists, since I struggle to read everything. I just want to
>> thank you for putting this together and, at least from my side, this
>> is very much appreciated.
>>
>> Regards
>> --
>>   Héctor Orón
>>
>> -.. . -... .. .- -. -.. . ...- . .-.. --- .--. . .-.