Bug#1095237: ITP: vllm -- A high-throughput and memory-efficient inference and serving engine for LLMs
Package: wnpp
Severity: wishlist
Owner: Mo Zhou <lumin@debian.org>
X-Debbugs-Cc: debian-devel@lists.debian.org, debian-ai@lists.debian.org
* Package name    : vllm
  Version         : 0.7.1
  Upstream Contact: 
* URL             : https://github.com/vllm-project/vllm
* License         : Apache-2.0
  Programming Lang: Python
  Description     : A high-throughput and memory-efficient inference and serving engine for LLMs
I think this is one of the most important applications in the reverse
dependency tree of the pytorch package. vllm has a very large dependency
tree, and many of those dependencies are not yet packaged in Debian, so I'm
setting vllm as a long-term goal. Alternatives are ollama and llama.cpp.
Everything, including vllm's necessary dependencies, will be maintained by
the Debian Deep Learning Team.
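For context, here is a minimal sketch of vllm's offline inference API,
following the upstream quickstart (the model name below is only an example
placeholder):

    from vllm import LLM, SamplingParams

    # Load a model; any Hugging Face model id works, this one is an example.
    llm = LLM(model="facebook/opt-125m")

    # Sampling parameters for generation.
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

    # vllm batches and schedules requests for high throughput.
    outputs = llm.generate(["Hello, my name is"], sampling_params)
    for output in outputs:
        print(output.prompt, "->", output.outputs[0].text)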