On 7/13/25 10:15 AM, 千代航平 wrote:
Of course, my next goal is the GPU version. However, I am struggling with ray and xformers. First, ray is built with Bazel, it has strict version requirements on Bazel itself, and it downloads source code from the Internet during the build. Copying the code in by hand might not be a big issue, but the Bazel version constraint is critical for me. I have been struggling with this problem for almost a week...
ray is not mandatory: https://github.com/vllm-project/vllm/blob/3fc964433a84bad785d9d0656fd56195462321b8/vllm/config.py#L2098-L2142
According to that code, ray is only needed for multi-node distributed inference. If we disable ray, we can still do single-node inference with multiple GPUs. Let's figure out a way to disable ray while building vllm-cuda, so that we can avoid Bazel.
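For illustration, a minimal sketch of what single-node multi-GPU inference without ray could look like, assuming a vLLM build that supports the multiprocessing executor backend; the model name and tensor_parallel_size below are just placeholders:

    # Single-node, multi-GPU inference that never touches ray:
    # selecting the "mp" (multiprocessing) executor backend explicitly
    # keeps the config logic linked above from falling back to ray.
    from vllm import LLM, SamplingParams

    llm = LLM(
        model="facebook/opt-125m",           # placeholder model
        tensor_parallel_size=2,              # two GPUs on one node
        distributed_executor_backend="mp",   # multiprocessing backend, no ray
    )

    outputs = llm.generate(
        ["Hello, my name is"],
        SamplingParams(max_tokens=32),
    )
    print(outputs[0].outputs[0].text)

If something like this works, the ray (and hence Bazel) dependency could simply be dropped from the vllm-cuda package inputs, since nothing would import it at runtime.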