
whisper-cpp with AMD GPUs (Was: llama-cpp with AMD GPUs)



Good news everyone!

On 2024-02-10 17:35, Cordell Bloor wrote:
> Btw, would whisper-cpp be a better match for Debian than the
> openai-whisper implementation?

I know very little about AI applications, but I see that whisper-cpp has a hipblas backend that can be enabled with `-DWHISPER_HIPBLAS=ON`. A quick review of the codebase suggests to me that all the dependencies required to package whisper-cpp with GPU acceleration are probably already in Debian.

It seems to work fairly well on my Radeon VII.

apt -y update
apt -y upgrade
apt -y install git wget ffmpeg hipcc libhipblas-dev librocblas-dev cmake build-essential
wget 'https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-large-v3.bin?download=true' -O ggml-large-v3.bin
git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp
git checkout 02b4c52c1289e05c8c04ff8370a4835b8ee99c86
CC=clang-15 CXX=clang++-15 cmake -S. -Bbuild -DWHISPER_HIPBLAS=ON -DAMDGPU_TARGETS="gfx803;gfx900;gfx906;gfx908;gfx90a;gfx1010;gfx1030" -DCMAKE_BUILD_TYPE=Release
make -j16 -C build
make samples
./build/bin/main -m ../ggml-large-v3.bin -f samples/jfk.wav --print-colors
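As an aside, if you want to confirm that the transcription is actually running on the GPU rather than silently falling back to the CPU, you can watch GPU utilization from a second terminal while main is running. This is just a sketch: it assumes the rocm-smi utility (from the rocm-smi Debian package) is installed; the exact output columns vary between ROCm versions.

```shell
# In a second terminal, poll GPU activity while the transcription runs.
# GPU% near 100 indicates the hipblas backend is doing the heavy lifting.
watch -n 1 rocm-smi --showuse

# One-shot alternative: show utilization and VRAM usage once.
rocm-smi --showuse --showmemuse
```

You should also see whisper.cpp print the selected backend/device in its startup banner, which is a quicker sanity check.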

That said, there seems to be a bug in whisper.cpp at that particular commit. I'm seeing occasional errors in the transcription output, both with the hipblas backend and with the pure CPU implementation. When I run on samples/gb1.wav, the transcription works properly until it reaches 00:02:02.200, at which point it just keeps repeating, "We will be led into the darkness."
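For what it's worth, this kind of repetition loop is a known failure mode of greedy Whisper decoding rather than something specific to the GPU path, and the main example exposes a few decoding knobs that sometimes work around it. I haven't verified these fix this particular sample, and the flag names may differ at older commits, so check `./build/bin/main --help` first:

```shell
# Lower the entropy threshold so low-information (repetitive) segments
# trigger a decoder fallback earlier (default is around 2.4).
./build/bin/main -m ../ggml-large-v3.bin -f samples/gb1.wav --entropy-thold 2.0

# Limit how much previous text is fed back as context, which can stop
# a repetition from perpetuating itself across segments.
./build/bin/main -m ../ggml-large-v3.bin -f samples/gb1.wav --max-context 64

# Beam search instead of greedy decoding is slower but more robust.
./build/bin/main -m ../ggml-large-v3.bin -f samples/gb1.wav --beam-size 5
```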

Sincerely,
Cory Bloor

