Vibevoice 🎙️

Hi, I'm Marc Päpper and I wanted to vibe code like Karpathy ;D, so I looked around and found the cool work of Vlad. I extended it to run with a local whisper model, so I don't need to pay for OpenAI tokens. I hope you have fun with it!

What it does 🚀

Simply run cli.py and start dictating text anywhere in your system:

Hold down right control key (Ctrl_r)
Speak your text
Release the key
Watch as your spoken words are transcribed and automatically typed!

Works in any application or window - your text editor, browser, chat apps, anywhere you can type!

Installation 🛠️

git clone https://github.com/mpaepper/vibevoice.git
cd vibevoice
pip install -r requirements.txt
python src/vibevoice/cli.py

Requirements 📋

Python Dependencies

Python 3.12 or higher

System Requirements

CUDA-capable GPU (recommended) -> in server.py you can enable cpu use
CUDA 12.x
cuBLAS
cuDNN 9.x
In case you get this error: OSError: PortAudio library not found run sudo apt install libportaudio2

Handling the CUDA requirements

Make sure that you have CUDA >= 12.4 and cuDNN >= 9.x
I had some trouble at first with Ubuntu 24.04, so I did the following:

sudo apt update && sudo apt upgrade
sudo apt autoremove nvidia* --purge
ubuntu-drivers devices
sudo ubuntu-drivers autoinstall
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb && sudo apt update
sudo apt install cuda-toolkit-12-8

or alternatively:

wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt update
sudo apt install cudnn9-cuda-12

Then after rebooting, it worked well.

Usage 💡

Start the application:

python src/vibevoice/cli.py

Hold down right control key (Ctrl_r) while speaking
Release to transcribe
Your text appears wherever your cursor is!

Configuration

You can customize the trigger key by setting the VOICEKEY environment variable:

export VOICEKEY="ctrl"  # Use left control instead

Credits 🙏

Original inspiration: whisper-keyboard by Vlad
Faster Whisper for the optimized Whisper implementation
Built by Marc Päpper

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src/vibevoice		src/vibevoice
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vibevoice 🎙️

What it does 🚀

Installation 🛠️

Requirements 📋

Python Dependencies

System Requirements

Handling the CUDA requirements

Usage 💡

Configuration

Credits 🙏

About

Releases

Packages

Languages

mpaepper/vibevoice

Folders and files

Latest commit

History

Repository files navigation

Vibevoice 🎙️

What it does 🚀

Installation 🛠️

Requirements 📋

Python Dependencies

System Requirements

Handling the CUDA requirements

Usage 💡

Configuration

Credits 🙏

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages