A TTS that fits in your CPU (and pocket)
Port of OpenAI's Whisper model in C/C++
A high-quality rapid TTS voice cloning model
Real-time NVIDIA GPU dashboard
Python-free Rust inference server
Running large language models on a single GPU
A lightweight text-to-speech model with zero-shot voice cloning
Calculate token/s & GPU memory requirement for any LLM
UME is an in-app debug kits platform for Flutter