Offline inference engine for art, real-time voice conversations
Code for openai.fm, a demo for the OpenAI Speech API
SOTA Open Source TTS
State-of-the-art TTS model under 25MB
Controllable and fast Text-to-Speech for over 7000 languages
Industrial-level controllable zero-shot text-to-speech system
Toolkit for conversational AI
SOTA discrete acoustic codec models with 40/75 tokens per second
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Miso TTS is an 8 billion, highly emotive text-to-speech model
Towards Human-Sounding Speech
TTS with kokoro and onnx runtime
The open-source voice synthesis studio powered by Qwen3-TTS
A simple, high-quality voice conversion tool focused on ease of use
Instant voice cloning by MIT and MyShell. Audio foundation model
Video translation and dubbing tool powered by LLMs
Comprehensive Gradio WebUI for audio processing
Speech Note Linux app. Note taking, reading and translating
Generate audiobooks from e-books, voice cloning & 1107+ languages
Generate audiobooks from e-books
A simple native web interface that uses ChatTTS to synthesize text
Tokenizer-Free TTS for Multilingual Speech Generation
A cross-platform software for text translation and recognition
Offline Text To Speech synthesis for python