Generate audiobooks from e-books, voice cloning & 1107+ languages
Comprehensive Gradio WebUI for audio processing
Synchronized Translation for Videos
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A sound cloning tool with a web interface, using your voice
Speech-AI-Forge is a project developed around TTS generation models
SoTA open-source TTS
Offline Text To Speech synthesis for Python
Generate audiobooks from e-books
Qwen3-TTS is an open-source series of TTS models
Reading book source
Generate audiobooks from EPUBs, PDFs and text with captions
SOTA Open Source TTS
Spark-TTS Inference Code
Framework for building neural networks
Towards Human-Sounding Speech
A text-to-speech, speech-to-text and speech-to-speech library
Python library and CLI tool to interface with Google Translate
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
Converts text to speech in real time
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Virtual AI anchor that combines state-of-the-art technology
Management of Yandex Station and other smart home devices