A nearly-live implementation of OpenAI's Whisper
Unlimited text-to-speech conversion
Use Microsoft Edge's online text-to-speech service from Python
Self-host the powerful Chatterbox TTS model
Offline text-to-speech synthesis for Python
Long-form streaming TTS system for multi-speaker dialogue generation
A high-quality rapid TTS voice cloning model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Converts text to speech in realtime
Open-source multi-speaker long-form text-to-speech model
Qwen3-Omni is a natively end-to-end, omni-modal LLM
An open-source text-to-speech system built by inverting Whisper
Capable of understanding text, audio, vision, video
State-of-the-art TTS model under 25MB
MARS5 speech model (TTS) from CAMB.AI
Foundational model for human-like, expressive TTS
Spark-TTS Inference Code
Controllable & emotion-expressive zero-shot TTS
Faster Whisper transcription with CTranslate2
SoTA open-source TTS
Instant voice cloning by MIT and MyShell. Audio foundation model
As little as 1 minute of voice data is enough to train a good TTS model
Open-source framework for intelligent speech interaction
The behavior guidance framework for customer-facing LLM agents
TTS model capable of streaming conversational audio in realtime