Instant voice cloning by MIT and MyShell. Audio foundation model
Management of Yandex Station and other smart home devices
High-Quality Voice Cloning TTS for 600+ Languages
SoTA open-source TTS
A simple native web interface that uses ChatTTS to synthesize text
Controllable & emotion-expressive zero-shot TTS
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
SOTA Open Source TTS
A generative speech model for daily dialogue
Offline Text To Speech synthesis for python
Use Microsoft Edge's online text-to-speech service from Python
A nearly-live implementation of OpenAI's Whisper
MARS5 speech model (TTS) from CAMB.AI
A TTS model capable of generating ultra-realistic dialogue
A sound cloning tool with a web interface, using your voice
Controllable and fast Text-to-Speech for over 7000 languages
Speech-AI-Forge is a project developed around TTS generation model
Towards Human-Sounding Speech
Spark-TTS Inference Code
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Automatically translates the text of a video based on a subtitle file
Mice speech to text with MX Cinnamon OS ISO
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Open source implementation of Microsoft's VALL-E X zero-shot TTS model