Instant voice cloning by MIT and MyShell. Audio foundation model
High-Quality Voice Cloning TTS for 600+ Languages
SoTA open-source TTS
Controllable & emotion-expressive zero-shot TTS
SOTA Open Source TTS
Open-source framework for intelligent speech interaction
A generative speech model for daily dialogue
A sound cloning tool with a web interface, using your voice
A nearly-live implementation of OpenAI's Whisper
MARS5 speech model (TTS) from CAMB.AI
A TTS model capable of generating ultra-realistic dialogue
MOSS‑TTS Family open‑source speech and sound generation model
Speech-AI-Forge is a project developed around TTS generation model
LLM-based Reinforcement Learning audio edit model
Towards Human-Sounding Speech
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
VITS2 backbone with multilingual-bert
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Dia-1.6B generates lifelike English dialogue and vocal expressions