The open-source voice synthesis studio powered by Qwen3-TTS
SoTA open-source TTS
End-to-end speech processing toolkit
Toolkit for conversational AI
Foundational model for human-like, expressive TTS
Towards Human-Sounding Speech
A high-quality rapid TTS voice cloning model
Offline inference engine for art, real-time voice conversations
An Open Source text-to-speech system built by inverting Whisper
Scalable generative AI framework built for researchers and developers
A generative speech model for daily dialogue
Python library and CLI tool to interface with Google Translate
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
A list of accessible speech corpora for ASR, TTS
WaveRNN Vocoder + TTS
Implementation of a Transformer based neural network
Free and open source text-to-speech software
ColdFusion SDK for the VoiceShot API.
PHP SDK for processing phone calls and SMS through the VoiceShot API.
.NET SDK for processing phone calls and SMS through the VoiceShot API.
ASP SDK for processing phone calls and SMS through the VoiceShot API.
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Process large speech data wrt transcription, labeling and annotation