ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
Interface for OuteTTS models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
super expressive prompting model based on ltx2.3
A generative speech model for daily dialogue
High-Quality Voice Cloning TTS for 600+ Languages
One-click deployment (including offline integration package)
Instant voice cloning by MIT and MyShell. Audio foundation model
Spark-TTS Inference Code
End-to-end speech processing toolkit
MARS5 speech model (TTS) from CAMB.AI
Foundational model for human-like, expressive TTS
Synchronized Translation for Videos
Towards Human-Level Text-to-Speech through Style Diffusion
App in java for chatting to a generative A.I. (involving tts and stt)
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chinese voice dialogue robot/smart speaker project
Conditional Variational Autoencoder with Adversarial Learning
Deep learning for text to speech
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Text-to-Speech System for Galician and Spanish