Long-form streaming TTS system for multi-speaker dialogue generation
MARS5 speech model (TTS) from CAMB.AI
State-of-the-art TTS model under 25MB
Controllable & emotion-expressive zero-shot TTS
FAIR Sequence Modeling Toolkit 2
TTS model capable of streaming conversational audio in realtime
Towards Human-Level Text-to-Speech through Style Diffusion
Two Integrated Text To Speech Engines uses MMS & Silero
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer based neural network
A python package to analyze and compare voices with deep learning
TensorFlow Implementation of DC-TTS: yet another text-to-speech model