State-of-the-art TTS model under 25MB
Generate audiobooks from e-books
A fast TTS architecture with conditional flow matching
A text-to-speech, speech-to-text and speech-to-speech library
A TTS that fits in your CPU (and pocket)
A lightweight text-to-speech model with zero-shot voice cloning
A Conversational Speech Generation Model
Best practice TTS based on BERT and VITS
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer based neural network