TTS with Kokoro and ONNX Runtime
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books
A sound cloning tool with a web interface, using your voice
A fast TTS architecture with conditional flow matching
Controllable & emotion-expressive zero-shot TTS
A TTS model capable of generating ultra-realistic dialogue
VITS2 backbone with multilingual-bert
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of a Transformer-based neural network
Conditional Variational Autoencoder with Adversarial Learning