Multi-lingual large voice generation model, providing inference
SOTA Open Source TTS
An Open Source text-to-speech system built by inverting Whisper
Instant voice cloning by MIT and MyShell. Audio foundation model
TTS model capable of streaming conversational audio in realtime
MOSS‑TTS Family open‑source speech and sound generation model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
LLM-based Reinforcement Learning audio edit model
A TTS model capable of generating ultra-realistic dialogue
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Conditional Variational Autoencoder with Adversarial Learning