Python library and CLI tool to interface with Google Translate
A lightweight text-to-speech model with zero-shot voice cloning
High-Quality Voice Cloning TTS for 600+ Languages
A fast TTS architecture with conditional flow matching
A text-to-speech, speech-to-text and speech-to-speech library
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Dia-1.6B generates lifelike English dialogue and vocal expressions