Instant voice cloning by MIT and MyShell. Audio foundation model
Interface for OuteTTS models
A lightweight text-to-speech model with zero-shot voice cloning
State-of-the-art open-source TTS
Multi-lingual large voice generation model, providing inference
MARS5 speech model (TTS) from CAMB.AI
A high-quality rapid TTS voice cloning model
A sound cloning tool with a web interface, using your voice
Towards Human-Level Text-to-Speech through Style Diffusion
Cross-platform software for text translation and recognition
Video translation and dubbing tool powered by LLMs
Cross-platform AI language practice app
Offline text-to-speech synthesis for Python
Readest is a modern, feature-rich ebook reader
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Spark-TTS Inference Code
Framework for building neural networks
Scalable generative AI framework built for researchers and developers
Long-form streaming TTS system for multi-speaker dialogue generation
Industrial-level controllable zero-shot text-to-speech system
Real-time voice interactive digital human
A simple native web interface that uses ChatTTS to synthesize speech from text
Lightning-fast, on-device TTS, running natively via ONNX
Automatically translates the text of a video based on a subtitle file
Python library and CLI tool to interface with Google Translate