A Gradio web UI for running Large Language Models like LLaMA
Multi-Voice and Prompt-Controlled TTS Engine
Generate audiobooks from EPUBs, PDFs and text with captions
The official Python SDK for the ElevenLabs API
MARS5 speech model (TTS) from CAMB.AI
Towards Human-Level Text-to-Speech through Style Diffusion
Multi-lingual large voice generation model, providing inference
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Python library and CLI tool to interface with Google Translate
Official PyTorch Implementation
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
High-quality multi-lingual text-to-speech library by MyShell.ai
Scalable generative AI framework built for researchers and developers
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Implementation of AudioLM audio generation model in PyTorch
A simple native web interface that uses ChatTTS to synthesize text
Controllable and fast Text-to-Speech for over 7000 languages
A TTS model capable of generating ultra-realistic dialogue
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Synchronized Translation for Videos
Conversational voice AI agents
A sound cloning tool with a web interface, using your voice
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from e-books