Generate audiobooks from EPUBs, PDFs and text with captions
A Gradio web UI for running Large Language Models like LLaMA
StreamSpeech is a seamless model for offline speech recognition
Multi-Voice and Prompt-Controlled TTS Engine
The official Python SDK for the ElevenLabs API
MARS5 speech model (TTS) from CAMB.AI
Python library and CLI tool to interface with Google Translate
Towards Human-Level Text-to-Speech through Style Diffusion
Easy-to-use Speech Toolkit including Self-Supervised Learning model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
High-quality multi-lingual text-to-speech library by MyShell.ai
Official PyTorch Implementation
Controllable and fast Text-to-Speech for over 7000 languages
Multi-lingual large voice generation model, providing inference
Scalable generative AI framework built for researchers and developers
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A TTS model capable of generating ultra-realistic dialogue
A simple native web interface that uses ChatTTS to synthesize text
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Synchronized Translation for Videos
Implementation of AudioLM audio generation model in PyTorch
Conversational voice AI agents
A sound cloning tool with a web interface, using your voice
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Generate audiobooks from e-books