A text-to-speech, speech-to-text and speech-to-speech library
Free, high-quality text-to-speech API endpoint to replace OpenAI
The Python library for real-time communication
A lightweight text-to-speech model with zero-shot voice cloning
The official Python SDK for the ElevenLabs API
StreamSpeech is a seamless model for offline speech recognition
Tokenizer-Free TTS for Multilingual Speech Generation
Open-source text-to-speech tool that supports extra-long text
Towards Human-Sounding Speech
A nearly-live implementation of OpenAI's Whisper
Converts text to speech in real time
One-click deployment (including offline integration package)
Super-expressive prompting model based on ltx2.3
A TTS model capable of generating ultra-realistic dialogue
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Instant voice cloning by MIT and MyShell. Audio foundation model
Controllable & emotion-expressive zero-shot TTS
Like the macOS say command, but with a modern voice
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Video translation and dubbing tool powered by LLMs
Build Vision Agents quickly with any model or video provider
Interface for OuteTTS models
Official MiniMax Model Context Protocol (MCP) server
A simple native web interface that uses ChatTTS to synthesize text
Java app for chatting with a generative AI (using TTS and STT)