Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Open source text-to-speech tool, supports extra-long text
Open source AI model for generating full songs from lyrics prompts
The most powerful and modular diffusion model GUI, api and backend
Converts text to speech in realtime
Data manipulation and transformation for audio signal processing
Use Microsoft Edge's online text-to-speech service from Python
A single Gradio + React WebUI with extensions for ACE-Step
The official Python SDK for the ElevenLabs API
A sound cloning tool with a web interface, using your voice
Unofficial Python API and agentic skill for Google NotebookLM
Speech recognition for your site
One-click deployment (including offline integration package)
The python library for real-time communication
A TTS model capable of generating ultra-realistic dialogue
Voice Recognition to Text Tool
Manage Claude Code in style
Python inference and LoRA trainer package for the LTX-2 audio–video
Towards Human-Sounding Speech
A react-based starter app for using the Live API over websockets
High-Quality Voice Cloning TTS for 600+ Languages
AI tool that turns Hacker News posts into daily podcast updates
Controllable & emotion-expressive zero-shot TTS
Component library and custom registry built on top of shadcn/ui