Generate audiobooks from e-books
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A sound cloning tool with a web interface, using your voice
NeuTTS model built from small LLM backbones
On-device TTS model by Neuphonic
End-to-end speech processing toolkit
Framework for building realtime multimodal voice AI agents apps
Towards Human-Sounding Speech
A simple, high-quality voice conversion tool focused on ease of use
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Multi-lingual large voice generation model, providing inference
Toolkit for conversational AI
MOSS‑TTS Family open‑source speech and sound generation model
LLM-based Reinforcement Learning audio edit model
Interface for OuteTTS models
A TTS model capable of generating ultra-realistic dialogue
Generate audiobooks from e-books, voice cloning & 1107+ languages
The official Python SDK for the ElevenLabs API
Free, high-quality text-to-speech API endpoint to replace OpenAI
Fast multimodal LLM for real-time voice interaction and AI apps
The official Python library for the Fish Audio API
A simple native web interface that uses ChatTTS to synthesize text
Voice Recognition to Text Tool
Official MiniMax Model Context Protocol (MCP) server
AI-powered tool for generating, optimizing, and translating subtitles