Open source AI VTuber platform with voice chat and Live2D avatars
C++ inference library for multiple SVC/TTS
Framework for building real-time voice and multimodal AI agents
A high-quality rapid TTS voice cloning model
TTS with kokoro and onnx runtime
Capable of understanding text, audio, vision, video
High-Quality Voice Cloning TTS for 600+ Languages
Towards Human-Sounding Speech
Synchronized Translation for Videos
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A TTS model capable of generating ultra-realistic dialogue
State-of-the-art TTS model under 25MB
Build your own AI friend
Code for openai.fm, a demo for the OpenAI Speech API
Instant voice cloning by MIT and MyShell. Audio foundation model
Cross-platform AI language practice app
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
A simple, high-quality voice conversion tool focused on ease of use
A single Gradio + React WebUI with extensions for ACE-Step
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Open-source model for program synthesis
A cross-platform software for text translation and recognition
Port of OpenAI's Whisper model in C/C++
A sound cloning tool with a web interface, using your voice
Interface for OuteTTS models