Code for openai.fm, a demo for the OpenAI Speech API
Synchronized Translation for Videos
Cross-platform AI language practice app
A voice cloning tool with a web interface, using your voice
A simple native web interface that uses ChatTTS to synthesize speech from text
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
The Python library for real-time communication
Workflow and speech recognition app
One-click deployment (including offline integration package)
Real-time voice interactive digital human
Speech-AI-Forge is a project developed around TTS generation models
Open source text-to-speech tool, supports extra-long text
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Spark-TTS Inference Code
A high-quality rapid TTS voice cloning model
Build Vision Agents quickly with any model or video provider
Free & Easy AI Voice Accounting Software For Blind & Speechless People
Easy AI Software for Blind, Deaf, Handicapped, Disabled People
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Amica is an open source interface for interactive communication
A web UI for different audio-related neural networks
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Speed Reading