Industrial-level controllable zero-shot text-to-speech system
Synchronized Translation for Videos
Multi-Voice and Prompt-Controlled TTS Engine
Multi-lingual large voice generation model, providing inference
A nearly-live implementation of OpenAI's Whisper
Python library and CLI tool to interface with Google Translate
TTS with kokoro and onnx runtime
Toolkit for audio, music, and speech generation
Offline inference engine for art, real-time voice conversations
Use Microsoft Edge's online text-to-speech service from Python
One-click deployment (including offline integration package)
A single Gradio + React WebUI with extensions for ACE-Step
The python library for real-time communication
Towards Human-Level Text-to-Speech through Style Diffusion
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
MARS5 speech model (TTS) from CAMB.AI
Foundational model for human-like, expressive TTS
C++ inference library for multiple SVC/TTS
Provides CTP stock options and Zhongtai Securities XTP
The TypeScript AI agent framework
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second