Open speech-to-speech models and pipelines by Hugging Face
The open-source voice synthesis studio powered by Qwen3-TTS
Long-form streaming TTS system for multi-speaker dialogue generation
Tokenizer-Free TTS for Multilingual Speech Generation
Open Source Speech Language Model
Translate a video from one language to another and embed dubbing
Free open source speech synthesizer for Russian and other languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open-source framework for intelligent speech interaction
Offline text-to-speech synthesis for Python
One minute of voice data is enough to train a good TTS model
Controllable & emotion-expressive zero-shot TTS
A generative speech model for daily dialogue
Industrial-level controllable zero-shot text-to-speech system
One-stop AI digital human system with video voice synthesis tools
A fast TTS architecture with conditional flow matching
Production ready toolkit to run AI locally
A text-to-speech, speech-to-text and speech-to-speech library
Qwen3-TTS is an open-source series of TTS models
C++ inference library for multiple SVC/TTS
Framework for building real-time voice and multimodal AI agents
StreamSpeech is a seamless model for offline speech recognition
Open source AI VTuber platform with voice chat and Live2D avatars
Spark-TTS Inference Code
TTS with Kokoro and ONNX Runtime