SOTA Open Source TTS
Open speech-to-speech models and pipelines by Hugging Face
Speech-AI-Forge is a project developed around TTS generation models
Robust Speech Recognition via Large-Scale Weak Supervision
A text-to-speech, speech-to-text and speech-to-speech library
Synchronized Translation for Videos
Comprehensive Gradio WebUI for audio processing
Generate audiobooks from EPUBs, PDFs and text with captions
Speech recognition module for Python
Open Source Speech Language Model
StreamSpeech is a seamless model for offline speech recognition
Qwen3-TTS is an open-source series of TTS models
A robust, efficient, low-latency speech-to-text library
A generative speech model for daily dialogue
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Unlimited text-to-speech conversion
Offline text-to-speech synthesis for Python
Use Microsoft Edge's online text-to-speech service from Python
Industrial-level controllable zero-shot text-to-speech system
A lightweight text-to-speech model with zero-shot voice cloning
A high-quality rapid TTS voice cloning model
Spark-TTS Inference Code
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
A TTS that fits in your CPU (and pocket)
TTS with Kokoro and ONNX Runtime