SOTA Open Source TTS
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Speech-AI-Forge is a project developed around TTS generation model
Robust Speech Recognition via Large-Scale Weak Supervision
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from EPUBs, PDFs and text with captions
Comprehensive Gradio WebUI for audio processing
Speech recognition module for Python
Quick illustration of how one can easily read books together with LLMs
StreamSpeech is a seamless model for offline speech recognition
Qwen3-TTS is an open-source series of TTS models
Qwen3-omni is a natively end-to-end, omni-modal LLM
A robust, efficient, low-latency speech-to-text library
A generative speech model for daily dialogue
Open Source Speech Language Model
Chuyển đổi văn bản thành giọng nói không giới hạn
Offline Text To Speech synthesis for python
Use Microsoft Edge's online text-to-speech service from Python
A TTS that fits in your CPU (and pocket)
Industrial-level controllable zero-shot text-to-speech system
A lightweight text-to-speech model with zero-shot voice cloning
Spark-TTS Inference Code
End-to-end speech processing toolkit
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A high-quality rapid TTS voice cloning model