SOTA Open Source TTS
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Speech-AI-Forge is a project developed around TTS generation model
Robust Speech Recognition via Large-Scale Weak Supervision
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from EPUBs, PDFs and text with captions
Comprehensive Gradio WebUI for audio processing
Quick illustration of how one can easily read books together with LLMs
Speech recognition module for Python
A high-quality rapid TTS voice cloning model
Open Source Speech Language Model
A generative speech model for daily dialogue
Qwen3-TTS is an open-source series of TTS models
Chuyển đổi văn bản thành giọng nói không giới hạn
A robust, efficient, low-latency speech-to-text library
StreamSpeech is a seamless model for offline speech recognition
Qwen3-omni is a natively end-to-end, omni-modal LLM
Industrial-level controllable zero-shot text-to-speech system
The behavior guidance framework for customer-facing LLM agents
Offline Text To Speech synthesis for python
PersonaPlex code
Use Microsoft Edge's online text-to-speech service from Python
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A lightweight text-to-speech model with zero-shot voice cloning
High-quality multi-lingual text-to-speech library by MyShell.ai