SOTA Open Source TTS
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Speech Note Linux app. Note taking, reading and translating
Speech-AI-Forge is a project developed around TTS generation model
Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
A free, open source, and extensible speech-to-text application
Speech recognition module for Python
Comprehensive Gradio WebUI for audio processing
Free open source speech synthesizer for Russian and other languages
A text-to-speech, speech-to-text and speech-to-speech library
Speech to Text to Speech, sends text as OSC messages
Code for openai.fm, a demo for the OpenAI Speech API
Tokenizer-Free TTS for Multilingual Speech Generation
The open-source voice synthesis studio powered by Qwen3-TTS
Qwen3-TTS is an open-source series of TTS models
Open-source framework for intelligent speech interaction
Automatic Speech Recognition with Word-level Timestamps
Multilingual speech recognition and audio understanding model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
StreamSpeech is a seamless model for offline speech recognition
Generate audiobooks from EPUBs, PDFs and text with captions
Translate the video from one language to another and embed dubbing
Use Microsoft Edge's online text-to-speech service from Python