Speech Note Linux app. Note taking, reading and translating
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-AI-Forge is a project developed around TTS generation model
Pre-trained Deep Learning models and demos
StreamSpeech is a seamless model for offline speech recognition
Multilingual speech recognition and audio understanding model
Port of OpenAI's Whisper model in C/C++
Open-source multi-speaker long-form text-to-speech model
Audio foundation model excelling in audio understanding
Foundational model for human-like, expressive TTS
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A generative speech model for daily dialogue
TTS with kokoro and onnx runtime
MARS5 speech model (TTS) from CAMB.AI
A lightweight text-to-speech model with zero-shot voice cloning
Open-source framework for intelligent speech interaction
The open-source voice synthesis studio powered by Qwen3-TTS
State-of-the-art TTS model under 25MB
A Conversational Speech Generation Model
Fast and accurate automatic speech recognition (ASR) for edge devices
A high-quality rapid TTS voice cloning model
Repo of Qwen2-Audio chat & pretrained large audio language model
PersonaPlex code
A text-to-speech, speech-to-text and speech-to-speech library
A TTS model capable of generating ultra-realistic dialogue