Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Open-source industrial-grade ASR models
kaldi-asr/kaldi is the official location of the Kaldi project
Audio foundation model excelling in audio understanding
A PyTorch-based Speech Toolkit
Captcha solver extension for humans
On-device Speech Recognition for Apple Silicon
A free, open source, and extensible speech-to-text application
Port of OpenAI's Whisper model in C/C++
Cross-platform AI language practice app
StreamSpeech is a seamless model for offline speech recognition
Multilingual Automatic Speech Recognition with word-level timestamps
Voice Recognition to Text Tool
Toolkit for conversational AI
OpenVINO™ Toolkit repository
Underthesea - Vietnamese NLP Toolkit
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Repo of Qwen2-Audio chat & pretrained large audio language model
A cross-platform software for text translation and recognition
Speech to Text to Speech, sends text as OSC messages
Training data (data labeling, annotation, workflow) for all data types
Capable of understanding text, audio, vision, video