Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi
Multilingual speech recognition and audio understanding model
Speech-to-text, text-to-speech, and speaker recognition
Underthesea - Vietnamese NLP Toolkit
OpenVINO™ Toolkit repository
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Replace OpenAI GPT with another LLM in your app
Han Language Processing
Cross-platform AI language practice app
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Build your own AI friend
Enhances Tesseract OCR output using LLMs (local or API)
Toolkit for conversational AI
Open source AI VTuber platform with voice chat and Live2D avatars
kaldi-asr/kaldi is the official location of the Kaldi project
A pure Javascript Multilingual OCR
Contexts Optical Compression
Recognition and resolution of numbers, units, date/time, etc.
A full spaCy pipeline and models for scientific/biomedical documents
Developer friendly Natural Language Processing
The media player for language learning, with dual subtitles
A GUI Agent app based on UI-TARS to control your computer using AI
Fast and accurate automatic speech recognition (ASR) for edge devices
Open Source Computer Vision Library