Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi
Multilingual speech recognition and audio understanding model
Speech-to-text, text-to-speech, and speaker recognition
Underthesea - Vietnamese NLP Toolkit
OpenVINO™ Toolkit repository
Omnilingual ASR Open-Source Multilingual Speech Recognition
Replace OpenAI GPT with another LLM in your app
Han Language Processing
Cross-platform AI language practice app
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Enhances Tesseract OCR output using LLMs (local or API)
kaldi-asr/kaldi is the official location of the Kaldi project
Toolkit for conversational AI
Build your own AI friend
A pure JavaScript Multilingual OCR
Contexts Optical Compression
Open source AI VTuber platform with voice chat and Live2D avatars
Recognition and resolution of numbers, units, date/time, etc.
A full spaCy pipeline and models for scientific/biomedical documents
Developer friendly Natural Language Processing
Training data (data labeling, annotation, workflow) for all data types
Open Source Computer Vision Library
The media player for language learning, with dual subtitles
A GUI Agent app based on UI-TARS to control your computer using AI