Open Source OCR Engine
Face recognition with deep neural networks
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
A Lightweight Face Recognition and Facial Attribute Analysis Library
Offline speech recognition API for Android, iOS, Raspberry Pi
State-of-the-art 2D and 3D Face Analysis Project
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Port of OpenAI's Whisper model in C/C++
OCR software, free and offline
CAPTCHA solver extension for humans
Audio foundation model excelling in audio understanding
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Open-source industrial-grade ASR models
Contexts Optical Compression
kaldi-asr/kaldi is the official location of the Kaldi project
A pure JavaScript Multilingual OCR
On-device Speech Recognition for Apple Silicon
SikuliX version 2.0.0+ (2019+)
Multilingual speech recognition and audio understanding model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A PyTorch-based Speech Toolkit