Open Source OCR Engine
Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis Library
Speech-to-text, text-to-speech, and speaker recognition
OCR software, free and offline
Port of OpenAI's Whisper model in C/C++
Multilingual speech recognition and audio understanding model
Speech recognition module for Python
Captcha solver extension for humans
kaldi-asr/kaldi is the official location of the Kaldi project
Contexts Optical Compression
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Open-source industrial-grade ASR models
Audio foundation model excelling in audio understanding
A pure JavaScript Multilingual OCR
SikuliX version 2.0.0+ (2019+)
On-device Speech Recognition for Apple Silicon
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OpenVINO™ Toolkit repository
Fast and accurate automatic speech recognition (ASR) for edge devices