Offline speech recognition API for Android, iOS, Raspberry Pi
Open Source OCR Engine
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art 2D and 3D Face Analysis Project
Captcha solver extension for humans
Awesome multilingual OCR toolkits based on PaddlePaddle
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
Port of OpenAI's Whisper model in C/C++
OpenVINO™ Toolkit repository
Speech recognition module for Python
Image polygonal annotation with Python
Lightweight Face Recognition and Facial Attribute Analysis
A pure JavaScript Multilingual OCR
Interactive video and image annotation tool for computer vision
Dev tools to reliably understand text and automate conversations
Speech-to-text, text-to-speech, and speaker recognition
Speech to Text to Speech, sends text as OSC messages
Ready-to-use OCR with 80+ supported languages
An image processing library written entirely in JavaScript for Node
OCRmyPDF adds an OCR text layer to scanned PDF files
C++ library for high performance inference on NVIDIA GPUs
Venom is the most complete JavaScript library for WhatsApp
Cross-platform, customizable ML solutions for live and streaming media
Parser generator to read, process, or translate structured text
A dynamic library tweak for WeChat macOS