Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
Open Source OCR Engine
Robust Speech Recognition via Large-Scale Weak Supervision
Image polygonal annotation with Python
State-of-the-art 2D and 3D Face Analysis Project
The cross-platform open-source app built for handwriting
Dev tools to reliably understand text and automate conversations
Captcha solver extension for humans
Awesome multilingual OCR toolkits based on PaddlePaddle
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A pure Javascript Multilingual OCR
OCRmyPDF adds an OCR text layer to scanned PDF files
OpenVINO™ Toolkit repository
Speech to Text to Speech, sends text as OSC messages
Cross-platform, customizable ML solutions for live and streaming media
Port of OpenAI's Whisper model in C/C++
C++ library for high performance inference on NVIDIA GPUs
Ready-to-use OCR with 80+ supported languages
A python library built to empower developers
High-performance neural network inference framework for mobile
Speech-to-text, text-to-speech, and speaker recognition
Formula recognition based on LaTeX-OCR and ONNXRuntime
Open-Source Python3 tool for recognizing layouts, tables, and math
Interactive video and image annotation tool for computer vision