Open Source OCR Engine
Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
State-of-the-art 2D and 3D Face Analysis Project
A pure JavaScript Multilingual OCR
Open source semantic search and text analytics for large document sets
Accurate × Fast × Comprehensive
Port of OpenAI's Whisper model in C/C++
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
A framework to enable multimodal models to operate a computer
Cross-platform software for text translation and recognition
SikuliX version 2.0.0+ (2019+)
Open source AI VTuber platform with voice chat and Live2D avatars
OCRmyPDF adds an OCR text layer to scanned PDF files
Enhances Tesseract OCR output using LLMs (local or API)
Towards Studio-Grade Character Animation via In-Context Learning of 3D
AI Agent Application Development Framework
A proof-of-concept Jupyter extension which converts English queries
Formula recognition based on LaTeX-OCR and ONNXRuntime
A dynamic library tweak for WeChat macOS
A gallery that showcases on-device ML/GenAI use cases
Omnilingual ASR Open-Source Multilingual Speech Recognition