Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
Contexts Optical Compression
State-of-the-art 2D and 3D Face Analysis Project
Crowdsourcing platform for full text transcription and tagging
Accurate × Fast × Comprehensive
OCR expert VLM powered by Hunyuan's native multimodal architecture
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Visual Causal Flow
Enhances Tesseract OCR output using LLMs (local or API)
A framework to enable multimodal models to operate a computer
OCRmyPDF adds an OCR text layer to scanned PDF files
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Library for OCR-related tasks powered by Deep Learning
A simple tool for reading in poorly redacted documents
AI Agent Application Development Framework
Formula recognition based on LaTeX-OCR and ONNXRuntime
Replace OpenAI GPT with another LLM in your app
Omnilingual ASR Open-Source Multilingual SpeechRecognition
NLP Cloud serves high performance pre-trained or custom models for NER
Repo of Qwen2-Audio chat & pretrained large audio language model
Qwen3-Coder is the code version of Qwen3
Image processing in Python
An on-premises, OCR-free unstructured data extraction