OCRmyPDF adds an OCR text layer to scanned PDF files
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A high-quality tool for convert PDF to Markdown and JSON
Accurate × Fast × Comprehensive
Ready-to-use OCR with 80+ supported languages
Open Source Document Management System for Digital Archives
Library for OCR-related tasks powered by Deep Learning
Visual Causal Flow
Convert AI papers to GUI
Contexts Optical Compression
OCR expert VLM powered by Hunyuan's native multimodal architecture
Multilingual Document Layout Parsing in a Single Vision-Language Model
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
An OCR translator tool made by utilizing tesseract & python-opencv
CCTV Footage Timestamp Search Tool
A Unified Toolkit for Deep Learning Based Document Image Analysis
e-Dokyumento is web-based Document Management System (DMS)
Ozyr is a simple and easy to use OCR snipping tool
Typeface from Ming Dynasty woodblock printed books
A supercharged version of paperless, scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux