OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR software, free and offline
A high-quality tool for convert PDF to Markdown and JSON
Open Source Document Management System for Digital Archives
Contexts Optical Compression
Accurate × Fast × Comprehensive
Visual Causal Flow
Multilingual Document Layout Parsing in a Single Vision-Language Model
Convert AI papers to GUI
A Unified Toolkit for Deep Learning Based Document Image Analysis
Typeface from Ming Dynasty woodblock printed books
A supercharged version of paperless, scan, index and archive docs
CIntruder - OCR Bruteforcing Toolkit