Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCRmyPDF adds an OCR text layer to scanned PDF files
Convert AI papers to GUI
A high-quality tool for convert PDF to Markdown and JSON
Accurate × Fast × Comprehensive
Visual Causal Flow
OCR software, free and offline
Contexts Optical Compression
Open Source Document Management System for Digital Archives
OCR expert VLM powered by Hunyuan's native multimodal architecture
Multilingual Document Layout Parsing in a Single Vision-Language Model
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
An OCR translator tool made by utilizing tesseract & python-opencv
A Unified Toolkit for Deep Learning Based Document Image Analysis
CCTV Footage Timestamp Search Tool
e-Dokyumento is web-based Document Management System (DMS)
Ozyr is a simple and easy to use OCR snipping tool
Typeface from Ming Dynasty woodblock printed books
A supercharged version of paperless, scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux