OCR software, free and offline
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR expert VLM powered by Hunyuan's native multimodal architecture
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Convert AI papers to GUI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A Repo For Document AI
A community-supported supercharged version of paperless
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A Python application to add watermarks (text or image) to PDF files
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
Img2Txt - Extract Text From Images using AI
An OCR translator tool built with Tesseract and python-opencv
The ultimate tool to automate custom Telegram message forwarding
CCTV Footage Timestamp Search Tool
A Unified Toolkit for Deep Learning Based Document Image Analysis
e-Dokyumento is a web-based Document Management System (DMS)