Open Source OCR Engine
OCR software, free and offline
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Offline OCR image text recognition command-line program for Windows
Ready-to-use OCR with 80+ supported languages
A pure JavaScript Multilingual OCR
Free OCR Software: No internet required, easy to use.
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
A cross-platform software for text translation and recognition
Readest is a modern, feature-rich ebook reader
Convert AI papers to GUI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A Repo For Document AI
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
A community-supported supercharged version of paperless
Assist in organizing your piles of documents
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Qwen3-Omni is a natively end-to-end, omni-modal LLM