OCR software, free and offline
Open Source OCR Engine
OCRmyPDF adds an OCR text layer to scanned PDF files
A high-quality tool for convert PDF to Markdown and JSON
Visual Causal Flow
Contexts Optical Compression
Accurate × Fast × Comprehensive
JavaScript OCR and text extraction for images and PDFs
OCR offline image text recognition command line windows program
Free OCR Software: No internet required, easy to use.
Awesome multilingual OCR toolkits based on PaddlePaddle
Get your documents ready for gen AI
Enhances Tesseract OCR output using LLMs (local or API)
Open Source Document Management System for Digital Archives
A Repo For Document AI
A pure Javascript Multilingual OCR
A cross-platform software for text translation and recognition
A community-supported supercharged version of paperless
Document content and metadata extraction microservice
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Multilingual Document Layout Parsing in a Single Vision-Language Model
Scan documents to PDF and other file types, as simply as possible.
Fast and efficient unstructured data extraction
Readest is a modern, feature-rich ebook reader