OCR software, free and offline
Contexts Optical Compression
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Offline OCR command-line program for Windows that recognizes text in images
Enhances Tesseract OCR output using LLMs (local or API)
JavaScript OCR and text extraction for images and PDFs
A cross-platform software for text translation and recognition
Free OCR Software: No internet required, easy to use.
A pure JavaScript multilingual OCR
Math OCR model that outputs LaTeX and markdown
PDF scientific paper translation with preserved formats
Readest is a modern, feature-rich ebook reader
A simple tool for reading in poorly redacted documents
A Repo For Document AI
Document content and metadata extraction microservice
Fast and efficient unstructured data extraction
Structured data extraction and instruction calling with ML, LLM
A self-hostable bookmark-everything app
Extract and convert data from any document: images, PDFs, Word docs
OpenRecall is a fully open-source, privacy-first alternative
A community-supported supercharged version of paperless
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
OCR model for complex documents with layout-aware structured outputs