OCR software, free and offline
Contexts Optical Compression
PDF to Markdown with vision models
Accurate × Fast × Comprehensive
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Enhances Tesseract OCR output using LLMs (local or API)
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR expert VLM powered by Hunyuan's native multimodal architecture
A high-quality tool for converting PDF to Markdown and JSON
Multilingual Document Layout Parsing in a Single Vision-Language Model
PDF scientific paper translation with preserved formats
Convert AI papers to GUI
Math OCR model that outputs LaTeX and markdown
A framework to enable multimodal models to operate a computer
Windrecorder is a memory search app that records everything
Get your documents ready for gen AI
A simple tool for reading in poorly redacted documents
Open Source Document Management System for Digital Archives
OpenRecall is a fully open-source, privacy-first alternative
A Repo For Document AI
OCR model for complex documents with layout-aware structured outputs
Document content and metadata extraction microservice
Structured data extraction and instruction calling with ML, LLM