Open Source OCR Engine
OCR software, free and offline
Web application that allows you to perform operations on PDF files
OCRmyPDF adds an OCR text layer to scanned PDF files
PDF Parser for AI-ready data. Automate PDF accessibility
PDF to Markdown with vision models
A high-quality tool for convert PDF to Markdown and JSON
Visual Causal Flow
JavaScript OCR and text extraction for images and PDFs
MD/.JSON Document OCR and structured data extraction API
PDF scientific paper translation with preserved formats
Get your documents ready for gen AI
A simple tool for reading in poorly redacted documents
Open Source Document Management System for Digital Archives
#1 Locally hosted web application that allows you to work on PDFs
A Repo For Document AI
A community-supported supercharged version of paperless
Document content and metadata extraction microservice
Scan documents to PDF and other file types, as simply as possible.
Fast and efficient unstructured data extraction
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Create, Edit, Delete, Organize , Convert, Export, Secure & Sign PDF.
Download books from the hathitrust website in a fast and easy manner
A Python application to add watermarks (text or image) to PDF files