A simple tool for reading in poorly redacted documents
Offline OCR (image text recognition) command-line program for Windows
Open Source Document Management System for Digital Archives
Enhances Tesseract OCR output using LLMs (local or API)
Edit PDF files with Nano Banana
A Repo For Document AI
#1 Locally hosted web application that allows you to work on PDFs
A pure JavaScript multilingual OCR
Document content and metadata extraction microservice
Generate a bunch of malicious PDF files with phone-home functionality
Cross-platform software for text translation and recognition
A community-supported supercharged version of paperless
A Python tool to help extract information from structured PDFs
.NET port of the iText library
Scan documents to PDF and other file types, as simply as possible.
Open source PDF editor
PDFsam, a desktop application to split, merge, mix, rotate PDF files
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Fast and efficient unstructured data extraction
Multilingual Document Layout Parsing in a Single Vision-Language Model
iLovePDF REST API - PHP Library
borb is a library for reading, creating and manipulating PDF files
Open source Java library for creating and editing PDF files
X-Plane plugin that displays a tablet to aid VR usage
A simple interface for working with TeX documents