A Python tool to help extracting information from structured PDFs
Bibtex parser for Python 3
A fast, extensible and spec-compliant Markdown parser in pure Python
OCRmyPDF adds an OCR text layer to scanned PDF files
Open-Source Python3 tool for recognizing layouts, tables, and math
CLI tool and python library
A simple tool for reading in poorly redacted documents
Edit PDF files with Nano Banana
Video-based AI memory library. Store millions of text chunks in MP4
A Python utility / library to sort imports
Math OCR model that outputs LaTeX and markdown
Tools to ease the creation of snippets, syntax definitions, etc.
CLI tool to extract (meta)data from PDF and manipulate PDF files
Extract one time password (OTP) secrets from QR codes
Re-editable LaTeX/ typst graphics for Inkscape
The data structure for multimodal data
Fast Python reader and editor for ASAM MDF / MF4 (Measurement Format)
A Python application to add watermarks (text or image) to PDF files
2D & 3D TeX-Aware Vector Graphics Language
PDF Indexing Script: Searches PDF for words, records page numbers
Matplotlib style sheets to nicely format figures for scientific papers
100% offline, AI-powered PDF redaction
XML text markup for ancient documents
Query data on the command line with SQL-like SELECTs powered by Python
Quick and reliable way to convert NGINX configurations into JSON