OCRmyPDF adds an OCR text layer to scanned PDF files
The lxml XML toolkit for Python
A simple tool for reading in poorly redacted documents
Re-editable LaTeX/ typst graphics for Inkscape
Edit PDF files with Nano Banana
Open-Source Python3 tool for recognizing layouts, tables, and math
TikZ figures for concepts in physics/chemistry/ML
CLI tool to extract (meta)data from PDF and manipulate PDF files
Tom Preston-Werner's obvious, minimal language
Modern high-performance serialization utilities for Python
Cortex Analyzers Repository
A Python tool to help extracting information from structured PDFs
pytablewriter is a Python library to write a table in various formats
Package for converting and rendering markdown documents in TeX
The social web translator
Diff JSON and JSON-like structures in Python
Video-based AI memory library. Store millions of text chunks in MP4
openvpn-monitor is a web based OpenVPN monitor
Math OCR model that outputs LaTeX and markdown
Automated Integration Testing and Live Documentation for your API
An implementation of the JSON Schema specification for Python
tmux session manager. built on libtmux
Convert between CBOR, JSON, MessagePack, TOML, and YAML
Yet another serialization library on top of dataclasses
Always know what to expect from your data