OCRmyPDF adds an OCR text layer to scanned PDF files
Video-based AI memory library. Store millions of text chunks in MP4
The lxml XML toolkit for Python
TikZ figures for concepts in physics/chemistry/ML
Modern high-performance serialization utilities for Python
Re-editable LaTeX/ typst graphics for Inkscape
LaTeX CV generator from a YAML/JSON input file
Cortex Analyzers Repository
Package for converting and rendering markdown documents in TeX
Build GUI for your Python program with JavaScript, HTML, and CSS
A simple tool for reading in poorly redacted documents
Fast, correct Python JSON library supporting dataclasses, datetimes
Open-Source Python3 tool for recognizing layouts, tables, and math
JupyterLab extension for live editing of LaTeX documents
A Python tool to help extracting information from structured PDFs
The social web translator
Edit PDF files with Nano Banana
Yet another serialization library on top of dataclasses
CLI tool to extract (meta)data from PDF and manipulate PDF files
A file based wiki that uses markdown
Convert between CBOR, JSON, MessagePack, TOML, and YAML
Diff JSON and JSON-like structures in Python
Automated Integration Testing and Live Documentation for your API
tmux session manager. built on libtmux
Always know what to expect from your data