OCRmyPDF adds an OCR text layer to scanned PDF files
The lxml XML toolkit for Python
TikZ figures for concepts in physics/chemistry/ML
Open-Source Python3 tool for recognizing layouts, tables, and math
Edit PDF files with Nano Banana
A simple tool for reading in poorly redacted documents
Math OCR model that outputs LaTeX and markdown
Video-based AI memory library. Store millions of text chunks in MP4
pytablewriter is a Python library to write a table in various formats
A Fast, Extensible Progress Bar for Python and CLI
Re-editable LaTeX/Typst graphics for Inkscape
JupyterLab extension for live editing of LaTeX documents
Build a GUI for your Python program with JavaScript, HTML, and CSS
Extract one-time password (OTP) secrets from QR codes
A tool that automatically formats Python code to conform to the PEP 8 style guide
Tom Preston-Werner's obvious, minimal language
Pure Python library for LaTeX to MathML conversion
CLI tool to filter JSON and JSON Lines data with Python syntax
Diff JSON and JSON-like structures in Python
Formula recognition based on LaTeX-OCR and ONNXRuntime
tmux session manager, built on libtmux
Easily serialize Data Classes to and from JSON
Yet another serialization library on top of dataclasses
CLI tool to extract (meta)data from PDF and manipulate PDF files
Fault-tolerant Python3 package for searching LaTeX documents