OCRmyPDF adds an OCR text layer to scanned PDF files
The lxml XML toolkit for Python
Tom Preston-Werner's obvious, minimal language
TikZ figures for concepts in physics/chemistry/ML
The social web translator
Situational Awareness Server compatible with TAK clients
Video-based AI memory library. Store millions of text chunks in MP4
Open-source Python 3 tool for recognizing layouts, tables, and math
Always know what to expect from your data
Edit PDF files with Nano Banana
A Python tool to help extract information from structured PDFs
Math OCR model that outputs LaTeX and markdown
A simple tool for reading in poorly redacted documents
A fast serialization and validation library, with built-in
The data structure for multimodal data
Create HTML profiling reports from pandas DataFrame objects
Open Security Controls Assessment Language (OSCAL)
Manipulate JSON-like data with NumPy-like idioms
Extract one-time password (OTP) secrets from QR codes
Yet another serialization library on top of dataclasses
pytablewriter is a Python library to write a table in various formats
minted is a LaTeX package that provides syntax highlighting
CLI tool to filter JSON and JSON Lines data with Python syntax
Easily serialize Data Classes to and from JSON
Package for converting and rendering markdown documents in TeX