A Python tool to help extracting information from structured PDFs
Bibtex parser for Python 3
OCRmyPDF adds an OCR text layer to scanned PDF files
Open-Source Python3 tool for recognizing layouts, tables, and math
Math OCR model that outputs LaTeX and markdown
Java library for working with real-world HTML
HTML Loader
Re-editable LaTeX/ typst graphics for Inkscape
A toolchain for web projects, aimed to provide functionalities
Video-based AI memory library. Store millions of text chunks in MP4
CLI tool to extract (meta)data from PDF and manipulate PDF files
A fast, extensible and spec-compliant Markdown parser in pure Python
Converts CSS selectors to XPath expressions
CLI tool and python library
A Python utility / library to sort imports
Tools to ease the creation of snippets, syntax definitions, etc.
Extract one time password (OTP) secrets from QR codes
Matplotlib style sheets to nicely format figures for scientific papers
The data structure for multimodal data
The Go support for Google's protocol buffers
2D & 3D TeX-Aware Vector Graphics Language
Fast Python reader and editor for ASAM MDF / MF4 (Measurement Format)
Relational database replication tool based on XML Schema
A Python application to add watermarks (text or image) to PDF files
Query data on the command line with SQL-like SELECTs powered by Python