Did you say you like data?
Document (PDF, Word, PPTX ...) extraction and parse API
A Java library used to read and extract data from NFC EMV credit cards
Extrect selected entries from LDIF files like grep
An advanced memory forensics framework
A pure-python PDF library capable of splitting, merging, cropping
Extract public Instagram account information from usernames
The Refactoring library based off the Refactoring book
An easy to use, powerful crawler implemented in PHP
Saves Discord chat logs to a file
Extract structured data from webpages using LLM-powered scraping
An on-premises, OCR-free unstructured data extraction
JavaScript OCR and text extraction for images and PDFs
OCR model for complex documents with layout-aware structured outputs
Create prompt-friendly codebase digests from any Git repository URL
Blazing fast Go framework for web crawling and data scraping tasks
Design engineering for Claude Code
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
A fast, high-level web crawling and web scraping framework
To extract main article from given URL with Node.js
Lightweight Python tool for downloading videos from many platforms
Crawl a website starting from a URL, find relevant pages
Self-hosted AI audio transcription
PDFCraft is a free, privacy-focused PDF toolkit
Open source semantic search and text analytics for large document sets