Document (PDF, Word, PPTX ...) extraction and parse API
Did you say you like data?
Extrect selected entries from LDIF files like grep
A Java library used to read and extract data from NFC EMV credit cards
Extract public Instagram account information from usernames
A pure-python PDF library capable of splitting, merging, cropping
Extract structured data from webpages using LLM-powered scraping
An advanced memory forensics framework
An easy to use, powerful crawler implemented in PHP
The Refactoring library based off the Refactoring book
Extract any website's complete design system with one command
Saves Discord chat logs to a file
JavaScript OCR and text extraction for images and PDFs
Design engineering for Claude Code
OCR model for complex documents with layout-aware structured outputs
An on-premises, OCR-free unstructured data extraction
Create prompt-friendly codebase digests from any Git repository URL
Blazing fast Go framework for web crawling and data scraping tasks
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML
Crawl a website starting from a URL, find relevant pages
To extract main article from given URL with Node.js
Self-hosted AI audio transcription
Python crawler for collecting and downloading Sina Weibo user data
PDFCraft is a free, privacy-focused PDF toolkit
Lightweight Python tool for downloading videos from many platforms