Document (PDF, Word, PPTX ...) extraction and parse API
Zero-copy PDF text extraction library written in Zig
Flexible Node.js AI-assisted crawler library
Extract and convert data from any document, images, pdfs, word doc
Fast and efficient unstructured data extraction
Crawl a website starting from a URL, find relevant pages
Open source NLP guide with models, methods, and real use cases
JavaScript OCR and text extraction for images and PDFs
A Simple and Universal Swarm Intelligence Engine
Python Audio Analysis Library: Feature Extraction, Classification
AI-ready web crawler that extracts and structures website content
dude uncomplicated data extraction: A simple framework
Clean network diagrams, One-time setup, zero upkeep
Eases DOM navigation for HTML and XML documents
A powerful obfuscator for JavaScript and Node.js
End-to-end pipeline converting generative videos
Did you say you like data?
Style React fast with 100% parity on React Native
A collection of hacks and one-off scripts
Image Processing library for Matlab
Synthetic data curation for post-training and data extraction
The highest-scoring AI memory system ever benchmarked
Vision AI browser agent for automation, testing, and extraction
The undetected self-hosted browser automation platform
Open source web scraping system for automated data collection tasks