Turn entire websites into LLM-ready markdown or structured data
Crawl a website starting from a URL, find relevant pages
AI-ready web crawler that extracts and structures website content
Clone any website with one command using AI coding agents
dude uncomplicated data extraction: A simple framework
ExtractThinker is a Document Intelligence library for LLMs
CLI tool to extract (meta)data from PDF and manipulate PDF files
Lightweight library for scraping web-sites with LLMs
MD/.JSON Document OCR and structured data extraction API
Structured data extraction and instruction calling with ML, LLM
Automate browser-based workflows with LLMs and Computer Vision
Fast, local-first web content extraction for LLMs
Unreal Engine Archives Explorer
AI-first Ruby framework for building fast, flexible web scraping spide
PDF Parser for AI-ready data. Automate PDF accessibility
A chrome extension for automating your browser by connecting blocks
No-code LLM Platform to launch APIs and ETL Pipelines
To extract main article from given URL with Node.js
Model Context Protocol server that integrates AgentQL's data
Fast and efficient unstructured data extraction
Automatic extraction of relevant features from time series
Tools to build web AI agents that can authenticate
Extract and convert data from any document, images, pdfs, word doc
Clean network diagrams, One-time setup, zero upkeep
Enhance any agent's browser use skill