dude uncomplicated data extraction: A simple framework
Crawl a website starting from a URL, find relevant pages
AI-ready web crawler that extracts and structures website content
Turn entire websites into LLM-ready markdown or structured data
ExtractThinker is a Document Intelligence library for LLMs
CLI tool to extract (meta)data from PDF and manipulate PDF files
Lightweight library for scraping websites with LLMs
Structured data extraction and instruction calling with ML, LLM
Model Context Protocol server that integrates AgentQL's data
Parse text and tables from PDF files.
Unreal Engine Archives Explorer
Open source web scraping system for automated data collection tasks
Library for extracting streaming site data without official APIs
A Python tool to help extract information from structured PDFs
Automatic extraction of relevant features from time series
Clean network diagrams. One-time setup, zero upkeep
No-code LLM Platform to launch APIs and ETL Pipelines
A Chrome extension for automating your browser by connecting blocks
Extract the main article from a given URL with Node.js
Declarative web scraping
ContextGem: Effortless LLM extraction from documents
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Tools to build web AI agents that can authenticate
Burp Suite extension for JavaScript static analysis