Turn entire websites into LLM-ready markdown or structured data
Python & command-line tool to gather text on the Web
AI-ready web crawler that extracts and structures website content
Declarative web scraping
dude uncomplicated data extraction: A simple framework
Open source web scraping system for automated data collection tasks
Python tool for crawling and extracting structured data from news site
Python library for scraping and analyzing online news articles easily
Cross platform GUI tool for downloading videos from Bilibili sites
A fast, high-level web crawling and web scraping framework
Fast CLI web crawler for discovering endpoints in modern web apps
Desktop tool for collecting and exporting Xiaohongshu post data
An adaptive Web Scraping framework
Scrape tweets, profiles, followers and following from Twitter/X
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Collection of Python web scraping scripts for data extraction tasks
This is a public repository containing scrapers
A scalable web crawler framework for Java
Lightweight Ruby DSL for scraping structured data from web pages
High-performance Rust web crawler and scraper for large-scale data
Progressive PHP web crawler framework with jQuery-like DOM parsing
Lightweight .NET framework for fast web crawling and data scraping
Free batch downloader for image, wallpaper, video, audio, document,