Open-source LLM-friendly Web Crawler & Scraper
A fast, high-level web crawling and web scraping framework
Python & command-line tool to gather text on the Web
Turn entire websites into LLM-ready markdown or structured data
A browser testing and web crawling library for PHP and Symfony
The unix-way web crawler
Check links in web documents or full websites
Redis-based components for Scrapy
Library for Rapid (Web) Crawler and Scraper Development
A free, feature-rich web analyzer and exporter/cloner you will love!
Distributed Crawler Management Framework Based on Scrapy
ACHE is a web crawler for domain-specific search
Goutte, a simple PHP Web Scraper
Easy Spider is a distributed Perl Web Crawler Project from 2006
Gospider - Fast web spider written in Go
Chrome Headless docker images built upon alpine official image
Decentralized Web Search Engine
Download websites as e-books: PDF, TXT, EPUB.
Creating Scrapy scrapers via the Django admin interface
Open source web crawler for Java
Perl Web Scraping Project
WebCollector is an open source web crawler framework based on Java.
Can crawl a site and return a report of all the links found on it
IOSec Addons are enhancements for web security and crawler detection
Zoozle 2008 - 2010 Webpage, Tools and SQL Files