A scalable web crawler framework for Java
Python crawler and API for downloading JMComic albums and images
Free batch downloader for image, wallpaper, video, audio, document,
dude uncomplicated data extraction: A simple framework
The next generation web scraping framework
ACHE is a web crawler for domain-specific search
Simple Python framework for building multithreaded web crawlers
DataHen Till is a companion tool to your existing web scraper
Educational Python web scraping case collection for many sites
Pythonic HTML Parsing for Humans
Ever wanted to download only a part of a Git repository.
Open source web crawler for Java
Lightweight Java web crawler framework with jQuery-style extraction
WebCollector is an open source web crawler framework based on Java.