Java library for working with real-world HTML
ACHE is a web crawler for domain-specific search
Library for Rapid (Web) Crawler and Scraper Development
The web scraper that's nearly impossible to block
DataHen Till is a companion tool to your existing web scraper
Pythonic HTML Parsing for Humans
Android app for saving webpages for offline reading