Python & command-line tool to gather text on the Web
Python library for scraping and analyzing online news articles easily
Python tool for crawling and extracting structured data from news site
Python crawler that downloads image galleries and analyzes titles
Defeating Google's audio reCaptcha with 85% accuracy
Modular web site spider for web developers.
A reader and decompiler for files in the CHM format