Python & command-line tool to gather text on the Web
dude uncomplicated data extraction: A simple framework
Automate the download of entire Twitch.tv channels
Easy Spider is a distributed Perl Web Crawler Project from 2006
Ever wanted to download only a part of a Git repository.