Open source enterprise search server for websites, files, and data
Python library for scraping and analyzing online news articles easily
Python crawler for collecting and downloading Sina Weibo user data
Python tool for crawling and extracting structured data from news site
dude uncomplicated data extraction: A simple framework
Easy Spider is a distributed Perl Web Crawler Project from 2006
Python crawler that downloads image galleries and analyzes titles