The complete web scraping toolkit for PHP
A scalable web crawler framework for Java
Java library for working with real-world HTML
Distributed web crawler admin platform for spider management
Distributed Crawler Management Framework Based on Scrapy
ACHE is a web crawler for domain-specific search
A service daemon to run Scrapy spiders
Ever wanted to download only a part of a Git repository?
Open source web crawler for Java
Android app for saving webpages for offline reading
WebCollector is an open source web crawler framework based on Java.
Open source Search Engine and Enterprise Search