ACHE is a web crawler for domain-specific search
Elasticsearch File System Crawler (FS Crawler)
Decentralized Web Search Engine
Download websites as e-book: pdf, txt, epub.
Firing Range is a test bed for web application security scanners
Open source web crawler for Java
Lays out dependencies (links) between Excel files in a visual graph
WebCollector is an open source web crawler framework based on Java.
Capable to "Crawl" a site and return a report of all links from it
Role Playing Game Engine
You can quickly find want what you need in HIT campus
A Lua-based crawling scripting language and leveraging selenium