Open source enterprise search server for websites, files, and data
A scalable web crawler framework for Java
Distributed web crawler admin platform for spiders management
ACHE is a web crawler for domain-specific search
Automated mobile app crawler and testing tool built on Appium
An Android rich text class library that supports graphic & text mixing
Lightweight Java web crawler framework with jQuery-style extraction
WebCollector is an open source web crawler framework based on Java.
Open source Search Engine and Enterprise Search