Java library for working with real-world HTML
A scalable web crawler framework for Java
ACHE is a web crawler for domain-specific search
Distributed web crawler admin platform for spider management
An Android rich text library that supports mixing images and text
Open source web crawler for Java
Ever wanted to download only a part of a Git repository?
Android app for saving webpages for offline reading
DSTK - DataScience ToolKit for All of Us
WebCollector is an open source web crawler framework based on Java.
Open source Search Engine and Enterprise Search