A scalable web crawler framework for Java
Java library for working with real-world HTML
Distributed web crawler admin platform for spiders management
An Android rich text class library that supports graphic & text mixing
Ever wanted to download only a part of a Git repository.
Lightweight Java web crawler framework with jQuery-style extraction