Java library for working with real-world HTML
Library for extracting streaming site data without official APIs
A scalable web crawler framework for Java
An Android rich text class library that supports graphic & text mixing
Open source web crawler for Java
Lightweight Java web crawler framework with jQuery-style extraction
Android app for saving webpages for offline reading