A scalable web crawler framework for Java
Library for extracting streaming site data without official APIs
Java library for working with real-world HTML
Distributed web crawler admin platform for spiders management
Ever wanted to download only a part of a Git repository.
DSTK - DataScience ToolKit for All of Us
Android app for saving webpages for offline reading