A scalable web crawler framework for Java
Library for extracting streaming site data without official APIs
Java library for working with real-world HTML
Educational Python web scraping case collection for many sites
Ever wanted to download only a part of a Git repository.
Lightweight Java web crawler framework with jQuery-style extraction
DSTK - DataScience ToolKit for All of Us