Open source enterprise search server for websites, files, and data
A scalable web crawler framework for Java
Java library for working with real-world HTML
Free batch downloader for image, wallpaper, video, audio, document,
Distributed web crawler admin platform for spiders management
ACHE is a web crawler for domain-specific search
Educational Python web scraping case collection for many sites
Ever wanted to download only a part of a Git repository.
Open source web crawler for Java
Lightweight Java web crawler framework with jQuery-style extraction
Android app for saving webpages for offline reading
WebCollector is an open source web crawler framework based on Java.
Open source Search Engine and Enterprise Search