Library for Rapid (Web) Crawler and Scraper Development
dude uncomplicated data extraction: A simple framework
ACHE is a web crawler for domain-specific search
A web scraping and browser automation library for Node.js
A service daemon to run Scrapy spiders
Distributed web crawler admin platform for spiders management
Distributed Crawler Management Framework Based on Scrapy
Java library for working with real-world HTML
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
The web scraper that's nearly impossible to block
Free batch downloader for image, wallpaper, video, audio, document,
Automate the download of entire Twitch.tv channels
Descubre archivos, rutas escondidas realizando busquedas avanzadas
Easy Spider is a distributed Perl Web Crawler Project from 2006
Web Scraper in Go, similar to BeautifulSoup
JavaScript + BeautifulSoup = JSSoup
Open-Source RPA Software (formerly Kantu)
Gospider - Fast web spider written in Go
Scrape job websites into a single spreadsheet with no duplicates.
Python bindings for the Chromium Embedded Framework (CEF)
Run headless Chrome/Chromium on AWS Lambda
The next web scraper, see through the <html> noise
SEO Macroscope is a website scanning tool, to check your website
Creating Scrapy scrapers via the Django admin interface
GOPA, a spider written in Golang, for Elasticsearch