Open-Source RPA Software (formerly Kantu)
Open source web scraping system for automated data collection tasks
High-performance Rust web crawler and scraper for large-scale data
The complete web scraping toolkit for PHP
Progressive PHP web crawler framework with jQuery-like DOM parsing
A fast, high-level web crawling and web scraping framework
CLI tool to save complete web pages as single self-contained HTML file
Powerful Python crawler framework for scalable web scraping tasks
Open source Douyin crawler for collecting and downloading public data
Blazing fast Go framework for web crawling and data scraping tasks
Lightweight Ruby DSL for scraping structured data from web pages
Lightweight .NET framework for fast web crawling and data scraping
AI-ready web crawler that extracts and structures website content
AI-first Ruby framework for building fast, flexible web scraping spide
Movie metadata scraper and organizer for media libraries and NFO
Python crawler to download photos and videos from Tumblr blogs
Python tool for crawling and extracting structured data from news site
Redis-based components for Scrapy
Lightweight Python tool for downloading videos from many platforms
Python crawler for collecting and downloading Sina Weibo user data
A universal web-util for PHP
Fast Go-based CLI scanner for running automated search engine dorks
A service daemon to run Scrapy spiders
Web crawler for archiving and backing up sites into WARC archives
Simple Python framework for building multithreaded web crawlers