Open-Source RPA Software (formerly Kantu)
Open source web scraping system for automated data collection tasks
High-performance Rust web crawler and scraper for large-scale data
Asyncio-based Python framework for building fast web crawling spiders
A fast, high-level web crawling and web scraping framework
Progressive PHP web crawler framework with jQuery-like DOM parsing
The complete web scraping toolkit for PHP
Open source Douyin crawler for collecting and downloading public data
Movie metadata scraper and organizer for media libraries and NFO
Python crawler for collecting and downloading Sina Weibo user data
Blazing fast Go framework for web crawling and data scraping tasks
Convert websites into structured APIs automatically with Python tool
CLI tool to save complete web pages as single self-contained HTML file
Python tool for crawling and extracting structured data from news site
Python crawler to download photos and videos from Tumblr blogs
Lightweight Python tool for downloading videos from many platforms
AI-ready web crawler that extracts and structures website content
Powerful Python crawler framework for scalable web scraping tasks
Redis-based components for Scrapy
Lightweight Ruby DSL for scraping structured data from web pages
Lightweight .NET framework for fast web crawling and data scraping
Fast Go-based CLI scanner for running automated search engine dorks
A service daemon to run Scrapy spiders
Web crawler for archiving and backing up sites into WARC archives
ML-based HTML scraper that learns extraction rules from examples