Command-line Bilibili video and danmaku downloader with batch support
Web crawler that finds hidden web directories without brute force
Python package to retrieve and manage data from IMDb
Python library providing APIs for automated website login workflows
ML-based HTML scraper that learns extraction rules from examples
Simple Python framework for building multithreaded web crawlers
Search engine and data mining applications and ClueWeb datasets
Intelligent proxy pool for collecting and managing public proxies
Zenoss - Intelligent IT Operations Management
Instagram profile crawler that extracts posts, tags, and stats
Repair corrupted pcap and pcapng files
Educational Python web scraping case collection for many sites
Async Python framework for fast and flexible web scraping spiders
AST-based JavaScript reverse engineering and variable tracing toolkit
Timothy is a cloud-based storage system designed to document your work
Python tool for scraping search engine results from many providers
Advanced toolkit for detecting and exploiting CSRF vulnerabilities
Collection of Python e-commerce and website crawler example projects
Python crawler that downloads image galleries and analyzes titles
Python library to crawl and retrieve data from WeChat accounts
Asyncio-based Python framework for building fast web crawling spiders