Java library for working with real-world HTML
Lighter, faster browser kernel of blink to integrate HTML UI in apps
Python crawler that downloads image galleries and analyzes titles
Proxy crawler that aggregates, tests, and serves usable proxy nodes
Collection of reverse engineering articles curated for learning
Massive SQL injection vulnerability scanner for automated web testing
Python crawler to download photos and videos from Tumblr blogs
Turn entire websites into LLM-ready markdown or structured data
Fast Go CLI tool for downloading videos from many streaming sites
Python crawler and API for downloading JMComic albums and images
All-in-one reconnaissance and vulnerability scanning toolkit for sites
Python scraper based on AI
Multiprocess Selenium crawler for downloading images by keywords
Download and manage Bilibili Manga chapters with GUI downloader
The unix-way web crawler
Web crawler for archiving and backing up sites into WARC archives
MetaData html scraper and parser for Node.js (supports Promises
Python tool that automates JD.com login and product purchase tasks
ML-based HTML scraper that learns extraction rules from examples
Python tool for crawling and extracting structured data from news site
Vertical novel search engine with unified reading and tracking tools
Collection of 100+ Python web scraping projects and crawler examples
Collection of Python web scraping scripts for data extraction tasks
Python library providing APIs for automated website login workflows
Blazing fast Go framework for web crawling and data scraping tasks