Convert websites into structured APIs automatically with Python tool
Lighter, faster browser kernel of blink to integrate HTML UI in apps
High-performance Rust web crawler and scraper for large-scale data
Collection of JS reverse engineering examples for web scraping study
Python crawler for collecting and downloading Sina Weibo user data
Scrape tweets, profiles, followers and following from Twitter/X
A Python library for automating interaction with websites
NBA Stats API via Basketball Reference
Python library for scraping and analyzing online news articles easily
This is a public repository containing scrapers
Patches for Puppeteer and Playwright to reduce automation detection
Python crawler to download photos and videos from Tumblr blogs
Realtime crawler for COVID-19 outbreak statistics from DXY data
Small event-delegation library for decoupling event binding and handli
Fast Go CLI tool for downloading videos from many streaming sites
Distributed Crawler Management Framework Based on Scrapy
Easy Spider is a distributed Perl Web Crawler Project from 2006
A service daemon to run Scrapy spiders
Using industrial automation techniques for creating web scraping tools
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
ML-based HTML scraper that learns extraction rules from examples
JavaScript + BeautifulSoup = JSSoup
Python bindings for the Chromium Embedded Framework (CEF)
Run headless Chrome/Chromium on AWS Lambda
Python tool for scraping search engine results from many providers