A fast, high-level web crawling and web scraping framework
Python crawler for collecting and downloading Sina Weibo user data
Python tool for crawling and extracting structured data from news site
Realtime crawler for COVID-19 outbreak statistics from DXY data
Cross platform GUI tool for downloading videos from Bilibili sites
Convert websites into structured APIs automatically with Python tool
Python crawler to download photos and videos from Tumblr blogs
Collection of JS reverse engineering examples for web scraping study
Lightweight Python tool for downloading videos from many platforms
Scrape job websites into a single spreadsheet with no duplicates.
Asyncio-based Python framework for building fast web crawling spiders
AI-ready web crawler that extracts and structures website content
Easily turn large sets of image urls to an image dataset
Redis-based components for Scrapy
All-in-one Python web reconnaissance tool for fast target analysis
Open source Douyin crawler for collecting and downloading public data
NBA Stats API via Basketball Reference
Web app for Scrapyd cluster management
Python & command-line tool to gather text on the Web
Python library for scraping and analyzing online news articles easily
Movie metadata scraper and organizer for media libraries and NFO
Scraping publicly-accessible Letterboxd data for movie recommendations
Python scraper based on AI
Open source file indexing & storage analytics powered by Elasticsearch
A Python library for automating interaction with websites