Web Scraping Framework
Redis-based components for Scrapy
Scrape tweets, profiles, followers and following from Twitter/X
Python library for scraping and analyzing online news articles easily
Python crawler to download photos and videos from Tumblr blogs
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Asyncio-based Python framework for building fast web crawling spiders
Python crawler and API for downloading JMComic albums and images
Async Python library for automating Chromium browsers without WebDriver
Scrape job websites into a single spreadsheet with no duplicates
Open source file indexing & storage analytics powered by Elasticsearch
AI-ready web crawler that extracts and structures website content
Realtime crawler for COVID-19 outbreak statistics from DXY data
Python tool for crawling and extracting structured data from news sites
An adaptive Web Scraping framework
Scraping publicly-accessible Letterboxd data for movie recommendations
Kemono Downloader - A cross-platform Python app built with PyQt6
Download and manage Bilibili Manga chapters with a GUI downloader
Collection of Python web scraping scripts for data extraction tasks
Multiprocess Selenium crawler for downloading images by keywords
Dude (dude uncomplicated data extraction): a simple framework
Command-line Bilibili video and danmaku downloader with batch support
Web crawler that finds hidden web directories without brute force
Distributed web crawler admin platform for spider management
Distributed Crawler Management Framework Based on Scrapy