Python & command-line tool to gather text on the Web
Redis-based components for Scrapy
Python library for scraping and analyzing online news articles easily
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Vertical novel search engine with unified reading and tracking tools
Async Python library in automating Chromium browsers without WebDriver
Open source file indexing & storage analytics powered by Elasticsearch
AI-ready web crawler that extracts and structures website content
Python tool for crawling and extracting structured data from news site
Scraping publicly-accessible Letterboxd data for movie recommendations
dude uncomplicated data extraction: A simple framework
Kemono Downloader - A cross-platform Python app built with PyQt6
Command-line Bilibili video and danmaku downloader with batch support
Web crawler that finds hidden web directories without brute force
Distributed web crawler admin platform for spiders management
Distributed Crawler Management Framework Based on Scrapy
Descubre archivos, rutas escondidas realizando busquedas avanzadas
A service daemon to run Scrapy spiders
Python library providing APIs for automated website login workflows
Web crawler for archiving and backing up sites into WARC archives
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
ML-based HTML scraper that learns extraction rules from examples
Simple Python framework for building multithreaded web crawlers
Intelligent proxy pool for collecting and managing public proxies
Instagram profile crawler that extracts posts, tags, and stats