Install and manage a high-performance WordPress stack
A free and open-source interactive HTTPS proxy
Easily turn large sets of image URLs into an image dataset
Event-driven networking engine written in Python
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Persistent HTTP cache for Python requests
Ansible role for s3cmd. Available on Ansible Galaxy
Privacy browser for Android
Check links in web documents or full websites
Command line tool to modify OS X's accessibility database (TCC.db)
toot - Mastodon CLI & TUI
Python binding to Modest and Lexbor engines
A library that scrapes LinkedIn for user data
Securely and anonymously share files of any size
dude uncomplicated data extraction: A simple framework
Python & command-line tool to gather text on the Web
Redis-based components for Scrapy
Web Scraping Framework
NeoDB is a self-hosted server tracking what you read/watch/listen/play
A web privacy measurement framework
NBA Stats API via Basketball Reference
A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama
Web app for Scrapyd cluster management
CloudEvents Specification
Easy-to-use and developer-friendly enterprise CMS powered by Django