Dataproc templates and pipelines for solving simple in-cloud data task
Persistent HTTP cache for python requests
Twitter for Python
Check links in web documents or full websites
A next generation HTTP client for Python
NeoDB is a self-hosted server tracking what you read/watch/listen/play
Develop and test your cloud apps offline
Mist is an open source, multicloud management platform
Utilize all available CPU cores for accepting new client connections
A web privacy measurement framework
dude uncomplicated data extraction: A simple framework
Python & command-line tool to gather text on the Web
Redis-based components for Scrapy
A CLI, cURL-like tool for humans
NBA Stats API via Basketball Reference
Ansible role for s3cmd. Available on Ansible Galaxy
toot - Mastodon CLI & TUI
CLI tool to build, test, debug, and deploy Serverless applications
CMS framework for Django
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Automatically mock your HTTP interactions to simplify testing
Changelog CI is a GitHub Action that enables a project
Web app for Scrapyd cluster management
Command line tool to modify OS X's accessibility database (TCC.db)
Scrape job websites into a single spreadsheet with no duplicates.