Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
dude uncomplicated data extraction: A simple framework
E-mails, subdomains and names
A simple Python Pydantic model for Honkai
A fast, high-level web crawling and web scraping framework
Private, fast, and honest web browser
AI-ready web crawler that extracts and structures website content
Python crawler for collecting and downloading Sina Weibo user data
Consolidate and extend hosts files from several well-curated sources
Realtime crawler for COVID-19 outbreak statistics from DXY data
Collection of Python web scraping scripts for data extraction tasks
PostHog provides open-source web & product analytics
Open source Douyin crawler for collecting and downloading public data
openvpn-monitor is a web based OpenVPN monitor
This is the most powerful software taking into account CIS location
Modern, privacy-friendly, and detailed web analytics
Open source file indexing & storage analytics powered by Elasticsearch
Powerful Python crawler framework for scalable web scraping tasks
Python tool for crawling and extracting structured data from news site
Set of Ansible scripts that simplifies the setup of a personal VPN
Firebase Admin Python SDK
Dataproc templates and pipelines for solving simple in-cloud data task
CloudEvents Specification
Multi-cloud security auditing tool
Scraping publicly-accessible Letterboxd data for movie recommendations