Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Comprehensive search engine for books, papers, comics, magazines
A simple Python Pydantic model for Honkai
A Distributed RESTful Search Engine
Turn entire websites into LLM-ready markdown or structured data
WebDriver for Firefox
Open source web scraping system for automated data collection tasks
A fast, high-level web crawling and web scraping framework
E-mails, subdomains and names
CoreDNS is a DNS server that chains plugins
Log management solution that improves the performance of SIEM
Dude Uncomplicated Data Extraction: A simple framework
A Chrome extension for automating your browser by connecting blocks
Private, fast, and honest web browser
Alternative to Google Analytics that gives you full control over data
A Hypixel SkyBlock stats website
Free internet metasearch engine which aggregates
Pomerium is an identity and context-aware access proxy
Command line tool and library for transferring data with URLs
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX
Python crawler for collecting and downloading Sina Weibo user data
Qualitis is a one-stop data quality management platform
REST API for any Postgres database
Realtime crawler for COVID-19 outbreak statistics from DXY data
The DB Browser for SQLite