Python & command-line tool to gather text on the Web
Redis-based components for Scrapy
Twitter for Python
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Web app for Scrapyd cluster management
Changelog CI is a GitHub Action that enables a project
Library for use with the AWS Cloud Development Kit
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
A web privacy measurement framework
Utilize all available CPU cores for accepting new client connections
Automatically mock your HTTP interactions to simplify testing
Scraping publicly-accessible Letterboxd data for movie recommendations
Scrape job websites into a single spreadsheet with no duplicates.
Rules engine for cloud security, cost optimization, and governance
Scalable PaaS (automated Docker+nginx), aka Heroku on Steroids
Static site generator that supports Markdown and reST syntax
CMS framework for Django
.NET version of the Playwright testing and automation library
Requests for PHP is a humble HTTP request library
The complete web scraping toolkit for PHP
The Universal Plug-in System. Extend anything with WebAssembly
Simple and distributed Machine Learning
A simple WebSocket server
Modern, privacy-friendly, and detailed web analytics
Distributed web crawler admin platform for spiders management