Java library for working with real-world HTML
A scalable web crawler framework for Java
This is the most powerful software taking into account CIS location
Open-Source RPA Software (formerly Kantu)
A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama
A fast, high-level web crawling and web scraping framework
Easily turn large sets of image URLs into an image dataset
Finviz analysis Python library
A visual no-code/code-free web crawler/spider
A lighter, faster browser kernel based on Blink for integrating HTML UI into apps
Open-source LLM-friendly web crawler & scraper
Turn entire websites into LLM-ready markdown or structured data
Metadata HTML scraper and parser for Node.js (supports Promises)
Python scraper based on AI
Python & command-line tool to gather text on the Web
A web scraping and browser automation library for Node.js
Python binding to Modest and Lexbor engines
A Python library for automating interaction with websites
The best free open source website change detection and restock service
The web browser built for scraping
NBA Stats API via Basketball Reference
Simple web scraping for R
Web app for Scrapyd cluster management
The Unix-way web crawler
Laravel adapter for Roach, the complete web scraping toolkit for PHP