Java library for working with real-world HTML
A scalable web crawler framework for Java
Open-Source RPA Software (formerly Kantu)
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
A fast, high-level web crawling and web scraping framework
Easily turn large sets of image urls to an image dataset
Finviz analysis python library
A visual no-code/code-free web crawler/spider
Lighter, faster browser kernel of blink to integrate HTML UI in apps
Open-source LLM Friendly Web Crawler & Scraper
Turn entire websites into LLM-ready markdown or structured data
MetaData html scraper and parser for Node.js (supports Promises
Python scraper based on AI
Python & command-line tool to gather text on the Web
A web scraping and browser automation library for Node.js
Python binding to Modest and Lexbor engines
A Python library for automating interaction with websites
The web browser built for scraping
NBA Stats API via Basketball Reference
Simple web scraping for R
Web app for Scrapyd cluster management
The unix-way web crawler
Laravel adapter for Roach, the complete web scraping toolkit for PHP
Declarative web scraping
dude uncomplicated data extraction: A simple framework