CLI tool to save complete web pages as a single self-contained HTML file
An adaptive web scraping framework
Python binding to the Modest and Lexbor engines
Metadata HTML scraper and parser for Node.js (supports Promises)
Scrape tweets, profiles, followers and following from Twitter/X
Fast CLI web crawler for discovering endpoints in modern web apps
Java library for working with real-world HTML
Open source Douyin crawler for collecting and downloading public data
The unix-way web crawler
Python crawler to download photos and videos from Tumblr blogs
Fast CLI tool for cloning entire websites for local browsing offline
Patches for Puppeteer and Playwright to reduce automation detection
Fast Go CLI tool for downloading videos from many streaming sites
Free tool that extracts emails, phone numbers, and custom text from the web using Java regex
Fast Go-based CLI scanner for running automated search engine dorks
Simple Python framework for building multithreaded web crawlers
JavaScript + BeautifulSoup = JSSoup