scraping free download

Showing 79 open source projects for "scraping"

View related business solutions

Python Clear Filters & Widen Search

Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

crawler

Collection of JS reverse engineering examples for web scraping study

...Many examples illustrate techniques such as debugging scripts, intercepting requests, analyzing encrypted parameters, and understanding authentication flows. crawler also explores common anti-scraping defenses and demonstrates how developers can examine them through debugging tools and reverse engineering techniques.

Downloads: 7 This Week

Last Update: 2 hours ago
See Project
2

Robin

AI-powered tool for dark web OSINT search and investigation

...The tool helps refine investigative queries, collect results from multiple dark web sources, and filter relevant intelligence using AI-driven processing. Robin also performs scraping of discovered pages through Tor sessions, allowing users to gather additional context from dark web sites while maintaining the required network routing. By integrating AI models, the platform can interpret results, highlight key information, and produce summaries that help analysts understand findings faster. The project provides a modular architecture separating search, scraping, and AI processing components so it can be extended with new data sources.

Downloads: 26 This Week

Last Update: 4 days ago
See Project
3

ScrapeGraphAI

Python scraper based on AI

Extracting content from websites and local documents using LLM. ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.). Just say which information you want to extract and the library will do it for you.

Downloads: 0 This Week

Last Update: 2026-05-10
See Project
4

Scrapling

An adaptive Web Scraping framework

Scrapling is an adaptive web scraping framework designed to handle everything from a single HTTP request to large-scale, concurrent crawls. Built for modern websites, it intelligently adapts to structural changes by automatically relocating elements when page layouts update. The framework includes advanced fetchers capable of bypassing anti-bot protections such as Cloudflare Turnstile using stealth and browser automation techniques.

Downloads: 1 This Week

Last Update: 2026-05-11
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
5

Python Code Tutorials

The Python Code Tutorials

Python Code Tutorials is a large educational repository that aggregates programming tutorials from the “The Python Code” website into a structured collection of Python projects and learning materials. The repository covers a wide range of programming topics including cybersecurity, networking, web scraping, machine learning, GUI development, and automation scripts. Each tutorial typically includes complete Python code examples and explanations that demonstrate how to build real tools and applications step by step. Many tutorials focus on practical implementations such as building network scanners, web scraping tools, object detection systems, and automation utilities using Python libraries. ...

Downloads: 0 This Week

Last Update: 2026-05-14
See Project
6

JobFunnel

Scrape job websites into a single spreadsheet with no duplicates.

Scrape job websites into a single spreadsheet with no duplicates. Automated tool for scraping job postings into a .csv file. You can search for jobs with YAML configuration files or by passing command arguments. By performing regular scraping and reviewing, you can cut through the noise of even the busiest job markets. Run funnel with your settings YAML to populate your master CSV file with jobs from available providers.

Downloads: 0 This Week

Last Update: 2024-09-29
See Project
7

FEAPDER

Powerful Python crawler framework for scalable web scraping tasks

...It includes several built-in spider types, such as AirSpider, Spider, TaskSpider, and BatchSpider, which address different crawling scenarios ranging from lightweight scraping to distributed and batch-based jobs. feapder supports features such as breakpoint resume, allowing crawlers to continue from where they stopped without losing progress. It also integrates monitoring and alerting capabilities to help developers track crawler performance and detect issues during execution. feapder includes browser rendering support for handling dynamic web pages and provides mechanisms for large-scale data deduplication during crawling.

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
8

Python-Spider

Python3 web crawler practice

...As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe concurrency or scheduling to crawl multiple pages, and techniques to handle common web-scraping issues. For people wanting to get hands-on with building scrapers, collecting data, or learning how to navigate web programming in Python, this repository acts as a didactic reference or starting point. Because it’s published publicly under an open license, users are free to fork and adapt the code.

Downloads: 0 This Week

Last Update: 2025-12-08
See Project
9

Zendriver

A blazing fast, async-first, undetectable webscraping

Zendriver is a modern Python web automation and scraping framework that leverages the Chrome DevTools Protocol to provide fast, asynchronous control over real browser instances. Unlike traditional tools that rely on Selenium or WebDriver, Zendriver communicates directly with the browser through CDP, enabling higher performance and more precise control over browser behavior. The framework is designed to be difficult to detect by anti-bot systems, making it suitable for advanced scraping and automation use cases where stealth is important. ...

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Parsera

Lightweight library for scraping web-sites with LLMs

Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.

Downloads: 1 This Week

Last Update: 2025-10-08
See Project
11

spider_collection

Collection of Python web scraping scripts for data extraction tasks

spider_collection is a collection of Python web crawler scripts created primarily for experimentation, learning, and practical scraping tasks. spider_collection gathers multiple independent spiders designed to collect data from different platforms and services, demonstrating a variety of scraping techniques and workflows. These crawlers make use of common Python scraping tools such as requests, parsel, BeautifulSoup, and the Scrapy framework to extract structured information from web pages. ...

Downloads: 3 This Week

Last Update: 2 days ago
See Project
12

Scweet

Scrape tweets, profiles, followers and following from Twitter/X

Scweet is a Python-based Twitter/X scraping library and CLI designed to collect tweets, profile timelines, followers, following lists, and user profile data without requiring the official Twitter/X API or a developer account. Instead of depending on deprecated unauthenticated scraping methods, it works by using X’s web GraphQL API together with authenticated browser cookies, which gives it a more current and practical approach for data extraction.

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
13

MDCx

Movie metadata scraper and organizer for media libraries and NFO

...It includes several interfaces, allowing users to operate it through a graphical desktop application, a browser-based web interface, or command-line utilities depending on their workflow. Its architecture separates core scraping logic from the user interfaces, allowing the same metadata processing system to be reused across different modes.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
14

PythonPark

Python open source project "The Road to Self-Study Programming"

PythonPark is a large, curated “learning playground” for Python — essentially a comprehensive self-study meta-repository aimed at helping learners progress in Python programming, data science, machine learning, web scraping, and software engineering practices. It aggregates tutorials, learning guides, project examples, and resources across topics: from Python basics and data structures to machine learning, web scraping, and even interview preparation and “programmer life” guidance. Because of this breadth, PythonPark serves both as a reference library (for quick lookup) and as a structured learning path for beginners and intermediate learners in Python. ...

Downloads: 0 This Week

Last Update: 2025-12-08
See Project
15

CommunityScrapers

This is a public repository containing scrapers

Stash Community Scrapers is a large open-source collection of metadata extraction tools designed to work with the Stash media management platform, enabling automated scraping of content information from various online sources. The repository contains hundreds of scraper definitions written primarily in YAML and Python, each tailored to extract structured metadata such as titles, performers, tags, and media details from specific websites. These scrapers integrate directly into Stash, allowing users to enrich their media libraries with accurate and detailed information without manual entry. ...

Downloads: 1 This Week

Last Update: 5 days ago
See Project
16

Linkedin Scraper

A library that scrapes Linkedin for user data

Linkedin Scraper is a library that scrapes Linkedin for user data. Version 2.0.0 and before is called linkedin_user_scraper and can be installed via pip3 install --user linkedin_user_scraper. The reason is that LinkedIn has recently blocked people from viewing certain profiles without having previously signed in. So by setting scrape=False, it doesn't automatically scrape the profile, but Chrome will open the linkedin page anyways. You can login and logout, and the cookie will stay in the...

Downloads: 1 This Week

Last Update: 2026-04-10
See Project
17

CyberScraper 2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

CyberScraper 2077 is not just another web scraping tool – it's a glimpse into the future of data extraction. Born from the neon-lit streets of a cyberpunk world, this AI-powered scraper uses OpenAI, Gemini and LocalLLM Models to slice through the web's defenses, extracting the data you need with unparalleled precision and style.

Downloads: 0 This Week

Last Update: 2026-05-02
See Project
18

Fapello.Downloader

NSFW Windows app to batch download images and videos

...The tool allows users to paste a content URL and retrieve all associated media in a single operation, eliminating the need for manual downloading of individual files. It is built entirely in Python and leverages libraries such as BeautifulSoup and requests for scraping and data retrieval, while using a customtkinter interface to provide a user-friendly experience. The application is packaged as a standalone executable for Windows, making it accessible without requiring a Python environment. It is optimized for batch operations, organizing downloaded media into structured folders for easy access and storage. ...

Downloads: 66 This Week

Last Update: 2026-03-18
See Project
19

Fli

Google Flights MCP and Python Library

Fli is a powerful Python library and command-line tool that provides direct programmatic access to Google Flights data through reverse-engineered API interactions rather than traditional web scraping. This approach enables faster, more reliable, and more stable access to flight information, avoiding the fragility associated with HTML parsing and UI changes. The library supports a wide range of flight search capabilities, including filtering by airline, departure time, number of stops, cabin class, and sorting by price or duration, making it suitable for both casual queries and advanced travel analysis. ...

Downloads: 1 This Week

Last Update: 2026-05-24
See Project
20

CloakBrowser

Stealth Chromium that passes every bot detection test

...The project integrates with Playwright and Puppeteer while preserving familiar automation workflows for developers. It also supports isolated browser profiles with configurable fingerprints, making it useful for testing, automation research, scraping, QA, and multi-profile browser environments. The ecosystem includes a self-hosted browser profile manager that functions as an open-source alternative to commercial anti-detect browsers.

Downloads: 37 This Week

Last Update: 2026-05-21
See Project
21

Amazing-Python-Scripts

Curated collection of Amazing Python scripts

...The repository encourages community contributions, allowing developers to add their own scripts and improve existing ones through pull requests. Examples include scripts for sentiment analysis, data scraping, web automation, log analysis, and interactive applications such as games or voice-controlled tools. The project also provides contribution guidelines and documentation so that developers can easily collaborate and expand the collection of scripts.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
22

newspaper4k

Python library for scraping and analyzing online news articles easily

Newspaper4k is a Python library designed for extracting, processing, and analyzing news articles from websites. It is a continuation and active fork of the original newspaper3k library, which had stopped receiving updates, with the goal of keeping the ecosystem maintained while adding improvements and bug fixes. It provides developers with tools to automatically download web pages, extract the main article content, and collect associated metadata such as titles, authors, images, and...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
23

Instagram OSINT Tool

Instagram OSINT tool for gathering profile data and public posts

...It retrieves details that are not always easily visible when browsing an Instagram account normally, allowing investigators, researchers, and developers to gather structured data about a target profile. It works by scraping publicly available profile information and extracting metadata from Instagram pages using Python. It collects various attributes such as the username, profile name, follower counts, account status indicators, and profile metadata. In addition to profile information, it can also retrieve post-related data and download publicly available images associated with an account. ...

Downloads: 52 This Week

Last Update: 5 days ago
See Project
24

Letterboxd Recommendations

Scraping publicly-accessible Letterboxd data for movie recommendations

Scraping publicly-accessible Letterboxd data and creating a movie recommendation model with it that can generate recommendations when provided with a Letterboxd username. A user's "star" ratings are scraped from their Letterboxd profile and assigned numerical ratings from 1 to 10 (accounting for half stars). Their ratings are then combined with a sample of ratings from the top 4000 most active users on the site to create a collaborative filtering recommender model using singular value decomposition (SVD). ...

Downloads: 0 This Week

Last Update: 2025-11-01
See Project
25

WeebCentral Downloader

A powerful manga downloader for WeebCentral with both GUI and CLI

...The tool also incorporates progress tracking and background worker threads to ensure a responsive experience during large downloads. Its modular structure separates scraping logic, interface components, and configuration management, making it maintainable and extensible.

Downloads: 25 This Week

Last Update: 2026-03-24
See Project