scraping free download

Showing 3 open source projects for "scraping"

View related business solutions

Browsers Python Clear Filters & Widen Search

Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
1

Linkedin Scraper

A library that scrapes Linkedin for user data

Linkedin Scraper is a library that scrapes Linkedin for user data. Version 2.0.0 and before is called linkedin_user_scraper and can be installed via pip3 install --user linkedin_user_scraper. The reason is that LinkedIn has recently blocked people from viewing certain profiles without having previously signed in. So by setting scrape=False, it doesn't automatically scrape the profile, but Chrome will open the linkedin page anyways. You can login and logout, and the cookie will stay in the...

Downloads: 1 This Week

Last Update: 2026-04-10
See Project
2

CloakBrowser

Stealth Chromium that passes every bot detection test

...The project integrates with Playwright and Puppeteer while preserving familiar automation workflows for developers. It also supports isolated browser profiles with configurable fingerprints, making it useful for testing, automation research, scraping, QA, and multi-profile browser environments. The ecosystem includes a self-hosted browser profile manager that functions as an open-source alternative to commercial anti-detect browsers.

Downloads: 40 This Week

Last Update: 2026-05-21
See Project
3

Scrapy-Redis

Redis-based components for Scrapy

You can start multiple spider instances that share a single redis queue. Best suitable for broad multi-domain crawls. Scraped items gets pushed into a redis queued meaning that you can start as many as needed post-processing processes sharing the items queue. Scheduler + Duplication Filter, Item Pipeline, Base Spiders. Default requests serializer is pickle, but it can be changed to any module with loads and dumps functions. Note that pickle is not compatible between python versions. Version...

Downloads: 0 This Week

Last Update: 2024-07-06
See Project