scrape free download - SourceForge

Showing 15 open source projects for "scrape"

1

Scrapy

A fast, high-level web crawling and web scraping framework

Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...

Downloads: 45 This Week

Last Update: 2024-06-21
See Project
2

JMX Exporter

A process for exposing JMX Beans via HTTP for Prometheus consumption

JMX to Prometheus exporter: a collector that can configurably scrape and expose MBeans of a JMX target. This exporter is intended to be run as a Java agent, exposing an HTTP server and serving metrics of the local JVM. It can also be run as a standalone HTTP server and scrape remote JMX targets, but this has various disadvantages, such as being harder to configure and being unable to expose process metrics (e.g., memory and CPU usage). Running the exporter as a Java agent is strongly encouraged.

Downloads: 3 This Week

Last Update: 2024-05-31
See Project
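
For readers unfamiliar with what "exposing metrics for Prometheus consumption" means in the JMX Exporter entry above, here is a minimal sketch of the plain-text exposition format that a Prometheus scrape reads over HTTP. It is not JMX Exporter code: the function name `render_metrics` and the sample metric name and value are hypothetical and chosen only for illustration.

```python
from typing import Dict, Tuple


def render_metrics(samples: Dict[str, Tuple[str, float]]) -> str:
    """Render gauge samples in the Prometheus text exposition format.

    `samples` maps a metric name to a (help text, value) pair.
    """
    lines = []
    for name, (help_text, value) in samples.items():
        lines.append(f"# HELP {name} {help_text}")  # human-readable description
        lines.append(f"# TYPE {name} gauge")        # metric type declaration
        lines.append(f"{name} {value}")             # the sample itself
    return "\n".join(lines) + "\n"


# Hypothetical sample; a real exporter would read this value from an MBean.
EXAMPLE = {"jvm_memory_bytes_used": ("Used JVM memory in bytes.", 1024.0)}

if __name__ == "__main__":
    print(render_metrics(EXAMPLE), end="")
```

An exporter serves text like this from an HTTP endpoint, and Prometheus fetches it on each scrape.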
3

Parsera

Lightweight library for scraping web-sites with LLMs

Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
4

jsoup

Java library for working with real-world HTML

jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...

Downloads: 2 This Week

Last Update: 2024-07-10
See Project
5

Elasticsearch Exporter

Elasticsearch stats exporter for Prometheus

Prometheus exporter for various metrics about Elasticsearch, written in Go. The exporter fetches information from an Elasticsearch cluster on every scrape, so a scrape interval that is too short can impose load on ES master nodes, particularly if you run with --es.all and --es.indices. We suggest you measure how long fetching /_nodes/stats and /_all/_stats takes for your ES cluster to determine whether your scraping interval is too short. As a last resort, you can scrape this exporter...

Downloads: 0 This Week

Last Update: 2024-09-20
See Project
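
The Elasticsearch Exporter entry above suggests measuring how long the stats requests take before settling on a scrape interval. The following is a rough sketch of that check, not part of the exporter: `fake_stats_fetch` is a hypothetical stand-in for an HTTP GET of /_nodes/stats or /_all/_stats, and the `safety_factor` of 2 is an assumed headroom, not a recommendation from the project.

```python
import time
from typing import Callable


def measure_fetch_seconds(fetch: Callable[[], object], runs: int = 3) -> float:
    """Return the slowest wall-clock duration in seconds over `runs` calls to `fetch`."""
    slowest = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        fetch()
        slowest = max(slowest, time.perf_counter() - start)
    return slowest


def interval_is_too_short(fetch_seconds: float,
                          scrape_interval_seconds: float,
                          safety_factor: float = 2.0) -> bool:
    """Flag intervals shorter than `safety_factor` times the measured fetch time."""
    return scrape_interval_seconds < fetch_seconds * safety_factor


def fake_stats_fetch() -> None:
    # Hypothetical stand-in for requesting /_nodes/stats or /_all/_stats.
    time.sleep(0.05)


if __name__ == "__main__":
    seconds = measure_fetch_seconds(fake_stats_fetch)
    print(f"slowest fetch: {seconds:.3f}s")
    print("15s interval too short?", interval_is_too_short(seconds, 15.0))
```

In practice you would replace `fake_stats_fetch` with a real request against your own cluster and compare the result with the interval configured in Prometheus.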
6

Artisan View

Manage your views in Laravel projects through artisan

This package adds a handful of view-related commands to Artisan in your Laravel project. Generate blade files that extend other views, scaffold out sections to add to those templates, and more. All from the command line we know and love.

Downloads: 0 This Week

Last Update: 2024-04-22
See Project
7

Prometheus Redis Metrics Exporter

Prometheus Exporter for Redis Metrics. Supports Redis 2.x, 3.x, 4.x, 5

... for the Redis instances then you can set the password via the --redis.password command line option of the exporter (this means you can currently use only one password across the instances you try to scrape this way; use several exporters if this is a problem). If your Redis instance requires authentication, there are several ways to supply a username (new in Redis 6.x with ACLs) and a password.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
8

DocSearch

The easiest way to add search to your documentation

... with the interaction patterns of each OS. We scrape your documentation or technical blog, configure the Algolia application and send you the snippet you'll have to integrate. It's that simple. You don't need to configure any settings or even have an Algolia account. We take care of this for you! We'll send you a small snippet to integrate DocSearch to your website and an invite to your fully configured Algolia application.

Downloads: 0 This Week

Last Update: 2024-09-27
See Project
9

htmLawed

PHP code to purify & filter HTML

The htmLawed PHP script makes HTML more secure and standards- & policy-compliant. The customizable HTML filter/purifier can balance tags, ensure proper nestings, neutralize XSS, restrict HTML, beautify code like Tidy, implement anti-spam measures, etc.

1 Review

Downloads: 112 This Week

Last Update: 2023-08-05
See Project
10

Marathon

Deploy and manage containers (including Docker) on top of Apache Mesos

... application. You can run databases like MySQL and Postgres, and have storage accounted for by Mesos. Supply an HTTP endpoint to receive notifications, for example to integrate with an external load balancer. Query them at /metrics in JSON format, push them to systems like Graphite, StatsD and DataDog, or scrape them using Prometheus.

Downloads: 0 This Week

Last Update: 2022-01-27
See Project
11

django-dynamic-scraper

Creating Scrapy scrapers via the Django admin interface

..., but it is well suited for the relatively common case of regularly scraping a website with a list of updated items (e.g. news, events, etc.) and then digging into the detail page to scrape more information for each item. Django Dynamic Scraper tries to keep its data structure in the database as separated as possible from the models in your app, so it comes with its own Django model classes for defining scrapers, runtime information related to your scraper runs and classes.

Downloads: 0 This Week

Last Update: 2022-09-05
See Project
12

google-play-scraper

Node.js scraper to get data from Google Play

Node.js module to scrape application data from the Google Play store. Retrieves the full detail of an application. Retrieves a list of applications from one of the collections at Google Play. Retrieves a list of apps that result from searching by the given term. Returns the list of applications by the given developer name. Given a string, returns up to five suggestions to complete a search query term. Retrieves a page of reviews for a specific application. Returns a list of similar apps...

Downloads: 0 This Week

Last Update: 2022-03-22
See Project
13

chrome-extensions-examples

All Chrome Extension examples collected into one repository

This is not an official mirror of the Chrome extension examples. Report any issues with the examples themselves to Google's issue trackers/forums. The Chrome Extensions examples did not exist as a Git repository, and browsing both the samples page and the VCViewer did not seem particularly handy. So, I decided to scrape the content into this repository for easier browsing and (possible) editing.

Downloads: 0 This Week

Last Update: 2022-05-11
See Project
14

dataflowkit

Golang framework for scraping data from web pages

Golang Web Scraper library for extracting data from web pages. Save results as CSV, JSON, XML

Downloads: 0 This Week

Last Update: 2018-03-09
See Project
15

Blackfire Player

Web Crawling, Web Testing, and Web Scraping application

Blackfire Player is a powerful Web Crawling, Web Testing, and Web Scraper application. It provides a nice DSL to crawl HTTP services, assert responses, and extract data from HTML/XML/JSON responses. Some Blackfire Player use cases: Crawl a website/API and check expectations -- aka Acceptance Tests; Scrape a website/API and extract values; Monitor a website; Test code with unit test integration (PHPUnit, Behat, Codeception, ...); Test code behavior from the outside thanks to the native...

Downloads: 0 This Week

Last Update: 2019-06-11
See Project