extraction free download

Showing 12 open source projects for "extraction"

1

newpipeextractor

Library for extracting streaming site data without official APIs

NewPipeExtractor is an open source Java library designed to extract data from streaming platforms by analyzing their web interfaces instead of relying on official APIs. It serves as the core extraction component used by the NewPipe Android application, but it is built as a standalone library that can also be integrated into other software projects. NewPipeExtractor provides a unified framework for retrieving information such as video streams, playlists, channels, and search results from supported streaming services. It handles many low-level tasks involved in web data extraction, including parsing responses, managing platform-specific logic, and handling errors, allowing developers to focus on implementing application features rather than scraping mechanics. ...

Downloads: 5 This Week

Last Update: 2026-06-09
2

Browserless

The headless Chrome/Chromium driver on top of Puppeteer

Browserless is an open-source headless browser automation library and service built on top of Puppeteer that simplifies the process of running and scaling Chromium-based browser tasks in production environments. It provides a high-level API for interacting with headless Chrome, allowing developers to perform operations such as generating PDFs, capturing screenshots, extracting text or HTML, and automating web navigation. The project is designed to act as a production-ready abstraction layer...

Downloads: 17 This Week

Last Update: 20 hours ago
3

katana

Fast CLI web crawler for discovering endpoints in modern web apps

Katana is an open source command-line web crawling and spidering framework developed by ProjectDiscovery. It is designed to efficiently crawl websites and web applications in order to discover endpoints, resources, and other useful information that may not be easily visible through manual browsing. Katana focuses on speed and automation, making it suitable for use in security reconnaissance workflows and automated pipelines. Katana supports both standard HTTP crawling and headless browser...

Downloads: 7 This Week

Last Update: 2026-05-05
4

DotnetSpider

Lightweight .NET framework for fast web crawling and data scraping

DotnetSpider is a web crawling and data extraction framework built on the .NET Standard platform. It is designed to help developers create efficient and scalable crawlers for collecting structured data from websites. It provides a high-level API that simplifies the process of defining spiders, managing requests, and extracting content from web pages. Developers can create custom spiders by extending base classes and configuring pipelines that handle downloading, parsing, and storing collected data. ...

Downloads: 4 This Week

Last Update: 2026-03-10
5

spider_collection

Collection of Python web scraping scripts for data extraction tasks

spider_collection is a collection of Python web crawler scripts created primarily for experimentation, learning, and practical scraping tasks. spider_collection gathers multiple independent spiders designed to collect data from different platforms and services, demonstrating a variety of scraping techniques and workflows. These crawlers make use of common Python scraping tools such as requests, parsel, BeautifulSoup, and the Scrapy framework to extract structured information from web pages....

Downloads: 4 This Week

Last Update: 5 days ago
6

MBR Bulk WP Detector

A free WP plugin that lets you check unlimited URLs

MBR Bulk WP Detector is a free WordPress plugin that lets you check unlimited URLs right from your own dashboard. No subscriptions, no URL limits, and your data stays completely private on your server. What Can You Do With It? The basics are simple: Paste a list of URLs (or upload a CSV file), click a button, and boom—you’ve got a clear breakdown of which sites are running WordPress and which aren’t. But it gets better… Turn on Deep Scan mode, and you’ll also discover what...

Downloads: 0 This Week

Last Update: 2026-03-26
7

OCR Web based

Web-based OCR for the Firefox browser and PC

...id=com.ulm.ocr
Add-on for Opera: http://bit.ly/1F0E0wP
Release 1.0.1: For safety reasons, I disabled the option to import an image from a URL. You can also write or draw directly on the canvas and have the result run through character recognition and text extraction.

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05
8

Turbo Download Manager

A portable, modern multi-threaded download manager for all platforms

A modern multi-threaded download manager for Windows, Linux, Mac OS, Firefox, Chrome, Opera and Android devices. For bug reports visit: https://github.com/inbasic/turbo-download-manager/issues For FAQs visit: http://add0n.com/turbo-download-manager.html Turbo Download Manager is a standalone application without any dependencies. It should run out of the box. Just set the download location while adding the first job request. If you have a browser and would like to integrate this...

2 Reviews

Downloads: 48 This Week

Last Update: 2016-11-29
9

Freedomeditor file editor env. (PHP)

Customizable browser-based file editor environment (plain-text and WYSIWYG web editors) written in PHP, GPL licensed, with loads of features. Tested only in Firefox.

Downloads: 0 This Week

Last Update: 2017-09-20
10

Cornell Web Lab Collaboration Server

The Cornell Web Lab Collaboration Server is a suite of tools and services for GUI-based extraction, analysis and sharing of archived web data. See http://weblab.infosci.cornell.edu/ and http://www.cs.cornell.edu/~weigel for details about the project.

Downloads: 0 This Week

Last Update: 2015-05-03
11

WSHToolBox

This is a software package based on Windows Script Host technology. It provides various functions such as on-demand software installation and advanced file extraction. Just enjoy them :)

Downloads: 0 This Week

Last Update: 2015-08-01
12

doXfolder

Document Management System

Document Management System created using JEE6

Downloads: 0 This Week

Last Update: 2016-11-22