web scraper extractor free download

Showing 14 open source projects for "web scraper extractor"

1

Article Extractor

Extracts the main article from a given URL with Node.js

A Node.js library for extracting main content from web articles, removing unnecessary clutter like ads and navigation elements.

Downloads: 6 This Week

Last Update: 2025-09-04
See Project
2

shot-scraper

A command-line utility for taking automated screenshots of websites

shot-scraper is a command-line utility for taking automated screenshots of web pages using a headless browser engine. After installation, a single command can capture a full-page screenshot of a URL and save it to a file, making it ideal for documentation, monitoring, and visual regression tasks. Under the hood it uses a modern browser (installed via a one-time shot-scraper install step) and exposes options for viewport size, full-page versus clipped screenshots, and device emulation. ...

Downloads: 0 This Week

Last Update: 2026-02-01
See Project
3

Goutte

Goutte, a simple PHP Web Scraper

Goutte is a screen scraping and web crawling library for PHP. Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses. Goutte depends on PHP 7.1+. Add fabpot/goutte as a require dependency in your composer.json file. Create a Goutte Client instance (which extends Symfony\Component\BrowserKit\HttpBrowser). Make requests with the request() method. The method returns a Crawler object (Symfony\Component\DomCrawler\Crawler). To use your own HTTP settings, you may...

Downloads: 0 This Week

Last Update: 2023-04-01
See Project
4

soup

Web Scraper in Go, similar to BeautifulSoup

soup is a small web scraper package for Go, with an interface highly similar to that of BeautifulSoup. The struct it returns for each node has three fields: Pointer, a pointer to the current HTML node; NodeValue, the current HTML node's value, i.e. the tag name for an ElementNode or the text for a TextNode; and Error, which holds an error if one occurs and is nil otherwise.

Downloads: 0 This Week

Last Update: 2023-01-25
See Project
5

TransPose

PyTorch implementation of "TransPose" for keypoint localization

TransPose is a human pose estimation model based on a CNN feature extractor, a Transformer encoder, and a prediction head. Given an image, the attention layers in the Transformer can efficiently capture long-range spatial relationships between keypoints and reveal which dependencies the predicted keypoint locations rely on most.

Downloads: 0 This Week

Last Update: 2024-07-12
See Project
6

django-dynamic-scraper

Creating Scrapy scrapers via the Django admin interface

Django Dynamic Scraper (DDS) is a Django app built on top of the scraping framework Scrapy. While preserving many of Scrapy's features, it lets you dynamically create and manage spiders via the Django admin interface. With DDS you can define your Scrapy scrapers dynamically via the Django admin interface and save your scraped items in the database you defined for your Django project. Since it simplifies things, DDS is not usable for all kinds of scrapers, but...

Downloads: 0 This Week

Last Update: 2022-09-05
See Project
7

google-play-scraper

Node.js scraper to get data from Google Play

Node.js module to scrape application data from the Google Play store. Retrieves the full detail of an application. Retrieves a list of applications from one of the collections at Google Play. Retrieves a list of apps that result from searching for the given term. Returns the list of applications by the given developer name. Given a string, returns up to five suggestions to complete a search query term. Retrieves a page of reviews for a specific application. Returns a list of similar apps to the...

Downloads: 0 This Week

Last Update: 2022-03-22
See Project
8

Rush Stack

Monorepo for tools developed by the Rush Stack community

...These major tools are developed under the Rush Stack umbrella. Rush, the scalable monorepo build orchestrator. Heft, an extensible build system that interfaces with Rush. API Extractor, which coordinates API reviews for library packages and generates .d.ts rollups. API Documenter, which generates your API documentation website.

Downloads: 0 This Week

Last Update: 2022-03-15
See Project
9

Thymeleaf Property Extractor

A Java-written extractor for all your Thymeleaf localization strings

Thymeleaf Property Extractor (ThExtractor for short) is a Java app that, given a folder or an HTML file, extracts every Thymeleaf localisation tag (for example: th:text="#{myString}") to a properties file, to ease and speed up the development of your Spring web app. It's really easy to use! Just run the jar, browse for your HTML file (or a folder containing HTML files) in the first field, and browse for a target .properties file (or create one!).

Downloads: 0 This Week

Last Update: 2014-10-12
See Project
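As a rough illustration of the extraction step described in the Thymeleaf Property Extractor entry above, here is a minimal Python sketch. It is not the project's Java implementation. The names `MESSAGE_KEY`, `extract_keys`, and `to_properties` are hypothetical, and the sketch assumes only plain keys of the form th:text="#{myString}" matter (parameterized message expressions are skipped).

```python
import re

# Assumption: only simple keys such as #{myString} or #{page.title} are collected.
# Keys with parameters, e.g. #{welcome(${name})}, do not match this pattern.
MESSAGE_KEY = re.compile(r'th:text\s*=\s*"#\{([\w.\-]+)\}"')


def extract_keys(html_text):
    """Return the localisation keys found in th:text attributes, in first-seen order."""
    seen = []
    for key in MESSAGE_KEY.findall(html_text):
        if key not in seen:
            seen.append(key)
    return seen


def to_properties(keys):
    """Render the keys as .properties lines with empty values to be filled in."""
    return "".join(f"{key}=\n" for key in keys)


if __name__ == "__main__":
    sample = (
        '<h1 th:text="#{page.title}">Title</h1>'
        '<p th:text="#{myString}">Placeholder</p>'
        '<p th:text="#{myString}">Repeated key</p>'
    )
    print(to_properties(extract_keys(sample)), end="")
```

Duplicate keys are written once, so the generated file can be edited by hand without cleanup.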
10

php-url-extractor

Lists all URLs present at the requested URL in absolute format

This PHP program extracts all URLs found on the requested page and outputs them as absolute URLs.

Downloads: 3 This Week

Last Update: 2018-02-03
See Project
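The idea behind the php-url-extractor entry above (collect every link on a page and resolve it to absolute form) can be sketched in a few lines of Python using only the standard library. This is an illustrative sketch, not the project's PHP code. The names `LinkCollector` and `extract_absolute_urls` are hypothetical, and the sketch assumes that only `href` and `src` attributes count as URLs.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects href/src attribute values and resolves them against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            # Assumption: only href and src attributes are treated as links.
            if name in ("href", "src") and value:
                absolute = urljoin(self.base_url, value)
                if absolute not in self.urls:
                    self.urls.append(absolute)


def extract_absolute_urls(html_text, base_url):
    """Return the de-duplicated absolute URLs found in html_text, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html_text)
    return collector.urls


if __name__ == "__main__":
    sample = (
        '<a href="/docs">Docs</a> '
        '<img src="logo.png"> '
        '<a href="https://example.org/x">External</a>'
    )
    for url in extract_absolute_urls(sample, "https://example.com/blog/post.html"):
        print(url)
```

Fetching the page itself is left out on purpose; the function takes the HTML text and the URL it was served from, which keeps the resolution logic easy to test.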
11

Calibre Database Extractor

A tool that extracts data from the Calibre SQLite database, including a command-line tool that generates OPDS catalogs. In plain terms, it takes the metadata out of Calibre and generates catalogs for Stanza, Aldiko, and web browsers.

Downloads: 0 This Week

Last Update: 2013-04-12
See Project
12

Scraper and Macro Interface for the Web

This project aims to develop a free, open-source Scraper and Macro Interface for web programmers that needs no installation and works on most operating systems.

Downloads: 0 This Week

Last Update: 2013-03-21
See Project
13

dataflowkit

Golang framework for scraping data from web pages

Golang web scraper library for extracting data from web pages. Results can be saved as CSV, JSON, or XML.

Downloads: 0 This Week

Last Update: 2018-03-09
See Project
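To make the three output formats named in the dataflowkit entry above concrete, here is a minimal Python sketch that serializes the same scraped records as CSV, JSON, and XML. It does not use or reproduce dataflowkit's Go API. The function names, the `items`/`item` element names, and the sample records are all hypothetical, and the sketch assumes every record is a flat dictionary with the same keys, each a valid XML element name.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET


def to_json(rows):
    """Serialize the records as a JSON array."""
    return json.dumps(rows, indent=2)


def to_csv(rows):
    """Serialize the records as CSV, using the first record's keys as the header."""
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_xml(rows):
    """Serialize the records as <items><item>...</item></items>."""
    root = ET.Element("items")
    for row in rows:
        item = ET.SubElement(root, "item")
        for key, value in row.items():
            ET.SubElement(item, key).text = str(value)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    records = [
        {"title": "First page", "price": "10"},
        {"title": "Second page", "price": "12"},
    ]
    print(to_csv(records))
    print(to_json(records))
    print(to_xml(records))
```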
14

Blackfire Player

Web Crawling, Web Testing, and Web Scraping application

Blackfire Player is a powerful Web Crawling, Web Testing, and Web Scraping application. It provides a nice DSL to crawl HTTP services, assert responses, and extract data from HTML/XML/JSON responses. Some Blackfire Player use cases: Crawl a website/API and check expectations -- aka Acceptance Tests; Scrape a website/API and extract values; Monitor a website; Test code with unit test integration (PHPUnit, Behat, Codeception, ...); Test code behavior from the outside thanks to the native Blackfire Profiler integration -- aka Unit Tests from the HTTP layer (tm). ...

Downloads: 0 This Week

Last Update: 2019-06-11
See Project