scraping free download

WebHarvest - web data extraction tool

Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.

14 Reviews

Downloads: 5 This Week

Last Update: 2025-10-27

See Project

unfluff

Automatically extract body content (and other cool stuff) from HTML

unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it returns a structured object with the extracted text and other fields. It supports caching internal representations to speed up repeated extractions. While its language support is best for English, it is still widely used in web-content-processing pipelines. ...

Downloads: 0 This Week

Last Update: 2025-11-14

See Project

ScraperEdit for XBMC

XML bindings and a GUI for creating and editing XBMC Scrapers

This program is an editor for creating XBMC Scrapers. It is similar to ScraperEditor, an other editor using ScraperXML, that runs under .Net environment. This program runs under Sun/Oracle's Java Runtime. HELP WANTED! I am looking for someone, who would help me writing documentation, like user's manual and on-line help. Also if someone want to help, translated language files are always welcome...

Downloads: 0 This Week

Last Update: 2016-03-10

See Project

datalus

PHP web API designed to simplify object handling(loading, saving, querying, displaying, and editing), abstract the data from its display structure, and layout and allow the target data to be delivered to any supported format without special logic.

Downloads: 0 This Week

Last Update: 2016-05-28

See Project

Xidel

Xidel is a cli webpage scraping tool supporting XPath/XQuery 3 and CSS

Xidel is a command line tool to download web pages and extract data from them. This data can be extracted using XPath/XQuery 3.0 (with a compatibility modes for XPath 2.0 and XQuery 1.0), JSONiq, CSS 3 selectors, and custom, pattern-matching templates that are like an annotated version of the processed page. It can download files over HTTP/S connections, follow redirections, links, or extracted values, and also process local files. The extracted values can then be exported as...

3 Reviews

Downloads: 0 This Week

Last Update: 2017-05-12

See Project

Search Results for "scraping"

5 projects for "scraping" with 2 filters applied:

WebHarvest - web data extraction tool

unfluff

ScraperEdit for XBMC

datalus

Xidel

Search Results for "scraping"

5 projects for "scraping" with 2 filters applied:

WebHarvest - web data extraction tool

unfluff

ScraperEdit for XBMC

datalus

Xidel

Related Searches

Related Categories