Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.

License

BSD License, GNU General Public License version 2.0 (GPLv2)

User Ratings

★★★★★ 10
★★★★ 1
★★★ 1
★★ 1
★ 1
Ease: 2 / 5
Features: 2 / 5
Design: 3 / 5
Support: 2 / 5

Additional Project Details

Operating Systems

Linux

Intended Audience

Advanced End Users, Developers

User Interface

Java Swing

Programming Language

XSL (XSLT/XPath/XSL-FO), Java

Database Environment

MySQL

Related Categories

XSL (XSLT/XPath/XSL-FO) XML Software, XSL (XSLT/XPath/XSL-FO) HTML XHTML, XSL (XSLT/XPath/XSL-FO) Search Engines, XSL (XSLT/XPath/XSL-FO) Frameworks, XSL (XSLT/XPath/XSL-FO) Web Scrapers, Java XML Software, Java HTML XHTML, Java Search Engines, Java Frameworks, Java Web Scrapers

Registered

2006-07-14