Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.

Project Activity

See All Activity >

License

BSD License, GNU General Public License version 2.0 (GPLv2)

Follow WebHarvest - web data extraction tool

WebHarvest - web data extraction tool Web Site

You Might Also Like
Employee monitoring software with screenshots Icon
Employee monitoring software with screenshots

Clear visibility and insights into how employees work. Even remotely.

Stay productive working at any distance from anywhere with Monitask.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
10
1
1
1
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 2 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 2 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 3 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 2 / 5

User Reviews

  • I've used this tool several times on a dozen of so different sites with good results. The syntax can be challenging. Once you get used to it, it works quite well. Support was very good in the past. Very helpful. Sorry to see development has stopped by the looks of it.
Read more reviews >

Additional Project Details

Operating Systems

Linux

Intended Audience

Advanced End Users, Developers

User Interface

Java Swing

Programming Language

XSL (XSLT/XPath/XSL-FO), Java

Database Environment

MySQL

Related Categories

XSL (XSLT/XPath/XSL-FO) XML Software, XSL (XSLT/XPath/XSL-FO) HTML XHTML, XSL (XSLT/XPath/XSL-FO) Search Engines, XSL (XSLT/XPath/XSL-FO) Frameworks, XSL (XSLT/XPath/XSL-FO) Web Scrapers, Java XML Software, Java HTML XHTML, Java Search Engines, Java Frameworks, Java Web Scrapers

Registered

2006-07-14