WebHarvest - web data extraction tool
betaDescription
Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
WebHarvest - web data extraction tool Web SiteUser Ratings
User Reviews
-
O melhor programa para compartilhamento
-
Very well polished, the debugger and watch tools are very helpful. Much better than other crawling tools I've seen elsewhere.
-
Amazing tool and the GUI is create for developing the scraper scripts. The ability to inspect all intermediate results and run XPath queries against those interactively is an absolute killer feature.
-
very useful and fully functional java crawler
-
Nice tool! It allows you to apply the variety of data manipulation techniques (regex, XPath, XQuery, XSLT, different script engines like JS, groovy and beanshell, HTML cleaner and many more). And you also have a decent set of the flow control instruments (if-else, user functions, loops). And of course you can write your own plug-in if you want something special. But the coolest thing among all of those is consistency - you have everything you need in one place!
-
Very powerful web extraction tool. Great for beginners and full of features for experienced users. You can extract data, email it or save to a database without knowing how to program Java.