User Ratings

ease 2 / 5
features 2 / 5
design 3 / 5
support 2 / 5

User Reviews

  • Yeah, it works well for web data extraction, but it is not powerful enough for cloud extraction. For that, I use another web scraping tool, Octoparse.

  • I've used this tool several times on a dozen or so different sites with good results. The syntax can be challenging, but once you get used to it, it works quite well. Support was very good and very helpful in the past. Sorry to see that development appears to have stopped.

  • Great use of XSLT and visual representation. It would be better if the results of the search were easier to identify. I prefer this htp:// however for data extraction.

  • All other 18 reviews are FAKE and by the uploader.

  • Thanks, very good project! +

  • very good project, thanks!

  • Can't find any donation button ...

    1 user found this review helpful.
  • good job web-harvest

  • Very well polished, the debugger and watch tools are very helpful. Much better than other crawling tools I've seen elsewhere.

    1 user found this review helpful.
  • Amazing tool, and the GUI is great for developing the scraper scripts. The ability to inspect all intermediate results and run XPath queries against them interactively is an absolute killer feature.

  • Very useful and fully functional Java crawler.

  • Nice tool! It lets you apply a variety of data manipulation techniques (regex, XPath, XQuery, XSLT, different script engines like JS, Groovy and BeanShell, HTML cleaner and many more). You also get a decent set of flow control instruments (if-else, user functions, loops). And of course you can write your own plug-in if you want something special. But the coolest thing of all is consistency: you have everything you need in one place!

  • Very powerful web extraction tool. Great for beginners and full of features for experienced users. You can extract data, email it or save to a database without knowing how to program Java.

  • The best, simplest, and most powerful extraction tool. Please add a delay capability to reduce the number of HTTP requests per second.

  • Web Harvest is awesome. I added a new function to mine to perform redirected website resolution - a rather useful function to be sure. I will second the motion to add a timeout capability, which I will probably add to my own version. -DTM

  • Totally awesome

  • The best! It's very simple to work with this tool.