-
Hi Steve,
I have read the thread and used java 1.6 rev 2. It's with this VM that I have the problem.
The workaround I used was to divide the scraping in batches.
Kind regards,
Mile.
2007-08-13 20:05:06 UTC in WebHarvest - web data extraction tool
-
Hello,
I have been successfully using web harvest for scraping for some time.
I have lately discovered an issue regarding scraping multiple pages and processing them with some xpath expressions.
I did some basic profiling and apparently the class
org.webharvest.runtime.variables.NodeVariable is the one that pumps up with every downloaded page resulting in the end an OutOfMemory...
2007-08-13 14:53:12 UTC in WebHarvest - web data extraction tool
-
milerosu committed patchset 958 of module makumba to the Makumba CVS repository, changing 1 files.
2005-07-09 14:15:30 UTC in Makumba
-
milerosu committed revision 1060 to the Makumba SVN repository, changing 1 files.
2005-07-09 14:15:30 UTC in Makumba
-
Hello,
I have uploaded the old repository of the ParaDe
project on the sf server. (
/home/groups/p/pa/parade/parade.zip )
Please add it as a cvs repository.
Thank you very much,
Mile Rosu.
2005-07-06 13:10:19 UTC in SourceForge.net
-
milerosu registered the ParaDe project.
2005-07-04 10:24:40 UTC in ParaDe
-
milerosu committed patchset 847 of module makumba to the Makumba CVS repository, changing 4 files.
2005-04-20 15:21:32 UTC in Makumba
-
milerosu committed revision 946 to the Makumba SVN repository, changing 4 files.
2005-04-20 15:21:32 UTC in Makumba
-
milerosu committed patchset 812 of module makumba to the Makumba CVS repository, changing 1 files.
2005-03-08 23:23:18 UTC in Makumba
-
milerosu committed revision 899 to the Makumba SVN repository, changing 1 files.
2005-03-08 23:23:18 UTC in Makumba