Combine is an open system for crawling Internet resources. It can be used both as a general and focused crawler. If you want to downloadWeb-pages pertaining to a particular topic (like 'Carnivorous Plants')Then Combine is the system for you!
Produce friendly binaries and interfaces in order to increase its popularity and usage.
combine (4.003) karmic; urgency=low * Added extra space when concatenating extracted text in HTMLExtractor.pm * Bugfix certain special characters in regexp when optimizing meta/title in generated XML * Added extra field, score, in table urldb to support new URL scheduling algoritms * Added support for new URL scheduling algorithms -- Anders Ardo <anders@nike> Mon, 15 Jun 2009 10:22:38 +0200 combine (4.002) intrepid; urgency=low * Fixed tests 05MySQL och 90Installationtest * Added support for exceptions to GeoIP, config-file server2country * Now handles special characters when using files for external parsers * Disabled warnings for unknown config variables -- Anders Ardo <anders@dbkit10.eit.lth.se> Mon, 30 Mar 2009 09:09:35 +0200
combine (4.001) intrepid; urgency=low * New version numbering compatible with CPAN * Integration with Solr enterprise search server (http://lucene.apache.org/solr/) similar to Zebra: new module Solr.pm, configuration variable SolrHost, switch SolrIndexing in combineExport
* Added code for simple Lucene integration to templates directory. Contributed by Xianghang Liu * Changed documentation HTML-generator to ht4tex
* Added switches 'collapseinlinks' and 'nooutlinks' to combineExport * Moved tmp-files to /tmp/$$ * decoded output from extconverter to Perl internal utf8 * Added -nodrm to pdf2html switches * Fixed bug in processing of pure text documents * Added switch ZebraIndexing to combineExport. Enables updating of the configured Zebra server with exported records * Improved indexing of PDF documents * Handled case when $md5 empty in DeleteKey * Fixed bug in Zebra recordId handling
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use
Thanks for your rating!
Would you also like to write a review?