From: Manuel L. <ml...@ac...> - 2005-07-19 22:45:36
|
Hello, on 07/19/2005 07:14 PM Neal Richter said the following: > 3.2b6 is either done or dead. I suppose that either me or Geoff need to > finish it off and call it 3.2 and update the website. I've called for a > psuedo-vote on this several times with silence being the general response. The lack of visible activity and the fact that many of us us htdig as is without problems made people believe that voting would irrelevant. Anyway, if you still think you need votes to go ahead, here is a +1 on my behalf. > On a more positive note Anthony Arnone (Montana State Univ. grad student) > and I have started active development of HtDig 4.0. It will be a merge of > HtDig + CLucene with a significant amount of code for the existing > Berkeley DB based WordDb being flushed. > > The main impetous for this is Unicode support and a speed and index size > improvement. > > We expect to produce a decently detailed refactoring document next week > and create a 4.0 CVS branch then. Great. I hope that will allow us to do things like making Htdig crawl individual pages and only update their entries in the index. That is what miss most in the current HTDig version. I make htdig crawl the static version of my site every day, but that is not very efficient and often it is too late. I can keep track of all pages that change and need to reindexed, but it is odd to make Htdig crawl the hole site just because a few pages changed. I would be more satisfied if I could just tell htdig once an hour to reindex a limited list of pages that changed. -- Regards, Manuel Lemos PHP Classes - Free ready to use OOP components written in PHP http://www.phpclasses.org/ PHP Reviews - Reviews of PHP books and other products http://www.phpclasses.org/reviews/ Metastorage - Data object relational mapping layer generator http://www.meta-language.net/metastorage.html |