From: Gustave Stresen-R. <ted...@ma...> - 2005-12-09 12:08:40
|
Neal, I've been reading, with interest, the posts on the blog. I have a few of questions so far. - Is htdig a competitor to Nutch? If not, could you take a few minutes to clarify the differences between the two? - What, if any, modifications to the ranking engine will be made in 4.0 (saw the note about back-links and anchor texts - what about incoming links from other domains)? - It seems the goal is to create a library that can be included in other programs. Will the library include all the code for spidering, creating the indexes, and searching or just the database creation stuff, or something else...? - Are there any security considerations that should be addressed at this early stage (sanitizing of URL parameters, for example) I'm not a C developer, but I'm more than happy to try building the project on Linux and Mac OS X (10.3). Is there a 4.0 branch in CVS or will we have to wait for you to tag it? Thanks for the work. Gustave (Ted) Stresen-Reuter On Dec 8, 2005, at 6:05 PM, Neal Richter wrote: > Hey all, > > We've been making good progress on HtDig 4.0 > > You can see the progress updates on this blog. > > http://htdig.blogspot.com/ > > Thanks. > > -- > Neal Richter > Sr. Researcher and Machine Learning Lead > Software Development > RightNow Technologies, Inc. > Customer Service for Every Web Site > Office: 406-522-1485 > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > ht://Dig Developer mailing list: > htd...@li... > List information (subscribe/unsubscribe, etc.) > https://lists.sourceforge.net/lists/listinfo/htdig-dev |