> *what is the maximum indexing speed?
> *what's the peak query rate?
What's your hardware? How large are the documents that you're
indexing? How fast is your network, or will you be indexing the local
server rather than across the network? For queries, will you be running
other services on the server while queries are performed? How many
documents are likely to be returned? Are you talking about queries on a
single server or a server farm?
No offense, but I can't give you any reasonable number here. Suffice it to
say, ht://Dig is quite fast and well within the realm of commercial
products. Personally, I'd be wary of someone giving you an actual number
from your query alone.
> *what are the rough disk space requirements for the software (not the
> index, I mean for gcc/g++,ht://dig,and any other extra software
> downloads I may need)?
For the software? Depends on what sort of server you're running. Most UNIX
servers have gcc/g++ already installed. The size of the ht://Dig binaries
varies a bit by platform, but is probably in the realm of 2-3MB.
> *what are the webserver requirements?
Hardware? Software? For indexing, ht://Dig can index any webserver that
understands HTTP (i.e. all of them), though there have been reports of
strange quirks with Lotus Notes webservers. For running results, you
simply need the htsearch CGI and a CGI-webserver (i.e. just about
everything). UNIX-based servers are preferred, but there are users who run
ht://Dig on Windows as well--though it's flakier.
Williams Students Online
From: Martin Vorlaender <mv@PDV-SYSTEME.de> - 2002-06-07 05:40:53
Geoff Hutchison wrote:
> > *what are the webserver requirements?
> Hardware? Software? For indexing, ht://Dig can index any webserver that
> understands HTTP (i.e. all of them), though there have been reports of
> strange quirks with Lotus Notes webservers. For running results, you
> simply need the htsearch CGI and a CGI-webserver (i.e. just about
> everything). UNIX-based servers are preferred, but there are
> users who run ht://Dig on Windows as well--though it's flakier.
There also is a VMS port of ht://Dig 3.1.6 (by yours truly) which I
know is run by some DEC^3^HCompaq^6^HHP Customer Support Center to
index the VMS documentation and source listings. The overall feedback
I got was very positive, although some SWISH-E fan criticized the
slowness of indexing - well, ht://Dig offers a lot more than SWISH, or
It works perfectly under the Apache port and, using GET queries, under
Purveyor (an old but rock-solid commercial web server). POST is not an
option there because Purveyor doesn't support stdin.
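For anyone wondering what a GET-only setup looks like in practice, here is a minimal sketch of building an htsearch query URL. The host name and CGI path are made up for illustration, and the helper function is hypothetical. `words`, `method` and `page` are htsearch form inputs, but check the htsearch documentation for your version before relying on them.

```python
from urllib.parse import urlencode


def build_htsearch_url(base_url, words, method="and", page=1):
    """Return a GET URL for an htsearch CGI endpoint (hypothetical helper).

    All parameters travel in the query string, so nothing has to be
    read from stdin on the server side.
    """
    params = {"words": words, "method": method, "page": page}
    # urlencode escapes spaces as '+' and keeps the dict's insertion order
    return f"{base_url}?{urlencode(params)}"


# Assumed endpoint; substitute the location of your own htsearch CGI.
url = build_htsearch_url(
    "http://search.example.com/cgi-bin/htsearch", "vms documentation"
)
print(url)
# http://search.example.com/cgi-bin/htsearch?words=vms+documentation&method=and&page=1
```

In an HTML search form the equivalent is simply setting the form's method to GET instead of POST.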
So long, and thanks | Martin Vorlaender | VMS & WNT programmer
for all the books... | work: mv@...
In Memoriam Douglas Adams | http://www.pdv-systeme.de/users/martinv/
1952-2001 | home: martin@...