From: Geoff H. <ghu...@ws...> - 2002-10-29 14:11:46
|
On Tuesday, October 29, 2002, at 05:22 AM, WASSILIOS.ALEXOPOULOS@LHSYSTEMS.COM wrote: > Can someone tell me how the search algorythm is when indexing: > > start_url http://a.com \ > http://b.com \ > http://c.com I'm really not sure what you're asking. I'm *guessing* that you want to know how particular pages are indexed? The "algorithm" is that all URL links are followed from te start_url, provided they fall within the pattern provided by the limit_urls_to attribute, don't match the exclude_urls patterns, etc. <http://www.htdig.org/attrs.html#limit_urls_to> <http://www.htdig.org/attrs.html#exclude_urls> If this isn't what you mean, please explain in more detail. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ |