Subject: Re: [htdig] Masking out template content with
[noindex_start] and [noindex_end]
On Fri, Jul 4, 2003, Andreas.Mueller@bbw.admin.ch wrote:
> We use ht://Dig as search engine. To make sure that this
> irrelevant) template content is not indexed we have made
heavy use of
> [noindex_start] and [noindex_end] -- in our case we used
> htdig_noindex_end -->' and '<!-- htdig_noindex_start -->'.
> Now I found out that htdig does not seem to consider
these tags as white
> spaces forming separate words.
The fix is to patch htdig/HTML.cc to add a space after
stripping out the noindex_start ... noindex_end section. See
the 3.1.6 fix. A similar approach would to the trick in 3.2.
This is a minor problem, only reported once, because these
tags are usually used each on their own separate line, so the
whitespace is already there most of the time. However, it's
an easy fix. The question is will some users ever expect/
count on the opposite behaviour, i.e. that no space be