What stands out to me is your mention that changing the file
extension to .html makes things work okay. That makes me think there
must be a mismatch between the MIME type mapping you have set
in the htdig config and the MIME type your web server is sending
for these files.
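
One quick way to test this guess is to ask the web server directly what Content-Type it returns for a .shtml page and for a .html page, then compare the two. The script below is only a rough sketch: the helper names (media_type, is_html, fetch_content_type) are made up for illustration, and it assumes the server answers HEAD requests. If the .shtml page comes back as something other than text/html (some older Apache setups used text/x-server-parsed-html, for instance), that would support the mismatch theory.

```python
import sys
from urllib.request import Request, urlopen


def media_type(content_type_header):
    """Return the bare media type, e.g. 'text/html' from 'Text/HTML; charset=x'."""
    return content_type_header.split(";", 1)[0].strip().lower()


def is_html(content_type_header):
    """True when the header's media type is exactly text/html."""
    return media_type(content_type_header) == "text/html"


def fetch_content_type(url):
    """Send a HEAD request and return the raw Content-Type header ('' if absent)."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=10) as response:
        return response.headers.get("Content-Type", "")


if __name__ == "__main__":
    # Pass one or more page URLs on the command line, e.g. one .shtml and one .html.
    for url in sys.argv[1:]:
        header = fetch_content_type(url)
        print(url, "->", header or "(none)", "| HTML:", is_html(header))
```

Run it with the URL of one .shtml page and one .html page from the site as arguments and compare the two lines of output.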
If I am right, and you need more help, let me know...
[mailto:htdig-general-bounces@...] On Behalf Of
Sent: Wednesday, November 01, 2006 5:12 PM
Subject: [htdig] Problem indexing shtml files
I am trying to set up htdig to search our site, and have run into
a problem. Most pages on our site are .shtml files, as they use server
side includes to pull in common graphics, menu structure, and sidebars on
each page. I would like to index only by keywords and descriptions and
would like to replace the normal excerpt with the meta description from
Here is where the problem occurs: if I set all of the index
factors to 0 except title, keywords, and description, nothing gets
indexed. If I set the text_factor to something above 0, then all of the
files get indexed but use_meta_description does not work.
If I rename the files to .html files everything indexes fine and
use_meta_description works like a charm. Of course, by doing this
none of my pages would display properly.
Is there a way around this outside of renaming the files to
.html and turning on the xbithack?
I am using version 3.1.6 of htdig (provided by an isp).
I have read some information in the mailing list archive etc but
I could not find anything specific to this issue. To make sure pages
would get indexed I created an html file with links to most of the pages
on the site (as a lot of the page links are in the includes and have
http:/ addresses as there was something mentioning that file
based searches had issues with .shtml files without changing some of the