Re: [htdig] Question

According to Omer Zimmer:
> I'm using htdig as my search engine and configured it to scan PDF files but 
> when I run htdig/rundig it didn't index my PDF files so I did a little 
> debugging and got the following error:
> <MY PDF FILE> token is application/octet-stream not XXX HTML
> 
> It's very strange since the program does index 2 PDF files but all the rest 
> of the PDFs get the error above.

That error message doesn't look like anything in the htdig code.  What
external parser are you using for PDFs?  Are the PDF files that work
in a different directory than those that don't?  Are there .htaccess
files, or something like that, in either of these directories, that
might define different file types for .pdf files depending on where
they are?  It's looking to me like your web server isn't returning the
same Content-Type header for all PDF files.
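
One way to check this is to ask the server directly what Content-Type
it reports for a PDF that gets indexed and for one that doesn't. The
following is only a rough sketch in Python, not anything that ships
with htdig: the function names and the example.com URLs are made up
for illustration, so substitute URLs from your own site. It sends a
HEAD request for each URL and groups the URLs by the media type the
server returns.

```python
from collections import defaultdict
from urllib.request import Request, urlopen


def fetch_content_type(url, timeout=10):
    """Send a HEAD request and return the Content-Type header (or None)."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return response.headers.get("Content-Type")


def group_by_content_type(content_types):
    """Group URLs by reported media type, ignoring header parameters."""
    groups = defaultdict(list)
    for url, value in content_types.items():
        # Drop parameters such as "; charset=..." and normalise case.
        media_type = (value or "missing").split(";")[0].strip().lower()
        groups[media_type].append(url)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical URLs: replace with one PDF that is indexed and one that is not.
    urls = [
        "http://www.example.com/docs/works.pdf",
        "http://www.example.com/other/fails.pdf",
    ]
    reported = {url: fetch_content_type(url) for url in urls}
    for media_type, members in group_by_content_type(reported).items():
        print(media_type, members)
```

If the PDFs end up in more than one group (say, some under
application/pdf and others under application/octet-stream), that
would point to the server configuration rather than to htdig.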

-- 
Gilles R. Detillieux              E-mail: <gr...@sc...>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930