From: Gilles D. <gr...@sc...> - 2002-04-09 22:23:51
|
According to Omer Zimmer: > I'm using htdig as my search engine and configured it to scan PDF files but > when I run htdig/rundig it didn't index my PDF files so I did a little > debugging and got the following error: > <MY PDF FILE> token is application/octet-stream not XXX HTML > > Its very strange since the program does index 2 PDF files but all the rest > of the PDFs get the error above. That error message doesn't look like anything in the htdig code. What external parser are you using for PDFs? Are the PDF files that work in a different directory than those that don't? Are there .htaccess files, or something like that, in either of these directories, that might define different file types for .pdf files depending on where they are? It's looking to me like your web server isn't returning the same Content-Type header for all PDF files. -- Gilles R. Detillieux E-mail: <gr...@sc...> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 |