From: Olivier K. <oli...@re...> - 2003-03-27 15:14:25
|
Hi, Anne Durand said : > When I run on command line > doc2html.pl /full/path/to/sample/Maison_Guiette.pdf "application/pdf" > url I don't get any error and the parsing looks ok. > > The htdig.conf file contains > external_parsers: application/pdf /usr/local/bin/doc2html.pl > > When I run htdig, I get the following errors : > Error (0): PDF file is damaged - attempting to reconstruct xref table... > Error: Top-level pages object is wrong type (null) > Error: Couldn't read page catalog > External parser error in line:<HTML> > URL: > http://www.archi.fr/UIA/htmEdifices/DOCOMOMO/Belgium/Maison_Guiette.pdf > External parser error in line:<HEAD> > .... I think this is a FAQ. max_doc_size defaults to 100000. Your PDF file is over 300000 bytes ! OK, it's FAQ #5.2 : <http://www.htdig.org/FAQ.html#q5.2> Regards, Olivier -- Olivier Korn. Strasbourg, France. |