From: Steve Lewis <slewis@lu...> - 2004-09-17 13:27:31
I'm new to HtDig and have one issue that is bothersome but not a big
problem. Everytime someone uses the search engine on our site I get a
message from our cron job as follows:
PDF::parse: cannot find pdf parser /usr/local/bin/acroread
With a subject line of:
Cron <root@...> /usr/local/bin/rundig
Do I need to put acrobat up on the system or is there some way I can
stop these messages.
Manager of IT, QA and Manufacturing
732 357-3523 Voice
732 618-6006 Cell
866 213-5250 Pager
On Fri, 17 Sep 2004, Steve Lewis wrote:
> I'm new to HtDig and have one issue that is bothersome but not a big
> problem. Everytime someone uses the search engine on our site I get a
> message from our cron job as follows:
> PDF::parse: cannot find pdf parser /usr/local/bin/acroread
It sounds like htdig is encountering some PDFs and trying to use the
default handling mechanism, which is failing due to acroread not being
found. If you really want to index the PDFs, you should probably start by
reading http://www.htdig.org/FAQ.html#q4.9. If you don't care about the
PDFs and just want to get rid of the message, it would probably be easiest
to just add .pdf to the bad_extensions attribute.
Btw, I suspect what is happening is that you are getting the message you
refer to each time cron tries to execute rundig. Not each time someone
uses the search engine on your site. The site search calls htsearch which
doesn't try to parse PDFs or do anything with cron.
Get latest updates about Open Source Projects, Conferences and News.