From: <kau...@cs...> - 2005-10-26 11:01:56
Hi,

thanks for the new nutchwax & wera releases! I'm indexing a small test archive, and every PDF file fails to parse. The script parse-pdf.sh is in the bin directory, but the path to it is hard-coded somewhere, in ../plugins/parse-ext/plugin.xml:

051026 125244 adding 2381478 bytes of mimetype application/pdf http://www.helsinki2005.fi/files/pdf/pretraining_camps.pdf
051026 125244 Failed parse: java.io.IOException: java.io.IOException: /home/stack/workspace/archive-access/projects/nutch/bin/parse-pdf.sh: not found

Kaisa
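
P.S. In case it helps anyone track this down, here is a rough Python sketch for locating absolute paths to parse-pdf.sh in a config file such as plugin.xml. It is plain text matching and assumes nothing about the real layout of that file; the function name find_hardcoded_paths and the sample lines are made up for illustration.

```python
import re


def find_hardcoded_paths(text, script_name="parse-pdf.sh"):
    """Return (line_number, path) pairs for absolute paths ending in script_name.

    Plain text scan only: it does not parse XML and knows nothing about
    the actual structure of plugin.xml.
    """
    # An absolute path starts with "/" that is not preceded by a word
    # character, ".", ":" or "/" (this skips relative paths and URLs).
    pattern = re.compile(
        r"(?<![\w.:/])(/[^\s\"'<>]*" + re.escape(script_name) + r")"
    )
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in pattern.finditer(line):
            hits.append((lineno, match.group(1)))
    return hits


# Made-up sample lines (NOT the real plugin.xml format).
SAMPLE = (
    'first="./bin/parse-pdf.sh"\n'
    'second="/home/stack/workspace/archive-access/projects/nutch/bin/parse-pdf.sh"\n'
)

for lineno, path in find_hardcoded_paths(SAMPLE):
    print(f"line {lineno}: {path}")
```

To check a real file, read it first, e.g. find_hardcoded_paths(open("plugin.xml").read()), with the file path adjusted to the local install.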