|
From: Charles F. <Cha...@bn...> - 2005-10-26 12:29:00
|
Hi Kaisa, you could execute: find / -name parse-pdf.sh This will tell you where parse-pdf.sh is located (e.g. /bin/parse-pdf.sh) Then, in the file ../plugins/parse-ext/plugin.xml, replace the line /home/stack/workspace/archive-access/projects/nutch/bin/parse-pdf.sh with the real location of the parse-pdf.sh script (e.g. /bin/parse-pdf.sh) Charlie change ----- Original Message ----- From: <kau...@cs...> To: <arc...@li...> Sent: Wednesday, October 26, 2005 1:01 PM Subject: [Archive-access-discuss] Path to parse-pdf.sh Hi, thanks for the new nutchwax&wera releases! I'm indexing a small test archive and all pdf files create errors. The script parse-pdf.sh is in bin directory, but path to it is hard coded somewhere, in ../plugins/parse-ext/plugin.xml 051026 125244 adding 2381478 bytes of mimetype application/pdf http://www.helsinki2005.fi/files/pdf/pretraining_camps.pdf 051026 125244 Failed parse: java.io.IOException: java.io.IOException: /home/stack/workspace/archive-access/projects/nutch/bin/parse-pdf.sh: not found Kaisa ------------------------------------------------------- This SF.Net email is sponsored by the JBoss Inc. Get Certified Today * Register for a JBoss Training Course Free Certification Exam for All Training Attendees Through End of 2005 Visit http://www.jboss.com/services/certification for more information _______________________________________________ Archive-access-discuss mailing list Arc...@li... https://lists.sourceforge.net/lists/listinfo/archive-access-discuss |