Hello Jeffrey,
 
We had a lot of these errors, along with others. We downloaded the latest PDFBox.jar from the PDFBox website, and that got rid of most of them. Now we only see the "You did not close the PDF document" warning, because some PDFs in DSpace have security turned on. Those PDFs do not allow text extraction, and PDFBox returns an error for them as well.
 
You could ask your users not to turn this security option on.
 
Check your /tmp directory for any temporary pdfxxx files that have not been deleted. If they pile up, your /tmp directory might run out of space.
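
If you would rather check this from Java than by hand, something along these lines might do as a rough sketch. It only lists files and deletes nothing. The class name TmpPdfScan and the "pdf" name prefix are my own assumptions, taken from the pdfxxx pattern above; adjust the prefix and directory to whatever your installation actually leaves behind.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of DSpace or PDFBox.
class TmpPdfScan {

    // Returns the regular files in dir whose names start with prefix.
    // Subdirectories are skipped; nothing is deleted.
    public static List<Path> findLeftovers(Path dir, String prefix) throws IOException {
        List<Path> found = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir, prefix + "*")) {
            for (Path p : stream) {
                if (Files.isRegularFile(p)) {
                    found.add(p);
                }
            }
        }
        return found;
    }

    // Sums the sizes of the given files in bytes.
    public static long totalBytes(List<Path> files) throws IOException {
        long sum = 0;
        for (Path p : files) {
            sum += Files.size(p);
        }
        return sum;
    }

    public static void main(String[] args) throws IOException {
        // Directory defaults to the JVM temp dir (usually /tmp on Linux).
        Path tmp = Paths.get(args.length > 0 ? args[0] : System.getProperty("java.io.tmpdir"));
        // "pdf" is an assumed prefix, based on the pdfxxx names mentioned above.
        List<Path> leftovers = findLeftovers(tmp, "pdf");
        for (Path p : leftovers) {
            System.out.println(p + "  " + Files.size(p) + " bytes");
        }
        System.out.println(leftovers.size() + " file(s), " + totalBytes(leftovers) + " bytes in total");
    }
}
```

Run it as "java TmpPdfScan /tmp" after a filter-media run. If the list keeps growing between runs, the temporary files are not being cleaned up.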
 
Regards,
 
Francis Brouns
-----Original Message-----
From: Jeffrey Trimble [mailto:jtrimble@cc.ysu.edu]
Sent: 22 September 2005 16:44
To: dspace-tech@lists.sourceforge.net
Subject: [Dspace-tech] Filter media and PDF files

I have seen this warning, but since I am not an expert in PDF file creation, would like someone
to comment on this.  Resolutions are welcome:

Creating search index:
java.lang.Throwable: Warning: You did not close the PDF Document
        at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
        at java.lang.ref.Finalizer.invokeFinalizeMethod(Native Method)
        at java.lang.ref.Finalizer.runFinalizer(Finalizer.java:83)
        at java.lang.ref.Finalizer.access$100(Finalizer.java:14)
        at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:160)

I believe this has something to do with the way the PDF document was saved.  Is there
something I need to have our folks who create these documents do before the final
save and uploading of the file onto our dspace server?

TIA,

Jeff

Jeffrey A. Trimble
Systems Librarian
Youngstown State University
Youngstown, OH
jtrimble@cc.ysu.edu
(330) 941-2483