Hi,
We have a problem with filter-media in our DSpace, for one particular item only.
We found this link, which suggests upgrading PDFBox.jar:
http://sourceforge.net/mailarchive/message.php?msg_id=18359568
But it does not solve our problem, as our DSpace version is 1.4.1 and PDFBox.jar is already at 0.7.3.
We hope someone can help us with this issue.
PS: I have attached the PDF file in case you can't download it from the link:
http://researchspace.itss.auckland.ac.nz/handle/2292/87
Regards,
YinYin
=======
Applying Media Filters
ERROR filtering, skipping bitstream #47 java.io.EOFException: Unexpected end of ZLIB input stream
java.io.EOFException: Unexpected end of ZLIB input stream
at java.util.zip.InflaterInputStream.fill(InflaterInputStream.java:216)
at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:134)
at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235)
at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170)
at org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101)
at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132)
at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202)
at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java:110)
at org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java:155)
at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:327)
at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:296)
at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:266)
ERROR filtering, skipping bitstream #48 java.io.EOFException: Unexpected end of ZLIB input stream ....
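=======
For context, the exception at the top of the trace is the one java.util.zip raises when a zlib stream runs out of input before its final block. That is consistent with a Flate-compressed stream inside the PDF being cut short, or with its length being misread. The following is a minimal sketch using only the JDK; it is not taken from PDFBox or DSpace, and the class and method names (TruncatedZlibDemo, deflate, inflateError) are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.util.Arrays;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.InflaterInputStream;

// Hypothetical demo class; not part of PDFBox or DSpace.
class TruncatedZlibDemo {

    // Compress raw bytes into a complete zlib stream.
    static byte[] deflate(byte[] raw) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (DeflaterOutputStream out = new DeflaterOutputStream(buf)) {
            out.write(raw);
        }
        return buf.toByteArray();
    }

    // Read the stream to its end. Returns null if it inflates cleanly,
    // or the EOFException message if the input ends too early.
    static String inflateError(byte[] compressed) throws IOException {
        byte[] chunk = new byte[256];
        try (InflaterInputStream in =
                 new InflaterInputStream(new ByteArrayInputStream(compressed))) {
            while (in.read(chunk) != -1) {
                // discard the decoded bytes; only the outcome matters here
            }
            return null;
        } catch (EOFException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws IOException {
        // Build some sample text to compress.
        StringBuilder text = new StringBuilder();
        for (int i = 0; i < 500; i++) {
            text.append("sample line ").append(i).append('\n');
        }
        byte[] full = deflate(text.toString().getBytes("UTF-8"));

        // Simulate a damaged stream by dropping the second half.
        byte[] cut = Arrays.copyOf(full, full.length / 2);

        System.out.println("intact:    " + inflateError(full));
        System.out.println("truncated: " + inflateError(cut));
    }
}
```

This only shows where the message comes from. It does not tell us whether the PDF itself is damaged or whether PDFBox is handing the decoder too few bytes.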
Logged In: YES
user_id=276600
Originator: NO
Any status on this one? I'm trying to convert a few thousand PDF files to text for indexing and I've run into this issue on 140+ of them.
Logged In: NO
I am also seeing the same problem on a significant minority of PDF files.
Logged In: YES
user_id=601708
Originator: NO
Fixed. Please try tonight's nightly build.
Logged In: YES
user_id=1705973
Originator: YES
Hi Ben,
There is no item listed under the nightly builds: http://www.pdfbox.org/dist/
Rgds,
YinYin