Menu

#395 Unexpected end of ZLIB input stream

closed-fixed
None
5
2007-10-22
2007-01-31
ylatt
No

Hi,

We have a problem in filter-media in our dspace for particular item only.

And found the link to upgrade PDFBox.jar .
http://sourceforge.net/mailarchive/message.php?msg_id=18359568

But it is not solve our problem as our dspace version is 1.4.1 and PDFBox.jar already in 0.7.3.

Hope anyone can help us on this issue.

PS: I have attached the pdf file incase you can't download the file from the link
http://researchspace.itss.auckland.ac.nz/handle/2292/87

Regards,
YinYin

=======
Applying Media Filters
ERROR filtering, skipping bitstream #47 java.io.EOFException: Unexpected end of ZLIB input stream
java.io.EOFException: Unexpected end of ZLIB input stream
at
java.util.zip.InflaterInputStream.fill(InflaterInputStream.java:216)
at
java.util.zip.InflaterInputStream.read(InflaterInputStream.java:134)
at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:97)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235)
at
org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170)
at
org.pdfbox.pdfparser.PDFStreamParser.<init>(PDFStreamParser.java:101)
at org.pdfbox.cos.COSStream.getStreamTokens(COSStream.java:132)
at
org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:20
2)
at
org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
at
org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
at
org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
at
org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
at
org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
at
org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java
:110)
at
org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java
:155)
at
org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte
rManager.java:327)
at
org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana
ger.java:296)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt
erManager.java:266)
ERROR filtering, skipping bitstream #48 java.io.EOFException: Unexpected end of ZLIB input stream ....

Discussion

  • ylatt

    ylatt - 2007-01-31
     
  • Ben Litchfield

    Ben Litchfield - 2007-02-14
    • assigned_to: nobody --> benlitchfield
     
  • Cameron Taggart

    Cameron Taggart - 2007-08-13

    Logged In: YES
    user_id=276600
    Originator: NO

    Any status on this one? I'm trying to convert a few thousand PDF files to text for indexing and I've run into this issue on 140+ of them.

     
  • Nobody/Anonymous

    Logged In: NO

    I am also seeing the same problem on a significant minority of pdf files.

     
  • Ben Litchfield

    Ben Litchfield - 2007-10-22

    Logged In: YES
    user_id=601708
    Originator: NO

    fixed, please try tonights nightly build

     
  • Ben Litchfield

    Ben Litchfield - 2007-10-22
    • status: open --> closed-fixed
     
  • ylatt

    ylatt - 2007-10-23

    Logged In: YES
    user_id=1705973
    Originator: YES

    Hi Ben

    there is no item under nightly built. http://www.pdfbox.org/dist/

    Rgds,
    YinYin

     

Log in to post a comment.