java.io.IOException: Error: Expected an integer type, actual
java.io.IOException: Error: Expected an integer type, actual='L?s'
    at org.pdfbox.pdfparser.BaseParser.readInt(BaseParser.java:1213)
    at org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:370)
    at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:144)
    at org.pdfbox.ExtractText.parseDocument(ExtractText.java:297)
    at org.pdfbox.ExtractText.getTextContent(ExtractText.java:262)
    at org.pdfbox.ExtractText.main(ExtractText.java:92)
Exception in thread "main"
When I try to parse a PDF file I get the above exception.
Could you please help me out with this?
Logged In: YES
user_id=601708
Is it possible for you to mail me or attach the PDF document?
ben@csh.rit.edu
Logged In: YES
user_id=1117929
I can't attach the PDF here.
Logged In: YES
user_id=601708
Then please mail it to me or send a link to the PDF document.
Logged In: YES
user_id=601708
I still don't have a PDF document for this yet. I can't fix this
issue if I can't reproduce it.
Ben
Logged In: YES
user_id=601708
I believe this issue is caused by PDF documents that use the
object streams introduced in PDF 1.5.
If you open the PDF in a text editor and find the string
'ObjStm', the document uses object streams. PDFBox does not
support object streams yet, but this is the next RFE to be
implemented.
Ben
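The text-editor check described above can also be scripted. The following is a minimal sketch, not part of PDFBox: the class name ObjStmCheck and its methods are made up for illustration, and it only does a raw byte search for 'ObjStm'. That is a heuristic, since the string could in principle appear in a file for other reasons.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper (not a PDFBox class): reports whether a file
// contains the ASCII string "ObjStm" anywhere in its raw bytes.
final class ObjStmCheck {
    private static final byte[] MARKER =
            "ObjStm".getBytes(StandardCharsets.US_ASCII);

    // Naive byte-wise substring search; fine for a one-off diagnostic.
    static boolean containsMarker(byte[] data) {
        outer:
        for (int i = 0; i + MARKER.length <= data.length; i++) {
            for (int j = 0; j < MARKER.length; j++) {
                if (data[i + j] != MARKER[j]) {
                    continue outer;
                }
            }
            return true;
        }
        return false;
    }

    // Reads the whole file into memory, then searches it.
    static boolean usesObjectStreams(Path pdf) throws IOException {
        return containsMarker(Files.readAllBytes(pdf));
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java ObjStmCheck <file.pdf>");
            return;
        }
        Path pdf = Paths.get(args[0]);
        if (usesObjectStreams(pdf)) {
            System.out.println(pdf + ": found 'ObjStm', likely uses object streams");
        } else {
            System.out.println(pdf + ": 'ObjStm' not found");
        }
    }
}
```

Run it with the path of the failing PDF as the only argument. If the string is found, the parse error is probably the object stream problem described in this comment.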
Logged In: YES
user_id=601708
If this is due to PDF 1.5 object streams, then it has been fixed
in CVS. Please let me know if it is still a problem with
tonight's nightly build.
Ben