Menu

#72 Value was either too large or too small for an Int32

0.1.2.1
closed-fixed
None
3
2015-05-27
2015-05-14
No

Error when try to extract text from the attached PDF

EXCEPTION: System.OverflowException: Value was either too large or too small for an Int32.
   at System.Number.ParseInt32(String s, NumberStyles style, NumberFormatInfo info)
   at org.pdfclown.documents.contents.fonts.CMapParser.ParseUnicode()
   at org.pdfclown.documents.contents.fonts.CMapParser.Parse()
   at org.pdfclown.documents.contents.fonts.Font.Load()
   at org.pdfclown.documents.contents.fonts.Font..ctor(PdfDirectObject baseObject)
   at org.pdfclown.documents.contents.fonts.SimpleFont..ctor(PdfDirectObject baseObject)
   at org.pdfclown.documents.contents.fonts.Type1Font..ctor(PdfDirectObject baseObject)
   at org.pdfclown.documents.contents.fonts.Font.Wrap(PdfDirectObject baseObject)
   at org.pdfclown.documents.contents.FontResources.Wrap(PdfDirectObject baseObject)
   at org.pdfclown.documents.contents.ResourceItems`1.get_Item(PdfName key)
   at org.pdfclown.documents.contents.objects.SetFont.GetResource(IContentContext context)
   at org.pdfclown.documents.contents.objects.SetFont.GetFont(IContentContext context)
   at org.pdfclown.documents.contents.objects.SetFont.Scan(GraphicsState state)
   at org.pdfclown.documents.contents.ContentScanner.MoveNext()
   at org.pdfclown.documents.contents.ContentScanner.TextWrapper.Extract(ContentScanner level)
   at org.pdfclown.documents.contents.ContentScanner.TextWrapper..ctor(ContentScanner scanner)
   at org.pdfclown.documents.contents.ContentScanner.GraphicsObjectWrapper.Get(ContentScanner scanner)
   at org.pdfclown.documents.contents.ContentScanner.get_CurrentWrapper()
   at org.pdfclown.tools.TextExtractor.Extract(ContentScanner level, IList`1 extractedTextStrings)
   at org.pdfclown.tools.TextExtractor.Extract(ContentScanner level, IList`1 extractedTextStrings)
   at org.pdfclown.tools.TextExtractor.Extract(IContentContext contentContext)
1 Attachments

Discussion

  • Stefano Chizzolini

    • assigned_to: Stefano Chizzolini
     
  • Stefano Chizzolini

    • status: open --> closed-fixed
    • Priority: 5 --> 3
     
  • Stefano Chizzolini

    CMap Unicode sequences were not handled.

    Fixed on 0.1.2-Fix branch (rev 220) and 0.2.0 trunk (rev 221).

    thank you

     

Log in to post a comment.

MongoDB Logo MongoDB