Menu

#47 Import all parts of PDF, not just images

open
nobody
None
5
2009-01-26
2008-11-28
No

I have experienced the same bug as 1815881 on Ubuntu Hardy 8.04 with gscan2pdf 0.9.26:

"Importing pdf that contains an image only imports the image.
If I create a document in openoffice that contains an image and then export it as pdf, when I try to import it into gscan2pdf it only retrieves the
image and not the rest of the text.

I have not tried creating pdfs with other packages so I don't know if this
is openoffice specific.

I have attached a sample pdf file that displays ok with pdf readers like
Acrobat etc but exhibits the problem described above when importing into gscan2pdf."

Discussion

  • Jeffrey Ratcliffe

    As the comment in 1815881 says, this as designed at the moment. I only added PDF import to allow gscan2pdf to roundtrip. To do something sensible with all possible PDFs is out of the scope of the program at the moment.

     
  • Jeffrey Ratcliffe

    • summary: confirm bug 1815881 Importing pdf that contains an image onl --> Import all parts of PDF, not just images
     
  • kobayashison

    kobayashison - 2009-07-07

    If gscan2pdf would achieve the possibility to import all types of PDF it wouldn't be simply the best gnome OCR and scanning program, but also a great opensource program to manage PDF! It could join pdf, eliminate pages, reordinate pages, renumbering pages, change pdf metadata, view pdf, add pages from scanner, OCRringany pages adding OCRred text invisible layer for document indexers... In two words... The best.

     

Log in to post a comment.