Now that PDF is supported as input, wouldn't it be great to have PDF as output too, in the form of "image on text"?
This is when the OCR output is placed invisibly on top of the original image, so that recognition errors do not detract from the reading experience, yet the invisible text can still be searched, copied, pasted, etc.
If not PDF as output, perhaps LibreOffice ODT (OpenDocument Text) would be even better.
Sorry for the late answer, I really wonder why SourceForge won't send me emails about new forum postings…
I'm working on a new release (ported to GTK3), and I'll consider this. Unfortunately I'm quite busy at the moment, so progress isn't as fast as I would like.
I am missing this feature too! But it is great software!
I'll need to check whether Tesseract (the OCR engine I'm using in gImageReader) is able to provide me with sufficient metrics for properly scaling and aligning the output text within the source image.
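To make the scaling and alignment question concrete, here is a rough sketch of the arithmetic involved, not gImageReader's actual code. It assumes the OCR engine reports a pixel bounding box per word with the origin at the top-left of the image, and that the scan resolution (DPI) is known. The names `WordBox`, `place_word` and `glyph_width_em` are hypothetical, and the average glyph width of 0.5 em is a crude guess that a real implementation would replace with proper font metrics.

```python
from dataclasses import dataclass

POINTS_PER_INCH = 72.0  # PDF user space is measured in points


@dataclass
class WordBox:
    """Hypothetical container for one recognized word and its pixel box."""
    text: str
    left: int
    top: int
    right: int
    bottom: int


def place_word(box, image_height_px, dpi, glyph_width_em=0.5):
    """Map a pixel bounding box to a PDF position, font size and stretch.

    Returns (x, y, font_size, h_scale): x and y are in points with the
    origin at the bottom-left of the page, font_size is in points, and
    h_scale is a horizontal scaling percentage that stretches the
    invisible text to the width of the box.
    """
    scale = POINTS_PER_INCH / dpi  # pixels -> points
    x = box.left * scale
    # Image rows grow downwards, PDF coordinates grow upwards.
    y = (image_height_px - box.bottom) * scale
    # Assumption: use the box height as the font size.
    font_size = (box.bottom - box.top) * scale
    target_width = (box.right - box.left) * scale
    # Assumption: each glyph is glyph_width_em wide at the chosen size.
    natural_width = len(box.text) * glyph_width_em * font_size
    h_scale = 100.0 * target_width / natural_width if natural_width > 0 else 100.0
    return x, y, font_size, h_scale


if __name__ == "__main__":
    word = WordBox("text", left=300, top=600, right=500, bottom=650)
    print(place_word(word, image_height_px=3300, dpi=300))
```

The invisible text would then be drawn at `(x, y)` with that font size and horizontal scaling, underneath or on top of the page image. Whether this lines up well depends on how tight the reported boxes are and on baseline information, which this sketch ignores.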