I'm obviously not an OCR expert, hence my question. It would be great if there were a way to import the images (I mean graphics and illustrations) from the OCRed text, plus some basic formatting (RTF, ODF), but I don't know if that's possible or has already been discussed.
gImageReader is basically a front-end to tesseract-ocr. I would need to double-check, but I don't think the Tesseract API currently exposes enough information to extract illustrations.