I have seen it can happens that some text is not recognized after the
OCR. It can happens that the title is not recognized as text. I assume
it's because the characters are much bigger and sometimes with different
colors. One time I had also problem with text with a red background.
Almost nothing was recognized. If something is not recognized, is there
a way to add a box with some text ? Such that when we do a search after
in the pdf, that text will also be found ?
I know I can correct or add some text in the created boxes after the
OCR. I mean really, adding a box, because for example a title is not
recognized as text.
On 10 March 2013 14:46, Cédric Macquat <cmacquat@...> wrote:
> Almost nothing was recognized. If something is not recognized, is there
> a way to add a box with some text ? Such that when we do a search after
At the moment, there is no way of adding a box. Please file a feature
request so that this doesn't get forgotten.
The whole issue of correcting OCR output needs work, which I will get
to as soon as I have finished refactoring the rest of the codebase to
allow regression tests for as much as is possible.