I'm new to T. and I've been trying for the past few days to learn as much as possible about how T. works. I would like to be able to extract coordinates of the occurrence of the word on the page so that I can highlight (i.e. I would like to draw a semi-transparent box over the word on the page).
How do I go about accomplishing that?
Here is a windows dll that will do that:
Also, look at OCR Append char for more info.
Where is the source code? This DLL is useless to people who are not on Windows.
"Where is the source code? This DLL is useless to people who are not on Windows."
Sigh. Tesseract is released under the Apache license and NOT GNU. Thus, there is absolutely NO requirement that authors/developers release source code - they may do so out of the goodness of their heart but the license does not force them to do so.
With this in mind, please try a more cordial approach...
P.S. The DLL implements the API which is in the svn and, besides, if you're on non-Windoze, you can't use the DLL, so why ask for it? Get the svn and figure out the API.
Log in to post a comment.
Sign up for the SourceForge newsletter:
You seem to have CSS turned off.
Please don't fill out this field.