Extracting coordinates of lines of text?

mike bell
  • mike bell

    mike bell - 2007-03-02

    Hi all,

    I'm new to T. and I've been trying for the past few days to learn as much as possible about how T. works. I would like to be able to extract coordinates of the occurrence of the word on the page so that I can highlight (i.e. I would like to draw a semi-transparent box over the word on the page).

    How do I go about accomplishing that?



    • JetsoftDev.com

      JetsoftDev.com - 2007-04-08

      Here is a windows dll that will do that:

      Also, look at OCR Append char for more info.

      • Gregory Maxwell

        Gregory Maxwell - 2007-04-08

        Where is the source code?  This DLL is useless to people who are not on Windows.

    • Filip Gieszczykiewicz

      "Where is the source code? This DLL is useless to people who are not on Windows."

      Sigh. Tesseract is released under the Apache license and NOT GNU. Thus, there is absolutely NO requirement that authors/developers release source code - they may do so out of the goodness of their heart but the license does not force them to do so.

      With this in mind, please try a more cordial approach...


      P.S. The DLL implements the API which is in the svn and, besides, if you're on non-Windoze, you can't use the DLL, so why ask for it? Get the svn and figure out the API.


Log in to post a comment.

Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:

JavaScript is required for this form.

No, thanks