Thank you Thank you Thank you!
With the training code out, interest should really pick up as languages other than English are supported. There are some interesting projects out there: people are probing Tesseract's capabilities and operations to see where it can be used to fill a void.
Hi, I'm an Informatics Engineering student from Portugal and I'm about to start a project around Tesseract/Gnome Scan. Since Tesseract, as I heard, doesn't support any language other than English, I was thinking about developing Portuguese support.
The problem is that I don't know how or where to start... could you point me in some direction? ;)
See http://tesseract-ocr.repairfaq.org/ and then Ray Smith's comments on this matter at http://sourceforge.net/forum/message.php?msg_id=4023187 . Basically, V1.03 puts the training code back in the release (it was not in 1.02), but the documentation is all in Ray's head :-)
That means that it will take some time for other languages to be supported.
You are not the only one looking into uniting Tess with Gnome AND also adding another language. These folks are also working on this: