[Indic-computing-users] NEWS: OCR'ing in Telugu...
Status: Alpha
Brought to you by:
jkoshy
From: Frederick N. (FN) <fr...@by...> - 2003-07-05 05:20:06
|
--__--__-- Message: 1 From: Sumeet Madhukar Moghe <cus...@as...> To: LIH <lin...@li...>, LIG <lin...@li...>, LISubmit <lin...@li...> Date: 04 Jul 2003 13:44:57 +0530 Subject: [LIG] [Fwd: [TwinCLinG] Velugu] Reply-To: lin...@li... -----Forwarded Message----- From: Nikhil Shanker <nik...@ac...> To: TwinCLinG <il...@ya...> Subject: [TwinCLinG] Velugu Date: 04 Jul 2003 01:49:11 +0530 Hello people. As promised at the last ILUG-HYD meet, what you saw in the meet has been put online as a free (as in beer) service to anybody who is interested in OCR'ing Telugu Document Images with their layout preserved. You can submit an image (preferrably scanned at >250dpi) of any Telugu text and get back formatted HTML in UTF-8. That makes your text searchable, editable and indexable. It has a variety of uses. Talking of all that is beyond the scope of this list. Check out http://lihkin.net/velugu/ for more on that. This is an initial launch, so expect a lot of errors/problems. Please let me know if you find any. As mentioned in an earlier post, all flames on the the accuracy of the OCR can be redirected to Chandra Kanth and all your flames on the layout (that includes tables, font sizes, et al) restoration can be showered on me. Its a work in progress, so I'll keep fixing stuff as it goes. Nikhil. PS: Yeah, I never tried my hand at web-designing. I know I suck. Thank you. -- Nikhil Shanker (nikhil.shanker at acm.org) Slackware Linux http://www.slackware.com/ I guess that's why people care: Simplicity is Divine. ----------------------------------------------- Next Meeting 26 July 2003, 6:00 PM, ESCI-IT City Center, 6-1-85/1 &2 2nd Floor, Opp. Telephone Bhavan, Saifabad Our Website: http://ilug-hyd.org.in (*not too much uptodate*) ----------------------------------------------- |