I wanted to test the capabilities of PostgreSQL in storing Indian
language text in Unicode format.
And much to my pleasant surprise, everything seems to be working!
Here is an attempt to build a Kannada dictionary (you could store
any Indian language dictionary in this db).
http://eagle:8180/arun/kandict/lookup_word.jsp
http://eagle:8180/arun/kandict/add_word.jsp
(The first word I added is: "Internet")
I've found that Mozilla 0.9.7 still doesn't correctly handle an
HTTP POST with UTF-8 characters. I'll take it up on their
i18n newsgroups. IE 6.x seems to work just fine, though.
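For anyone who wants to check what their browser sends, here is a minimal sketch of what a correctly encoded UTF-8 form body should look like. It is written in Python just for illustration and is not the JSP code behind the pages above; the field names "word" and "meaning" are made up, not the ones the real form uses.

```python
from urllib.parse import urlencode, parse_qs


def encode_form(fields: dict) -> bytes:
    """Build an application/x-www-form-urlencoded body, percent-encoding
    every non-ASCII character as its UTF-8 bytes."""
    return urlencode(fields, encoding="utf-8").encode("ascii")


def decode_form(body: bytes) -> dict:
    """Decode such a body back into text, assuming the sender used UTF-8."""
    parsed = parse_qs(body.decode("ascii"), encoding="utf-8", strict_parsing=True)
    # parse_qs returns a list per key; this sketch keeps only the first value.
    return {key: values[0] for key, values in parsed.items()}


# The word "Kannada" written in Kannada script, spelled with escapes so the
# example does not depend on the encoding of this message.
word = "\u0c95\u0ca8\u0ccd\u0ca8\u0ca1"

# Hypothetical field names, chosen only for this example.
body = encode_form({"word": word, "meaning": "Internet"})
fields = decode_form(body)
```

A browser that handles the POST properly should send something equivalent to `body`: plain ASCII on the wire, with each Kannada character expanded to three percent-encoded bytes. If the server decodes it with a different charset, the round trip back to the original text fails.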
I know, I know, before I blink, the database will have plenty of
unprintable language from visitors on the net. I intend to keep a
"clean" copy of the dictionary, to which I will push user-submitted
entries after screening.
Feel free to contribute to the dictionary!
-Arun