Thread: [Indic-computing-devel] Re: [smc-devel] FW: Malayalam Cillaksharams (was Ya-phalaa)

Status: Alpha

Brought to you by: jkoshy

indic-computing-devel

[Indic-computing-devel] Re: [smc-devel] FW: Malayalam Cillaksharams (was Ya-phalaa)

From: Baiju M <bai...@ly...> - 2003-03-06 09:22:39

If none of ZWJ and ZWNJ is specified after a virama it should be rendered as an explicit virama.
ie., NA + VIRAMA (should render as NA+VIRAMA)

If we are adopting the new Malayalam Inscript layout propsed by Kerala govt. (which has got single keys for 5 chillus ), Inputing chillus will be very easy (It can impliment very easily by using Compose keys in GNU/Linux)

Regards,
Baiju M

On Wed, 5 Mar 2003 22:09:51   
 Andy White wrote:
>"This is part of the improvements to [the] Unicode [Standard] that have
>been made for [Version] 4.0"
>
>[...]
>> To reiterate our consistency in using this model, I will give you a 
>> Malayalam example.
>> 
>> NA + VIRAMA + MA --> NMA (a single conjunct)
>> NA + VIRAMA + ZWNJ + MA --> NMA (with a visible virama breve 
>> above and between) NA + VIRAMA + ZWJ + MA --> NMA (with the 
>> cillaks.aram virama curl)
>[...]
>> Michael Everson
>
>I have always advocated the use of the above mentioned rendering rules
>for Malayalam. However, people trying to Implement Malayalam in Unicode
>have periodically asked on this list the question of how to encode
>chillu forms.
>Due to the lack of any official response, those implementations that I
>have seen unfortunately do not follow the above rules.
>***If you have such an implementation, Especially anyone responsible for
>a rendering subsystem, please take note***.
>
>Andy
>
>
>
>
>_______________________________________________
>smc-devel mailing list
>smc...@no...
>http://mail.nongnu.org/mailman/listinfo/smc-devel
>

_____________________________________________________________
Get 25MB, POP3, Spam Filtering with LYCOS MAIL PLUS for $19.95/year.
http://login.mail.lycos.com/brandPage.shtml?pageId=plus&ref=lmtplus

[Indic-computing-devel] javascript indic renderer and community portals

From: Arun M <ar...@fr...> - 2003-05-05 04:33:52

Attachments: Screenshot.png

Hi Friends,

   We see a lot of community portals coming up these days based on
CMS like *nuke. But one of these CMS supports Indic. Also we dont see
any community discussion boards in Indian languages (Pls correct me). 

Some issues I see are:

 - There is no indic(Unicode/Non unicode) support in most platforms. 
 - Most browsers doesnt support unicode. 

Any community portal we build should be based on font encoding
systems, at least for some more time. A idea is store
the data in Unicode and then convert to font encoding at the server
side.  A special proxy should help here.

Second issues is entering data from the client side. This is major
prob. Most of the sites uses Java applets for entering the data in
local languages. This may require good amount of modification in the
CMS we have now like *nuke. 

What about Javascript ?  When user types in a text field using some
Transliteration scheme or Inscript KeyBoard layout convert it to font
encoding scheme and show Indic on the fly. Data will be posted to CMS
based on unicode or font encoding scheme. Theme change should be sufficiant 
get *nukes support Indic(I've not looked into *nuke code in detail. Just a guess) 

Yesterday I made a proto of this. A crude one. It works. 

Do you think this will be of use ? If so I will work on it and
make the code better and generic(I dont know Javascript much , will have to
learn it first). 


Arun.

[Indic-computing-devel] Re: javascript indic renderer and community portals

From: <al...@ya...> - 2003-05-05 06:27:33

> 
>    We see a lot of community portals coming up these days based on
> CMS like *nuke. But one of these CMS supports Indic. Also we dont see
> any community discussion boards in Indian languages (Pls correct me). 

http://groups.yahoo.com/group/linux-bangalore-hindi/
Well, not exactly a discussion board, but a mailing list.

Wonder if you have seen this(doesn't need applents but is of course still not
the perfect input method):
http://www.wandel.subnet.dk/hindi.html


 
> Some issues I see are:
> 
>  - There is no indic(Unicode/Non unicode) support in most platforms. 

> 
Something about the social aspect of using Indic scripts on the web: While
these are valid points, the browsers most people use do support unicode.(all
right flame me but face the truth first!) Unless the existing users, who use
English, start using Indic languages in mailing lists and boards, users who
don't know English very well are not going to get much encouragement either.
Developing a user base is crucial because that's where future developers would
emerge from. 

>  - Most browsers doesnt support unicode. 
I think most of them do, thanks to CJKV. What they *don't* support is indic
script rendering.

> Arun.
> 
> 
> 

> ATTACHMENT part 2 image/png name=Screenshot.png
 

=====
Contribute to http://indic-computing.sourceforge.net
+91-80-653-8200
http://groups.yahoo.com/group/linux-bangalore-hindi/
Can't see Hindi? http://geocities.com/alkuma/seehindi.html
http://9211.blogspot.com Set View->Encoding->UTF-8
à¤à¤ªà¤à¥ à¤¸à¥à¤à¤¾à¤µ à¤¶à¥à¤à¥à¤°à¤¤à¤¾ à¤¸à¥ à¤¤à¤à¥ à¤¸à¥à¤µà¥à¤à¤¾à¤° à¤¹à¥à¤à¤à¥ à¤à¥ à¤à¤¬ à¤à¤ª à¤à¤¨à¥à¤¹à¥à¤ à¤²à¤¾à¤à¥ à¤à¤°à¤¨à¥ à¤à¤¾ à¤¬à¥à¤¡à¤¼à¤¾ à¤à¥ à¤¸à¥à¤µà¤¯à¤ à¤à¤ à¤¾à¤¯à¥à¤à¤à¥à¥¤

________________________________________________________________________
Missed your favourite TV serial last night? Try the new, Yahoo! TV.
       visit http://in.tv.yahoo.com

Re: [Indic-computing-devel] Re: javascript indic renderer and community portals

From: Arun M <ar...@fr...> - 2003-05-06 03:02:40

 
> Something about the social aspect of using Indic scripts on the web: While
> these are valid points, the browsers most people use do support unicode.(all
> right flame me but face the truth first!) Unless the existing users, who use
> English, start using Indic languages in mailing lists and boards, users who
> don't know English very well are not going to get much encouragement either.
> Developing a user base is crucial because that's where future developers would
> emerge from. 

The old chicken, egg prob. 
 
> >  - Most browsers doesnt support unicode. 
> I think most of them do, thanks to CJKV. What they *don't* support is indic
> script rendering.

Sorry for the mistake. I think you got the point.

arun.

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Suryaprakash K. <kom...@ce...> - 2003-05-05 12:18:07

Hello,
> What about Javascript ?  When user types in a text field using some
> Transliteration scheme or Inscript KeyBoard layout convert it to font

	For collecting data on Indic scripts, we had created an interface
that uses Java. I had used ITRANS transliteration and the default java
fonts to come up with an input window that accepts transliteration in
Latin script and outputs Devanagari on another window.
	I am not familiar with CMS or *nuke - but with what I gather
reusable java classes could be a good way to look at it. We can develop
classes to work with other languages -
	Right now, the tool might be a bit bulky to work with since the
package has a lot of Image Processing related code to go with it. But
its' good to test offline - Ppl can take a look at it here -
www.cedar.buffalo.edu/ilt
	If its' useful, we can plan to make the input system independant
of the IP part -

> http://www.wandel.subnet.dk/hindi.html
	I tried the site out - it didnt need applets - but it didnt work
on netscape either - gave me a message saying it needs windows to work
with but then behaved OK on mozilla. :) :( What it is doing is - it
takes the unicode value for the character, converts it into int and this
is what we can do with the o/p -

<html>
&#2313;&#2314;&#2392;&#2393;
</html>

	But since what it is in the background is basically unicode, we
still need a *properly configured* browser to view the stuff - It came
out properly on Mozilla, didnt show up on netscpape. Pretty nifty - but
the addition of transliterated/keyboard based input might be more welcome.
The characters present in the interface are the ones from the unicode
table for Devanagari.

Bye,
Surya
  __       __
 |__      |__|    |/
  __|urya |rakash |\ompalli

Email - kom...@cs...
Phone - Home    - (716) 834 6859
      - Work    - (716) 645 6164 X 519
Website - http://www.cse.buffalo.edu/~kompalli


On 5 May 2003, Arun M wrote:

> Hi Friends,
>
>    We see a lot of community portals coming up these days based on
> CMS like *nuke. But one of these CMS supports Indic. Also we dont see
> any community discussion boards in Indian languages (Pls correct me).
>
> Some issues I see are:
>
>  - There is no indic(Unicode/Non unicode) support in most platforms.
>  - Most browsers doesnt support unicode.
>
> Any community portal we build should be based on font encoding
> systems, at least for some more time. A idea is store
> the data in Unicode and then convert to font encoding at the server
> side.  A special proxy should help here.
>
> Second issues is entering data from the client side. This is major
> prob. Most of the sites uses Java applets for entering the data in
> local languages. This may require good amount of modification in the
> CMS we have now like *nuke.
>
> Yesterday I made a proto of this. A crude one. It works.
>
> Do you think this will be of use ? If so I will work on it and
> make the code better and generic(I dont know Javascript much , will have to
> learn it first).
>
>
> Arun.
>
>
>

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Krishnamurthy N. <kn...@ya...> - 2003-05-12 12:32:58

Hi Suryaprakash, Arun and others,

A good, generic transliteration library for the Indian
languages/scripts is what is needed, IMHO. As anyone
who has studied the structure of Indian languages and
scripts would appreciate, there is good structure in
the phonetic input and how the input syllables are
encoded using a script and how they are mapped to a
series of simple or composite glyphs.

In a web-based client-server model of app development,
unlike Latin scripts/languages, it would be more
appropriate to do input processing, transliteration
and glyph composition for rendering on the server
side. So, using Javascript may not be that feasible to
achieve this. At the same time, if all processing is
on the server side, then the application can't be
really that interactive (such as showing the display
to the user for each syllable typed in). Some
intermediate soluton would be needed. Also, using PHP
may be a better idea than Javascripts or jsp. What do
you folks think ?

A couple of weeks back, I made an enhancement to my
transliteration library (translib under the
indic-computing project on sourceforge) to take a
'word' in an Indian language encoded as a sequence of
Unicode characters (in UTF-8 format), kind of map it 
(using a user-defined lookup mapping file) to the most
appropriate Roman phonetic input and then apply the
transliteration rules for that language+script by
looking up the transliteration rule file. The output,
as before, is a sequence of symbolic glyph names that
correspond to glyph indices in a given font file. This
can be fed to a font reading and rendering library
such as freetype2 for final display (Koshy wrote a
python script to do this using gozer, but he is now
replacing gozer with a python wrapper to ft2).

I tried out this utf8-to-final-glyph rendering for
Hindi+Devanagari with very minimal mapping done and
did some prelim testing and it's ok. All the
intelligence is in the user-defined mapping files and
the source code itself has no knowledge of any Indian
language.

Unicode is neither an input mapping scheme nor a glyph
mapping scheme; it's just an encoding scheme, as all
of us know. It has limitations, but with a sound
transliteration library in place, utf-8 can be used
for 
storing Indian language content for further processing
(search, sort, display etc etc).

cheers,
Nagarajan

--- Suryaprakash Kompalli <kom...@ce...>
wrote:
> Hello,
> > What about Javascript ?  When user types in a text
> field using some
> > Transliteration scheme or Inscript KeyBoard layout
> convert it to font
> 
> 	For collecting data on Indic scripts, we had
> created an interface
> that uses Java. I had used ITRANS transliteration
> and the default java
> fonts to come up with an input window that accepts
> transliteration in
> Latin script and outputs Devanagari on another
> window.
> 	I am not familiar with CMS or *nuke - but with what
> I gather
> reusable java classes could be a good way to look at
> it. We can develop
> classes to work with other languages -
> 	Right now, the tool might be a bit bulky to work
> with since the
> package has a lot of Image Processing related code
> to go with it. But
> its' good to test offline - Ppl can take a look at
> it here -
> www.cedar.buffalo.edu/ilt
> 	If its' useful, we can plan to make the input
> system independant
> of the IP part -
> 
> > http://www.wandel.subnet.dk/hindi.html
> 	I tried the site out - it didnt need applets - but
> it didnt work
> on netscape either - gave me a message saying it
> needs windows to work
> with but then behaved OK on mozilla. :) :( What it
> is doing is - it
> takes the unicode value for the character, converts
> it into int and this
> is what we can do with the o/p -
> 
> <html>
> ????
> </html>
> 
> 	But since what it is in the background is basically
> unicode, we
> still need a *properly configured* browser to view
> the stuff - It came
> out properly on Mozilla, didnt show up on netscpape.
> Pretty nifty - but
> the addition of transliterated/keyboard based input
> might be more welcome.
> The characters present in the interface are the ones
> from the unicode
> table for Devanagari.
> 
> Bye,
> Surya

> On 5 May 2003, Arun M wrote:
> 
> > Hi Friends,
> >
> >    We see a lot of community portals coming up
> these days based on
> > CMS like *nuke. But one of these CMS supports
> Indic. Also we dont see
> > any community discussion boards in Indian
> languages (Pls correct me).
> >
> > Some issues I see are:
> >
> >  - There is no indic(Unicode/Non unicode) support
> in most platforms.
> >  - Most browsers doesnt support unicode.
> >
> > Any community portal we build should be based on
> font encoding
> > systems, at least for some more time. A idea is
> store
> > the data in Unicode and then convert to font
> encoding at the server
> > side.  A special proxy should help here.
> >
> > Second issues is entering data from the client
> side. This is major
> > prob. Most of the sites uses Java applets for
> entering the data in
> > local languages. This may require good amount of
> modification in the
> > CMS we have now like *nuke.
> >
> > Yesterday I made a proto of this. A crude one. It
> works.
> >
> > Do you think this will be of use ? If so I will
> work on it and
> > make the code better and generic(I dont know
> Javascript much , will have to
> > learn it first).
> >
> >
> > Arun.
> >

__________________________________
Do you Yahoo!?
The New Yahoo! Search - Faster. Easier. Bingo.
http://search.yahoo.com

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Suryaprakash K. <kom...@ce...> - 2003-05-12 20:29:16

Hello,

> A good, generic transliteration library for the Indian
> languages/scripts is what is needed, IMHO. As anyone
	So, what kind of a standard are we planning to
use for the transliteration itself?


> In a web-based client-server model of app development,
> unlike Latin scripts/languages, it would be more
> appropriate to do input processing, transliteration
> and glyph composition for rendering on the server
	True in the current scenario, but why in the
long run?? We can create engines to do this, something
like say-flash - If you want to view falsh based content,
you need flash to be on your machine (client side) -
similar way - if you need to view Devanagari text -
install our XYZ (preferably opensource) package, and any
website that conforms to these standards will be able to
display Devanagari to you. This will allow -
A central body to easily make modifications to how the software
behaves, instead of individual websites having arbitrary
control.
	A (ServerSide) JScript + (ClientSide)Java based solution might
allow us to come up with a quick fix to do something like this. And at a
subsequent time, we can do away with the quick fix and develop our own
packages to deal with any kind of character rendering or input schemes.
(option buttons, password entry, input boxes etc.)

> side. So, using Javascript may not be that feasible to
> achieve this. At the same time, if all processing is
> on the server side, then the application can't be
> really that interactive (such as showing the display
> to the user for each syllable typed in). Some

		This is exactly what I am hinting at -
It might be difficult to get instant results in the short
term - but in the long term, a client side solution would
help.

> intermediate soluton would be needed. Also, using PHP
> may be a better idea than Javascripts or jsp. What do
> you folks think ?
> A couple of weeks back, I made an enhancement to my
> transliteration library (translib under the
> indic-computing project on sourceforge) to take a
> 'word' in an Indian language encoded as a sequence of
> Unicode characters (in UTF-8 format), kind of map it
> (using a user-defined lookup mapping file) to the most
> appropriate Roman phonetic input and then apply the
> transliteration rules for that language+script by
> looking up the transliteration rule file. The output,
> as before, is a sequence of symbolic glyph names that
> correspond to glyph indices in a given font file. This
> can be fed to a font reading and rendering library
> such as freetype2 for final display (Koshy wrote a
> python script to do this using gozer, but he is now
> replacing gozer with a python wrapper to ft2).

	I have done the transliteration to unicode part
for devanagari - for a tool we use for our research work.
If someone sheds light on the transliteration scheme
being used, I could help.

> I tried out this utf8-to-final-glyph rendering for
> Hindi+Devanagari with very minimal mapping done and
> did some prelim testing and it's ok. All the
> intelligence is in the user-defined mapping files and
> the source code itself has no knowledge of any Indian
> language.
	Yes, I agree....

> Unicode is neither an input mapping scheme nor a glyph
> mapping scheme; it's just an encoding scheme, as all
> of us know. It has limitations, but with a sound
> transliteration library in place, utf-8 can be used
> for
> storing Indian language content for further processing
> (search, sort, display etc etc).
	We are using unicode encoding for such purposes and
I am in the process of developing a query and retrieval
system using the same scheme (for the tool mentioned
earlier)

	However, There could be some misinterpretations
on my part, I think I need to brush up on the
activities in our group. I have been working on Indic
Computing for a short while, but I do not know where
to get technical-info about the various ideas/projects that
everyone is posting here - If there is a web-site(s) from
where I can get more info, it would be gr8. I did not
notice info about the translib project at -
http://indic-computing.sourceforge.net/projects/index.html

	And, is there a pdf verion of the handbook?
http://indic-computing.sourceforge.net/handbook/index.html
This site had some circular links - but I did not find the
pdf version.
Thanks,
Surya
  __       __
 |__      |__|    |/
  __|urya |rakash |\ompalli

Email - kom...@cs...
Phone - Home    - (716) 834 6859
      - Work    - (716) 645 6164 X 519
Personal 	- http://www.cse.buffalo.edu/~kompalli
Academic 	- http://www.cedar.buffalo.edu/ilt

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Krishnamurthy N. <kn...@ya...> - 2003-05-13 08:30:50

Hi Surya,

--- Suryaprakash Kompalli <kom...@ce...>
wrote:
> Hello,
> 
> 
> So, what kind of a standard are we planning to
> use for the transliteration itself?

Well, we need not really get into standards for
transliteration - it can easily be made generic enough

and modifiable by either end-user or other developers.


>True in the current scenario, but why in the
> long run?? We can create engines to do this,
> something
> like say-flash - If you want to view falsh based
> content,
> you need flash to be on your machine (client side) -
> similar way - if you need to view Devanagari text -
> install our XYZ (preferably opensource) package, and
> any
> website that conforms to these standards will be
> able to
> display Devanagari to you. This will allow -
> A central body to easily make modifications to how
> the software
> behaves, instead of individual websites having
> arbitrary
> control.
> 	A (ServerSide) JScript + (ClientSide)Java based
> solution might
> allow us to come up with a quick fix to do something
> like this. And at a
> subsequent time, we can do away with the quick fix
> and develop our own
> packages to deal with any kind of character
> rendering or input schemes.
> (option buttons, password entry, input boxes etc.)

Yes, I agree that we can create engines that can be
downloaded by a client and can be centrally
controlled.
This would also make the apps as interactive as
possible.

> 		This is exactly what I am hinting at -
> It might be difficult to get instant results in the
> short
> term - but in the long term, a client side solution
> would
> help.

Yes.


> I have done the transliteration to unicode part
> for devanagari - for a tool we use for our research
> work.
> If someone sheds light on the transliteration scheme
> being used, I could help.
 
Pls see the reply further down in this mail.


> 	We are using unicode encoding for such purposes and
> I am in the process of developing a query and
> retrieval
> system using the same scheme (for the tool mentioned
> earlier)
> 
> 	However, There could be some misinterpretations
> on my part, I think I need to brush up on the
> activities in our group. I have been working on
> Indic
> Computing for a short while, but I do not know where
> to get technical-info about the various
> ideas/projects that
> everyone is posting here - If there is a web-site(s)
> from
> where I can get more info, it would be gr8. I did
> not
> notice info about the translib project at -
>
http://indic-computing.sourceforge.net/projects/index.html

Since it's still work-in-progress, we haven't put up a
downloadable tarball. You can check out, as anon user,
the entire indic-computing project CVS tree from the 
cvs page of this project
(http://sourceforge.net/cvs/?group_id=38057) or
navigate thru the main page and reach at the same page
from the 'Sources' sub-section. Pls go thru the
documentation (and source) of translib and other
projects and let's know your comments.

> 	And, is there a pdf verion of the handbook?
>
http://indic-computing.sourceforge.net/handbook/index.html

I suggest you download the doc-tool-chain tarball and
run the docbook to get a single document in your
preferred format.

Thanks for your interest!

cheers,
Nagarajan


__________________________________
Do you Yahoo!?
The New Yahoo! Search - Faster. Easier. Bingo.
http://search.yahoo.com

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Tapan S. P. <ta...@ya...> - 2003-05-12 21:02:48

> 
> In a web-based client-server model of app development,
> unlike Latin scripts/languages, it would be more
> appropriate to do input processing, transliteration
> and glyph composition for rendering on the server
> side. 

This is debatable (and has been debated).  Doing glyph composition on
the server will preclude user-installed fonts on the client with
variable glyph sets.  Also, depending on how you define input
processing, it might also preclude different types of keyboard mappings
(and input methods) on the client side.

> A couple of weeks back, I made an enhancement to my
> transliteration library (translib under the
> indic-computing project on sourceforge) to take a
> 'word' in an Indian language encoded as a sequence of
> Unicode characters (in UTF-8 format), kind of map it 
> (using a user-defined lookup mapping file) to the most
> appropriate Roman phonetic input and then apply the
> transliteration rules for that language+script by
> looking up the transliteration rule file. The output,
> as before, is a sequence of symbolic glyph names that
> correspond to glyph indices in a given font file. This
> can be fed to a font reading and rendering library
> such as freetype2 for final display (Koshy wrote a
> python script to do this using gozer, but he is now
> replacing gozer with a python wrapper to ft2).
> 

Part of this seems like a worthy effort, but the need for the
transliteration is not clear to me.  Why is it required, what purpose
does the romanized text serve?  Why not just go from UTF-8 directly to
the glyph names?

Its seems that you are proposing now three standards - Unicode, the
transliteration standard, and the glyph standard (for storage,
transmission and display respectively).  It is unclear to me why the
storage and transmission standards have to be different (in fact they
already are - UCS-2 vs. UTF-8).  It has been hard enough to get 1
standard in place.  Why shoot for two more now, when only one is
required (for glyphs)?

Also, unless the glyph names are somehow "abstract", and do not stand
for direct 1-1 mappings to actual glyph indices, this precludes variable
glyph sets which would be a requirement for different quality (and cost)
fonts and advanced typography.


> 
> Unicode is neither an input mapping scheme nor a glyph
> mapping scheme; it's just an encoding scheme, as all
> of us know. It has limitations, but with a sound
> transliteration library in place, utf-8 can be used
> for 
> storing Indian language content for further processing
> (search, sort, display etc etc).
> 

It seems to me you are proposing an alternative to OTF, which MAY be a
worthy effort if the IPR issues remain muddy w/ that technology.  But
you seem to be doing that at the expense of some of the complexity and
expressivity of the OTF standard.  IMHO that may not be a wise long term
compromise.  Am I missing the point?

Any other views?

-- Tapan

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Krishnamurthy N. <kn...@ya...> - 2003-05-13 06:06:56

Hi Tapan,

--- "Tapan S. Parikh" <ta...@ya...> wrote:

> This is debatable (and has been debated).  Doing
> glyph composition on
> the server will preclude user-installed fonts on the
> client with
> variable glyph sets.  Also, depending on how you
> define input
> processing, it might also preclude different types
> of keyboard mappings
> (and input methods) on the client side.

Yes, it's debatable for more reasons than this. First,
as I pointed out and also by Suryaprakash, it can't
really be very interactive on the client-side. 

About user-installed fonts on client side : as we all
know, it's not just the fonts that are enough to do
rendering for Indian languages - you need to do
process the input key sequence according to certain
rules, and do or don't do translitration and do the
appropriate glyph mapping for 'a given' font, and not
any and every font. We all know the issue of lack of
any standard for Indian scripts. I guess until we have
any standards for Indian script fonts, we can't give
full freedom on the client side to choose any font.

> Part of this seems like a worthy effort, but the
> need for the
> transliteration is not clear to me.  Why is it
> required, what purpose
> does the romanized text serve?  Why not just go from
> UTF-8 directly to the glyph names?

Direct mapping of input sequence, whether in utf-8 or 
Romain phonetic, to an appropriate sequence of glyphs 
is not feasible for any of the Indian languages, due
to the various complexities related to all kinds of
contextual details. As I developed the generic
transliteration rules framework that would work for
any Indian language/script, this had become quite
clear to me. Why utf-8 to romanized text : this was
more convenient to do this and apply the generic
transliteration rules that I developed.

> 
> Its seems that you are proposing now three standards
> - Unicode, the
> transliteration standard, and the glyph standard
> (for storage,
> transmission and display respectively).  It is
> unclear to me why the
> storage and transmission standards have to be
> different (in fact they
> already are - UCS-2 vs. UTF-8).  It has been hard
> enough to get 1
> standard in place.  Why shoot for two more now, when
> only one is
> required (for glyphs)?

No, I am not proposing any new standards. If you have
seen the sample fontmaps that give 'symbolic' names to

glyphs or cluster of glyphs in a font, and the generic
mappings I have given to map the vowels, consonants
and other entities of Indian scripts to phonetic keybd
input sequences in the transliteration rule map files,
I left everything open to the end-user or user of
translib. The mappings can freely modified and as long
as what's 'stored' is based on an accepted standard
such as utf8, that is enough.

If at all, we will need a 'minimalistic' glyph
standard 
for each Indian 'script'.

> 
> Also, unless the glyph names are somehow "abstract",
> and do not stand
> for direct 1-1 mappings to actual glyph indices,
> this precludes variable
> glyph sets which would be a requirement for
> different quality (and cost)
> fonts and advanced typography.

Yes, glyph names are 'abstract', as defined in the
fontmap file format by Koshy. The very precise reason
why we developed such abstract glyph names is for
usability across different fonts.

> 
> It seems to me you are proposing an alternative to
> OTF, which MAY be a
> worthy effort if the IPR issues remain muddy w/ that
> technology.  But
> you seem to be doing that at the expense of some of
> the complexity and
> expressivity of the OTF standard.  IMHO that may not
> be a wise long term
> compromise.  Am I missing the point?

It's not that I am proposing an alternative to OTF.
It's just that I see that with a few special mapping
rules for special 'input sequence combinations', we
can use special glyphs or clusters of glyphs for
effective high quality rendering (e.g in the CDAC
Devanagari font, there are four different glyphs for
dependent vowel sign 'i' and it takes just four simple
rules in the translit. rule file (simple text file) to
effectively specify the context where each of these
four different glyphs need to be used).

cheers,
Nagarajan

__________________________________
Do you Yahoo!?
The New Yahoo! Search - Faster. Easier. Bingo.
http://search.yahoo.com

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Tapan S. P. <ta...@ya...> - 2003-05-05 04:38:22

I certainly think this will be of huge value.  Input methods are on the
client have been a huge hurdle and often forces clients to go to Windoze
XP over other platforms like Linux or even (gasp!) Windoze 98.  I didnt
realize this would be possible using javascript, if so I am wondering
why noone thought of it before.

-- Tapan

On 05 May 2003 10:07:54 +0530
Arun M <ar...@fr...> wrote:

> Hi Friends,
> 
>    We see a lot of community portals coming up these days based on
> CMS like *nuke. But one of these CMS supports Indic. Also we dont see
> any community discussion boards in Indian languages (Pls correct me). 
> 
> Some issues I see are:
> 
>  - There is no indic(Unicode/Non unicode) support in most platforms. 
>  - Most browsers doesnt support unicode. 
> 
> Any community portal we build should be based on font encoding
> systems, at least for some more time. A idea is store
> the data in Unicode and then convert to font encoding at the server
> side.  A special proxy should help here.
> 
> Second issues is entering data from the client side. This is major
> prob. Most of the sites uses Java applets for entering the data in
> local languages. This may require good amount of modification in the
> CMS we have now like *nuke. 
> 
> What about Javascript ?  When user types in a text field using some
> Transliteration scheme or Inscript KeyBoard layout convert it to font
> encoding scheme and show Indic on the fly. Data will be posted to CMS
> based on unicode or font encoding scheme. Theme change should be
> sufficiant get *nukes support Indic(I've not looked into *nuke code in
> detail. Just a guess) 
> 
> Yesterday I made a proto of this. A crude one. It works. 
> 
> Do you think this will be of use ? If so I will work on it and
> make the code better and generic(I dont know Javascript much , will
> have to learn it first). 
> 
> 
> Arun.
> 
> 
>

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Guntupalli K. <kar...@fr...> - 2003-05-05 11:27:37

On Sun, 4 May 2003 21:39:21 -0700
"Tapan S. Parikh" <ta...@ya...> wrote:

> 
> I certainly think this will be of huge value.  Input methods are on
> the client have been a huge hurdle and often forces clients to go to
> Windoze XP over other platforms like Linux or even (gasp!) Windoze
> 98.  I didnt realize this would be possible using javascript, if so
> I am wondering why noone thought of it before.
> 
 Sites like epatra.com have used it before, but they used to send the
text to the server to get the glyph codes (they couldnt have it all in
JS else nyone else would have easily copied it for own sites ).
 There are some convertors from ISCII -> ISFOC , shusha etc, which can
be used. 
 See http://www.iiit.net/ltrc/FC-1.0/fc.html

Regards,
Karunakar

-- 

"Twenty years from now you will be more disappointed by the things
that
you didn't do than by the ones you did do. So throw off the bowlines.
Sail away from the safe harbour. Catch the trade winds in your sails.
Explore. Dream.Discover."  			- Mark Twain

---------------------------
* Indian Linux project    *
* http://www.indlinux.org *
---------------------------

[Indic-computing-devel] Re: javascript indic renderer and community portals

From: <al...@ya...> - 2003-05-05 12:29:29

> "Tapan S. Parikh" <ta...@ya...> wrote:
> > 98.  I didnt realize this would be possible using javascript, if so
> > I am wondering why noone thought of it before.
> > 
>  Sites like epatra.com have used it before, but they used to send the
> text to the server to get the glyph codes (they couldnt have it all in
> JS else nyone else would have easily copied it for own sites ).

Aha. Karunakar hit the nail on the head. No one *wanted* to think of using
Javascript before because they were thinking commercial, and they didn't want
their code to be copy-able. 

=====
Contribute to http://indic-computing.sourceforge.net
+91-80-653-8200
http://groups.yahoo.com/group/linux-bangalore-hindi/
Can't see Hindi? http://geocities.com/alkuma/seehindi.html
http://9211.blogspot.com Set View->Encoding->UTF-8
à¤à¤ªà¤à¥ à¤¸à¥à¤à¤¾à¤µ à¤¶à¥à¤à¥à¤°à¤¤à¤¾ à¤¸à¥ à¤¤à¤à¥ à¤¸à¥à¤µà¥à¤à¤¾à¤° à¤¹à¥à¤à¤à¥ à¤à¥ à¤à¤¬ à¤à¤ª à¤à¤¨à¥à¤¹à¥à¤ à¤²à¤¾à¤à¥ à¤à¤°à¤¨à¥ à¤à¤¾ à¤¬à¥à¤¡à¤¼à¤¾ à¤à¥ à¤¸à¥à¤µà¤¯à¤ à¤à¤ à¤¾à¤¯à¥à¤à¤à¥à¥¤

________________________________________________________________________
Missed your favourite TV serial last night? Try the new, Yahoo! TV.
       visit http://in.tv.yahoo.com

Re: [Indic-computing-devel] Re: javascript indic renderer and community portals

From: Arun M <ar...@gn...> - 2003-05-06 03:19:06

> Aha. Karunakar hit the nail on the head. No one *wanted* to think of using
> Javascript before because they were thinking commercial, and they didn't want
> their code to be copy-able. 


I dont take that reason completely. 

1. The problem we are trying  to address here is some thing different 
   from what Indic sites tries to solves with Java. 
   (Java solution has some advantages in comparison with Javascript
solution)

2. Also may be Javascript wasnt as powerful as it is now, when these 
   Java solutions came up. And then people simply followed the Java.
   (Dont have much idea abt history of web technologies)

Re: [Indic-computing-devel] Re: javascript indic renderer and community portals

From: Dr. U.B. P. <pav...@vi...> - 2003-05-06 04:28:51

> 2. Also may be Javascript wasnt as powerful as it is now, when these 
>    Java solutions came up. And then people simply followed the Java.
>    (Dont have much idea abt history of web technologies) 

As per my knowledge, Javascript can trap and handle ASCII values 
of the trapped keys and not keyboard scancodes (check 
http://javascript.internet.com/forms/block-key-press.html). To 
write a keyboard handler, we need to trap the keyboard 
scnacodes.

-Pavanaja
-----------------------------------------------------
Dr. U.B. Pavanaja
Editor, Vishva Kannada
World's first Internet magazine in Kannada
http://www.vishvakannada.com/

Note: I don't worry about pselling mixtakes

Re: [Indic-computing-devel] javascript indic renderer and community portals

From: Arun M <ar...@fr...> - 2003-05-06 03:08:54

> I certainly think this will be of huge value.  Input methods are on the
> client have been a huge hurdle and often forces clients to go to Windoze

So, I've to add one more item to todo list. 

> XP over other platforms like Linux or even (gasp!) Windoze 98.  I didnt
> realize this would be possible using javascript, if so I am wondering
> why noone thought of it before.

may be because it is too simple ;)

arun.