Thread: Re: [Indic-computing-users] [Indlinux-group] Fw: Announcement - Beejnaagri and BeejU8
From: Krishna M. G. <gk...@gm...> - 2006-08-13 18:43:05

Hello all,

Please be aware that unicode has a compression scheme that supports
dynamic windowing among others to get rid of the overhead due to UTF-8
encoding. Please refer to "A Standard Compression Scheme for Unicode"
at http://www.unicode.org/reports/tr6/

IMHO, this is sufficient for most purposes. But if one needs further
compression the proposed beejnagari scheme is pretty simple (you can
think of it as a simpler version of LZW). I suggest that ZWNJ and ZWJ
be made part of beejnagari rather than escaping to unicode to
represent them.

regards,
Krishna Mohan.

On 8/13/06, Guntupalli Karunakar <kar...@in...> wrote:
>
> Begin forwarded message:
>
> Date: Sun, 13 Aug 2006 20:46:27 +0530
> From: Sandeep Gupta <san...@gm...>
> To: undisclosed-recipients: ;
> Subject: Announcement - Beejnaagri and BeejU8
>
> [Please forward this message to relevant mailing lists]
>
> [Hindi version of the announcement, titled "A step towards digital
> independence"; the English version follows.]
>
> _A step towards Digital Independence_
>
> Announcing the release of Beejnaagri and BeejU8 on the occasion of
> 59th Independence day.
>
> Beejnaagri is a Unicode Transformation Format (UTF) for Devanagari.
> It is a 12-bit encoding technique which encodes akshars rather than
> swar, vyanjan and maatra separately.
>
> Beejnaagri is expected to compress Devanagari text by 50-60% over
> UTF-8. A similar technique can be applied to other Indic scripts.
>
> BeejU8 v0.1 demonstrates the conversion between Beejnaagri and UTF-8.
> It requires a *nix environment. It has been tested on Ubuntu 6.06 with
> very good results.
>
> Please visit http://beejnaagri.sourceforge.net/ . It would be highly
> appreciated if you can comment on Beejnaagri and test BeejU8 on your
> systems and provide your valuable feedback.
>
> Sandeep Gupta
>
> --
>
> * Work: http://www.indlinux.org
> * Blog: http://cartoonsoft.com/blog
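
The "dynamic windowing" idea mentioned in this message can be illustrated with a small Python sketch. It is not a conforming SCSU encoder: it only handles printable ASCII plus one 128-code-point window, and the window base `0x0900` and the one-byte select tag `0x14` are assumptions that should be checked against the report at http://www.unicode.org/reports/tr6/. The function name `windowed_encode` is hypothetical.

```python
# Simplified single-window sketch of the windowing idea; NOT a full SCSU encoder.
DEVANAGARI_WINDOW = 0x0900  # assumed base of the 128-code-point window
SELECT_WINDOW_TAG = 0x14    # assumed one-byte tag that selects that window


def windowed_encode(text):
    """Encode printable ASCII and Devanagari as one byte per code point."""
    out = bytearray([SELECT_WINDOW_TAG])
    for ch in text:
        cp = ord(ch)
        if 0x20 <= cp < 0x80:
            # Printable ASCII passes through unchanged.
            out.append(cp)
        elif DEVANAGARI_WINDOW <= cp < DEVANAGARI_WINDOW + 0x80:
            # In-window characters become 0x80 plus their offset in the window.
            out.append(0x80 + cp - DEVANAGARI_WINDOW)
        else:
            raise ValueError("outside the single window handled by this sketch")
    return bytes(out)


sample = "\u092c\u0940\u091c"  # three Devanagari code points
print(len(sample.encode("utf-8")), len(windowed_encode(sample)))
```

For this three-code-point sample, UTF-8 needs three bytes per code point, while the windowed form needs one tag byte plus one byte per code point, which is the overhead the message refers to.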
From: Sandeep G. <san...@gm...> - 2006-08-14 15:36:08

Krishna Mohan Gundu wrote:
> Hello all,
>
> Please be aware that unicode has a compression scheme that supports
> dynamic windowing among others to get rid of the overhead due to UTF-8
> encoding. Please refer to "A Standard Compression Scheme for Unicode"
> at http://www.unicode.org/reports/tr6/
>
> IMHO, this is sufficient for most purposes. But if one needs further
> compression the proposed beejnagari scheme is pretty simple (you can
> think of it as a simpler version of LZW). I suggest that ZWNJ and ZWJ
> be made part of beejnagari rather than escaping to unicode to
> represent them.
>
> regards,
> Krishna Mohan.
>

Thanks Krishna.

I had made Beejnaagri not only keeping compression in mind but also a
TTS. I believe it would be easy to design a TTS from scratch for a
beejnaagri encoded text because it is `natural' and conforms to the
phonetic nature of Indic scripts.

Please comment.

Also, I would include ZWNJ and ZWJ as part of Beejnaagri.

Best regards
Sandeep
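
The size claim in the announcement (12 bits per akshar against UTF-8) can be made concrete with a rough Python sketch. The segmentation rule below is a deliberate simplification and an assumption, not the Beejnaagri algorithm: a combining mark attaches to the preceding unit, and a letter that follows a virama joins the same cluster. The helper names `split_akshars` and `size_in_bits` are hypothetical.

```python
import unicodedata

VIRAMA = "\u094d"  # Devanagari sign virama (halant)


def split_akshars(text):
    """Group code points into approximate akshars (simplified rule)."""
    akshars = []
    for ch in text:
        category = unicodedata.category(ch)
        is_mark = category in ("Mn", "Mc")  # matras, virama, anusvara, ...
        after_virama = bool(akshars) and akshars[-1].endswith(VIRAMA)
        if akshars and (is_mark or (after_virama and category == "Lo")):
            # Marks and conjunct consonants extend the current akshar.
            akshars[-1] += ch
        else:
            akshars.append(ch)
    return akshars


def size_in_bits(text):
    """Return (UTF-8 bits, bits at an assumed flat 12 bits per akshar)."""
    utf8_bits = 8 * len(text.encode("utf-8"))
    akshar_bits = 12 * len(split_akshars(text))
    return utf8_bits, akshar_bits


# "Beejnaagri" in Devanagari, written with escapes: 8 code points, 5 akshars.
word = "\u092c\u0940\u091c\u0928\u093e\u0917\u0930\u0940"
print(split_akshars(word))
print(size_in_bits(word))
```

A single word says little about the ratio for running text, since spaces, punctuation, and any escapes to Unicode also cost bits. The sketch only shows where the saving comes from: one fixed-width code per akshar instead of three UTF-8 bytes per code point.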
From: Krishna M. G. <gk...@gm...> - 2006-08-21 10:05:03

Hello Sandeep,

> I had made Beejnaagri not only keeping compression in mind but also a
> TTS. I believe it would be easy to design a TTS from scratch for a
> beejnaagri encoded text because it is `natural' and conforms to the
> phonetic nature of Indic scripts.
> Please comment.

I am not familiar with speech synthesis. A friend and I did play with
voice samples, but we did not find time to dig deeper. There are
different synthesis methods, and maybe this particular representation
helps in some of them. Frankly, I don't know. Maybe Chaitanya
Kamisetty will voice his opinion.

I will say this much. If Beejnaagri has no significant advantage over
standardized Unicode text representation schemes, it is better used as
an internal representation (perhaps by piping Unicode text through a
beejnaagri module before sending it to the TTS).

regards,
Krishna Mohan.