[sinhala-technical] Sinhala 'Syllables'
Brought to you by:
aratnaweera,
harshula
From: Harshula <har...@gm...> - 2006-11-28 19:58:38
|
Hi, Background ========== Takahashi and I have been discussing [1] what rules the m17n Wijesekera input method should follow. We would appreciate some feedback on letters that define the start of a new 'syllable', thus allowing the input method to empty a buffer containing the previously completed syllable. We need the help of the linguists on this one. The term 'syllable' is used loosely. [1] http://www.lug.lk/lurker/message/20061031.082256.17a60a64.en.html Sinhala letters which define the start of a new 'syllable' ========================================================== 1) All independent vowels (U+0d85 - U+0d96) 2) Kombuva (U+0dd9) - except if preceded by a kombuva. 3) All consonants (U+0d9a - U+0dc6) - except if preceded by kombuva or kombuva deka (U+0ddb) 4) Kunddaliya (U+0df4) 5) All non-Sinhala characters/codepoints - except ZWJ (U+200D) Comments please ... cya, # On Tue, 2006-11-14 at 03:19 +0530, Harshula wrote: <snip> > If we use the two terms 'terminator' and 'non-terminator' that might > make it easier. My current understanding is that all characters entered > whilst creating a syllable are 'non-terminators' and all characters that > can follow a completed syllable are 'terminators'. In which case > anusvaraya (ang) and visargaya would be non-terminators. However, I > suspect terminators and non-terminators are not two disjoint sets. > There's probably an intersection of the two sets. This will require > tracking the context. > > I had not thought through all the possible cases till you mentioned some > of them. For this particular issue, I'd like to get the input of a few > other people. Will get back to you ASAP. Really sorry this reply took so > long. > > cya, > # |