Re: [Kanji-database-contact] http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Mitrophan Chin wrote:
> Thanks. The link to the variant dictionary in Taiwan was helpful. Are there any plans to put any or all those variants from that website into Unicode Ideographic Variation Database or integrate the variant mappings with http://www.unicode.org/charts/unihan.html ?

Yes. At last IRG#39 (on November, Hanoi),
TCA has submitted 4000 characters to Ext. F that are taken from
the variant dictionary and scheduled to be coded in CNS 11643,
but, their submission was without the printing typeface fonts, and without IDS.
As a result, all TCA submission was dropped :-(
It was a pity.

However, even if they are coded in ISO/IEC 10646, I'm suspicious
whether the variant dictionary website will have better interface
to lookup the variants from UCS codepoints.

> 
> In my evidence the character also appears as part of 錫⿰口芉汶, which is the name of the Prophet Simeon (Συμεών in Greek/Симеон in Cyrillic) who in his elderly age was awaiting to meet the small child Jesus being presented at the Temple (cf. Luke 2:25). So based on the pronunciation of the name Simeon, I think ⿰口芉 can be unified with 哶. So if we want to display 哶 with the glyph rendered from ⿰口芉, is that where IVD plays that role and is it supported by major web browsers?

I think using IVS is better than the separated encoding of the
character (because of same pronunciation). Of course, UTC can
propose to encode it to CJK Compatibility Ideographs, but it
would not be welcomed much because compatibility ideograph block is
now recognized as a compatibility with existing legacy encoding
in existing systems.

You may ask "how to register ⿰口芉 to IVS?". At present, I don't
have good idea on proper process. The original genuine process was
the registration with fee, but recently the registration without
fee is discussed.

I will try to find ⿰口芉 in Taisho Tripitaka and ask SAT experts
whether they are interested in the registration of ⿰口芉 glyph shape
in IVD, as an alternate form for U+54F6. If they are interested in,
you would not have to do anything :-)

Regards,
mpsuzuki

> 
> 
> 
> 
> ________________________________
>  From: suzuki toshiya <mps...@hi...>
> To: Mitrophan Chin <stm...@ya...> 
> Cc: "kan...@li..." <kan...@li...>; Yuri Shardt <yur...@gm...> 
> Sent: Thursday, January 3, 2013 10:18 AM
> Subject: Re: http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf
>  
> Hi,
> 
> Sorry for lazy response!
> 
> Talking about the glyph shapes between 哶(U+54F6) and CB02378/HZK02-D0A1,
> the info that CHISE provides is only where CHISE picked it from, so it's
> not fruitful to discuss whether they are same or not.
> 
> The right component of CB02378/HZK02-D0A1 is looking like as if it were
> 芉(U+8289) (ah, CHISE's IDS does so), a composition of "grass" (upper) +
> sound "gan" (or "kan") (lower). But, the right component of 哶(U+54F6) is slightly
> different; it is an alternate form of sheep "羊". In fact, the sound of
> 哶(U+54F6) is "mie" or "me", different from the sound of 芉(U+8289).
> For further information of the background of U+54F6,
>     http://dict.variants.moe.edu.tw/yitia/fra/fra03217.htm
> would be helpful.
> 
> I don't know how many IRG experts are aware of the semantic & phonetic
> difference between the right component of 芉(U+8289) and 哶(U+54F6), but,
> if you propose the character with only the scanned image evidence,
> somebody will ask whether it could not be unified with U+54F6.
> Thus, it is expected to find the meaning or the pronunciation of it.
> CHISE is not helpful at all for such purpose. Is it possible to identify
> the pronunciation of the character in your evidence? If it is NOT mie/me,
> the pronunciation difference could be a reason to encode it separately,
> if you want to encode it separately.
> 
> # I guess, CB02378 might have been picked by CBETA project from some
> # Buddhist sutra or Taisho Tripitaka, so, it could be a poorly typecasted
> # result of U+54F6. Anyway, CHISE does not identify the character in *your* evidence.
> 
> Regards,
> mpsuzuki
> 
> Mitrophan Chin wrote:
>> Suzuki,
>>
>> Do you know if this characterhttp://www.chise.org/chisewiki/view.cgi?char=&CB02378; which also looks like http://www.chise.org/chisewiki/view.cgi?char=&HZK02-D0A1; if they areunified or same kanji with 哶 or will they be separately encoded by IRG or Unicode? 
>> If they are not unifiable and have not already been submitted to IRG or UTC already
>>   to be encoded, I would like include it for Russian Mission character submission to UTC as well for proper name transliteration appearing on http://orthodox.cn/liturgical/festalmenaion/1884/0202meetinglord/jpg/0202-01.jpg
>>
>> -Mitrophan
>>
>>
>>
>> ________________________________
>>   From: suzuki toshiya <mps...@hi...>
>> To: Mitrophan Chin <stm...@ya...> Cc: "kan...@li..." <kan...@li...> Sent: Monday, December 17, 2012 8:19 AM
>> Subject: Re: http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf
>>   
>>> Attached is some documented example evidence of the use of the un-coded characters.
>>> They were specifically introduced by the Russian Orthodox Mission in China at the
>>> turn of the 19th century to assist in transliterating Slavonic names into Chinese.
>> Quite interesting! When I saw your IDSes, my impression was "these are something
>> like Vietnamese ChuNom, or, the pronunciation transliterating characters in old
>> Bhuddist texts, because, it's difficult to distinguish the meaning and pronunciation
>> components". It's reasonable to hear that they are for the transliteration of
>> Slavonic names. But, seeing "利爾" and "羅爾" examples, I'm not sure why "爾" was
>> added. "利" already instructs a pronunciation "ri", and "羅" already instructs "ro"
>> (or "ra", in Japan), so why "爾" is needed?
>>
>> I guess most easiest way to add your characters to next CJK Ext. G would be the
>> submission via UTC. So... you will be needed to write some documents to propose
>> the inclusion of your characters to UTC's CJK Ext. G submission. Jenkins already
>> posted the short how-to in the forum.
>>
>> Now CJK Ext. F is just beginning, so, it would not be impossible for Russia to
>> participate IRG and submit something to CJK Ext. G, but I'm afraid that Russian
>> national body is not active participants in face to face meeting of ISO/IEC
>> JTC1/SC2, so making Russian national body submit your characters would not be
>> so easy.
>>
>> Regards,
>> mpsuzuki