Re: [Kanji-database-contact] http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

I got a response from Professor Kiyonori Nagasaki, the delegate
from SAT to IRG; SAT is going to register a glyph shaped like
⿰口芉 as a variant of U+54F6, although concrete schedule of
the registration is not discussed yet.

Regards,
mpsuzuki

suzuki toshiya wrote:
> It seems that
> http://chise.org/chisewiki/view.cgi?character=rep.cbeta:0x94A
> was taken from Taisho Tripitaka. Please find attached JPEG.
> I will ask whether SAT experts want to register it to IVS.
> 
> Regards,
> mpsuzuki
> 
> suzuki toshiya wrote:
>> Mitrophan Chin wrote:
>>> Thanks. The link to the variant dictionary in Taiwan was helpful. Are 
>>> there any plans to put any or all those variants from that website 
>>> into Unicode Ideographic Variation Database or integrate the variant 
>>> mappings with http://www.unicode.org/charts/unihan.html ?
>>
>> Yes. At last IRG#39 (on November, Hanoi),
>> TCA has submitted 4000 characters to Ext. F that are taken from
>> the variant dictionary and scheduled to be coded in CNS 11643,
>> but, their submission was without the printing typeface fonts, and 
>> without IDS.
>> As a result, all TCA submission was dropped :-(
>> It was a pity.
>>
>> However, even if they are coded in ISO/IEC 10646, I'm suspicious
>> whether the variant dictionary website will have better interface
>> to lookup the variants from UCS codepoints.
>>
>>>
>>> In my evidence the character also appears as part of 錫⿰口芉汶, 
>>> which is the name of the Prophet Simeon (Συμεών in Greek/Симеон in 
>>> Cyrillic) who in his elderly age was awaiting to meet the small child 
>>> Jesus being presented at the Temple (cf. Luke 2:25). So based on the 
>>> pronunciation of the name Simeon, I think ⿰口芉 can be unified with 
>>> 哶. So if we want to display 哶 with the glyph rendered from ⿰口芉, 
>>> is that where IVD plays that role and is it supported by major web 
>>> browsers?
>>
>> I think using IVS is better than the separated encoding of the
>> character (because of same pronunciation). Of course, UTC can
>> propose to encode it to CJK Compatibility Ideographs, but it
>> would not be welcomed much because compatibility ideograph block is
>> now recognized as a compatibility with existing legacy encoding
>> in existing systems.
>>
>> You may ask "how to register ⿰口芉 to IVS?". At present, I don't
>> have good idea on proper process. The original genuine process was
>> the registration with fee, but recently the registration without
>> fee is discussed.
>>
>> I will try to find ⿰口芉 in Taisho Tripitaka and ask SAT experts
>> whether they are interested in the registration of ⿰口芉 glyph shape
>> in IVD, as an alternate form for U+54F6. If they are interested in,
>> you would not have to do anything :-)
>>
>> Regards,
>> mpsuzuki
>>
>>
>>>
>>>
>>>
>>>
>>> ________________________________
>>>  From: suzuki toshiya <mps...@hi...>
>>> To: Mitrophan Chin <stm...@ya...> Cc: 
>>> "kan...@li..." 
>>> <kan...@li...>; Yuri Shardt 
>>> <yur...@gm...> Sent: Thursday, January 3, 2013 10:18 AM
>>> Subject: Re: 
>>> http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf
>>>  
>>> Hi,
>>>
>>> Sorry for lazy response!
>>>
>>> Talking about the glyph shapes between 哶(U+54F6) and 
>>> CB02378/HZK02-D0A1,
>>> the info that CHISE provides is only where CHISE picked it from, so it's
>>> not fruitful to discuss whether they are same or not.
>>>
>>> The right component of CB02378/HZK02-D0A1 is looking like as if it were
>>> 芉(U+8289) (ah, CHISE's IDS does so), a composition of "grass" (upper) +
>>> sound "gan" (or "kan") (lower). But, the right component of 哶(U+ 
>>> 54F6) is slightly
>>> different; it is an alternate form of sheep "羊". In fact, the sound of
>>> 哶(U+54F6) is "mie" or "me", different from the sound of 芉(U+8289).
>>> For further information of the background of U+54F6,
>>>     http://dict.variants.moe.edu.tw/yitia/fra/fra03217.htm
>>> would be helpful.
>>>
>>> I don't know how many IRG experts are aware of the semantic & phonetic
>>> difference between the right component of 芉(U+8289) and 哶(U+54F6), 
>>> but,
>>> if you propose the character with only the scanned image evidence,
>>> somebody will ask whether it could not be unified with U+54F6.
>>> Thus, it is expected to find the meaning or the pronunciation of it.
>>> CHISE is not helpful at all for such purpose. Is it possible to identify
>>> the pronunciation of the character in your evidence? If it is NOT 
>>> mie/me,
>>> the pronunciation difference could be a reason to encode it separately,
>>> if you want to encode it separately.
>>>
>>> # I guess, CB02378 might have been picked by CBETA project from some
>>> # Buddhist sutra or Taisho Tripitaka, so, it could be a poorly 
>>> typecasted
>>> # result of U+54F6. Anyway, CHISE does not identify the character in 
>>> *your* evidence.
>>>
>>> Regards,
>>> mpsuzuki
>>>
>>> Mitrophan Chin wrote:
>>>> Suzuki,
>>>>
>>>> Do you know if this 
>>>> characterhttp://www.chise.org/chisewiki/view.cgi?char=&CB02378; 
>>>> which also looks like 
>>>> http://www.chise.org/chisewiki/view.cgi?char=&HZK02-D0A1; if they 
>>>> areunified or same kanji with 哶 or will they be separately encoded 
>>>> by IRG or Unicode? If they are not unifiable and have not already 
>>>> been submitted to IRG or UTC already
>>>>   to be encoded, I would like include it for Russian Mission 
>>>> character submission to UTC as well for proper name transliteration 
>>>> appearing on 
>>>> http://orthodox.cn/liturgical/festalmenaion/1884/0202meetinglord/jpg/0202-01.jpg 
>>>>
>>>>
>>>> -Mitrophan
>>>>
>>>>
>>>>
>>>> ________________________________
>>>>   From: suzuki toshiya <mps...@hi...>
>>>> To: Mitrophan Chin <stm...@ya...> Cc: 
>>>> "kan...@li..." 
>>>> <kan...@li...> Sent: Monday, 
>>>> December 17, 2012 8:19 AM
>>>> Subject: Re: 
>>>> http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf
>>>>  
>>>>> Attached is some documented example evidence of the use of the 
>>>>> un-coded characters.
>>>>> They were specifically introduced by the Russian Orthodox Mission 
>>>>> in China at the
>>>>> turn of the 19th century to assist in transliterating Slavonic 
>>>>> names into Chinese.
>>>> Quite interesting! When I saw your IDSes, my impression was "these 
>>>> are something
>>>> like Vietnamese ChuNom, or, the pronunciation transliterating 
>>>> characters in old
>>>> Bhuddist texts, because, it's difficult to distinguish the meaning 
>>>> and pronunciation
>>>> components". It's reasonable to hear that they are for the 
>>>> transliteration of
>>>> Slavonic names. But, seeing "利爾" and "羅爾" examples, I'm not sure 
>>>> why "爾" was
>>>> added. "利" already instructs a pronunciation "ri", and "羅" already 
>>>> instructs "ro"
>>>> (or "ra", in Japan), so why "爾" is needed?
>>>>
>>>> I guess most easiest way to add your characters to next CJK Ext. G 
>>>> would be the
>>>> submission via UTC. So... you will be needed to write some documents 
>>>> to propose
>>>> the inclusion of your characters to UTC's CJK Ext. G submission. 
>>>> Jenkins already
>>>> posted the short how-to in the forum.
>>>>
>>>> Now CJK Ext. F is just beginning, so, it would not be impossible for 
>>>> Russia to
>>>> participate IRG and submit something to CJK Ext. G, but I'm afraid 
>>>> that Russian
>>>> national body is not active participants in face to face meeting of 
>>>> ISO/IEC
>>>> JTC1/SC2, so making Russian national body submit your characters 
>>>> would not be
>>>> so easy.
>>>>
>>>> Regards,
>>>> mpsuzuki
>>
>>
> 
>