|
From: suzuki t. <mps...@hi...> - 2013-01-03 16:33:05
|
It seems that http://chise.org/chisewiki/view.cgi?character=rep.cbeta:0x94A was taken from Taisho Tripitaka. Please find attached JPEG. I will ask whether SAT experts want to register it to IVS. Regards, mpsuzuki suzuki toshiya wrote: > Mitrophan Chin wrote: >> Thanks. The link to the variant dictionary in Taiwan was helpful. Are >> there any plans to put any or all those variants from that website >> into Unicode Ideographic Variation Database or integrate the variant >> mappings with http://www.unicode.org/charts/unihan.html ? > > Yes. At last IRG#39 (on November, Hanoi), > TCA has submitted 4000 characters to Ext. F that are taken from > the variant dictionary and scheduled to be coded in CNS 11643, > but, their submission was without the printing typeface fonts, and > without IDS. > As a result, all TCA submission was dropped :-( > It was a pity. > > However, even if they are coded in ISO/IEC 10646, I'm suspicious > whether the variant dictionary website will have better interface > to lookup the variants from UCS codepoints. > >> >> In my evidence the character also appears as part of 錫⿰口芉汶, which >> is the name of the Prophet Simeon (Συμεών in Greek/Симеон in Cyrillic) >> who in his elderly age was awaiting to meet the small child Jesus >> being presented at the Temple (cf. Luke 2:25). So based on the >> pronunciation of the name Simeon, I think ⿰口芉 can be unified with >> 哶. So if we want to display 哶 with the glyph rendered from ⿰口芉, >> is that where IVD plays that role and is it supported by major web >> browsers? > > I think using IVS is better than the separated encoding of the > character (because of same pronunciation). Of course, UTC can > propose to encode it to CJK Compatibility Ideographs, but it > would not be welcomed much because compatibility ideograph block is > now recognized as a compatibility with existing legacy encoding > in existing systems. > > You may ask "how to register ⿰口芉 to IVS?". At present, I don't > have good idea on proper process. The original genuine process was > the registration with fee, but recently the registration without > fee is discussed. > > I will try to find ⿰口芉 in Taisho Tripitaka and ask SAT experts > whether they are interested in the registration of ⿰口芉 glyph shape > in IVD, as an alternate form for U+54F6. If they are interested in, > you would not have to do anything :-) > > Regards, > mpsuzuki > > >> >> >> >> >> ________________________________ >> From: suzuki toshiya <mps...@hi...> >> To: Mitrophan Chin <stm...@ya...> Cc: >> "kan...@li..." >> <kan...@li...>; Yuri Shardt >> <yur...@gm...> Sent: Thursday, January 3, 2013 10:18 AM >> Subject: Re: >> http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf >> >> Hi, >> >> Sorry for lazy response! >> >> Talking about the glyph shapes between 哶(U+54F6) and CB02378/HZK02-D0A1, >> the info that CHISE provides is only where CHISE picked it from, so it's >> not fruitful to discuss whether they are same or not. >> >> The right component of CB02378/HZK02-D0A1 is looking like as if it were >> 芉(U+8289) (ah, CHISE's IDS does so), a composition of "grass" (upper) + >> sound "gan" (or "kan") (lower). But, the right component of 哶(U+54F6) >> is slightly >> different; it is an alternate form of sheep "羊". In fact, the sound of >> 哶(U+54F6) is "mie" or "me", different from the sound of 芉(U+8289). >> For further information of the background of U+54F6, >> http://dict.variants.moe.edu.tw/yitia/fra/fra03217.htm >> would be helpful. >> >> I don't know how many IRG experts are aware of the semantic & phonetic >> difference between the right component of 芉(U+8289) and 哶(U+54F6), but, >> if you propose the character with only the scanned image evidence, >> somebody will ask whether it could not be unified with U+54F6. >> Thus, it is expected to find the meaning or the pronunciation of it. >> CHISE is not helpful at all for such purpose. Is it possible to identify >> the pronunciation of the character in your evidence? If it is NOT mie/me, >> the pronunciation difference could be a reason to encode it separately, >> if you want to encode it separately. >> >> # I guess, CB02378 might have been picked by CBETA project from some >> # Buddhist sutra or Taisho Tripitaka, so, it could be a poorly typecasted >> # result of U+54F6. Anyway, CHISE does not identify the character in >> *your* evidence. >> >> Regards, >> mpsuzuki >> >> Mitrophan Chin wrote: >>> Suzuki, >>> >>> Do you know if this >>> characterhttp://www.chise.org/chisewiki/view.cgi?char=&CB02378; which >>> also looks like >>> http://www.chise.org/chisewiki/view.cgi?char=&HZK02-D0A1; if they >>> areunified or same kanji with 哶 or will they be separately encoded >>> by IRG or Unicode? If they are not unifiable and have not already >>> been submitted to IRG or UTC already >>> to be encoded, I would like include it for Russian Mission >>> character submission to UTC as well for proper name transliteration >>> appearing on >>> http://orthodox.cn/liturgical/festalmenaion/1884/0202meetinglord/jpg/0202-01.jpg >>> >>> >>> -Mitrophan >>> >>> >>> >>> ________________________________ >>> From: suzuki toshiya <mps...@hi...> >>> To: Mitrophan Chin <stm...@ya...> Cc: >>> "kan...@li..." >>> <kan...@li...> Sent: Monday, December >>> 17, 2012 8:19 AM >>> Subject: Re: >>> http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRGN1860_parseIDS.pdf >>> >>>> Attached is some documented example evidence of the use of the >>>> un-coded characters. >>>> They were specifically introduced by the Russian Orthodox Mission in >>>> China at the >>>> turn of the 19th century to assist in transliterating Slavonic names >>>> into Chinese. >>> Quite interesting! When I saw your IDSes, my impression was "these >>> are something >>> like Vietnamese ChuNom, or, the pronunciation transliterating >>> characters in old >>> Bhuddist texts, because, it's difficult to distinguish the meaning >>> and pronunciation >>> components". It's reasonable to hear that they are for the >>> transliteration of >>> Slavonic names. But, seeing "利爾" and "羅爾" examples, I'm not sure >>> why "爾" was >>> added. "利" already instructs a pronunciation "ri", and "羅" already >>> instructs "ro" >>> (or "ra", in Japan), so why "爾" is needed? >>> >>> I guess most easiest way to add your characters to next CJK Ext. G >>> would be the >>> submission via UTC. So... you will be needed to write some documents >>> to propose >>> the inclusion of your characters to UTC's CJK Ext. G submission. >>> Jenkins already >>> posted the short how-to in the forum. >>> >>> Now CJK Ext. F is just beginning, so, it would not be impossible for >>> Russia to >>> participate IRG and submit something to CJK Ext. G, but I'm afraid >>> that Russian >>> national body is not active participants in face to face meeting of >>> ISO/IEC >>> JTC1/SC2, so making Russian national body submit your characters >>> would not be >>> so easy. >>> >>> Regards, >>> mpsuzuki > > |