From: Ram V. <ra...@jt...> - 2003-03-28 20:42:39
Hi,

There are a number of ways to encode graphically similar characters in Unicode. Normalization is a process defined by The Unicode Consortium for converting such graphically similar characters to a unique equivalent form. For example:

    DEVANAGARI QA == DEVANAGARI KA + NUKTA
    ANGSTROM == A + COMBINING RING

For more information please see
http://oss.software.ibm.com/icu/userguide/normalization.html

For the technical report on Normalization please see:
http://www.unicode.org/unicode/reports/tr15/

AFAIK MLang does not support Normalization. There is a Windows API, FoldString (and FoldStringW), that performs operations similar to some forms of Normalization, but it is not fully conformant to the Normalization algorithm as defined by The Unicode Consortium. Please see:
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/winui/WinUI/WindowsUserInterface/Resources/Strings/StringReference/StringFunctions/FoldString.asp

Regards,

Ram Viswanadha

----- Original Message -----
From: "Sundarajan Santhanam" <ssa...@bl...>
To: <icu...@os...>
Sent: Friday, March 28, 2003 8:16 AM
Subject: Normailization in ICU

> Hi
>
> Can someone explain what the Normalization process is in ICU and whether other Unicode components (like MLang, Rosette) support this feature? I have a basic understanding of what it means after reading the help pages in ICU but would like more info.
>
> Thanks
>
> Sundar
> _______________________________________________
> icu...@os... - icu4c-support mailing list
> To Un/Subscribe:
> http://oss.software.ibm.com/developerworks/oss/mailman/listinfo/icu4c-support
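
The two equivalences given in the reply can be checked with a few lines of code. This is only a rough sketch, and it assumes Python's standard unicodedata module rather than ICU itself; the helper name canonically_equivalent and the constant names are made up for illustration. It compares the strings after converting both to NFD, one of the normalization forms described in the technical report.

```python
import unicodedata


def canonically_equivalent(a: str, b: str) -> bool:
    """Hypothetical helper: True if a and b have the same NFD form."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


# The two examples from the message, written as code points.
DEVANAGARI_QA = "\u0958"   # DEVANAGARI LETTER QA
KA_NUKTA = "\u0915\u093c"  # DEVANAGARI LETTER KA + DEVANAGARI SIGN NUKTA
ANGSTROM = "\u212b"        # ANGSTROM SIGN
A_RING = "A\u030a"         # LATIN CAPITAL LETTER A + COMBINING RING ABOVE

for label, x, y in [
    ("QA vs KA + NUKTA", DEVANAGARI_QA, KA_NUKTA),
    ("ANGSTROM vs A + RING", ANGSTROM, A_RING),
]:
    # A plain code point comparison and a comparison after normalization.
    print(label, "| raw equal:", x == y,
          "| equal after NFD:", canonically_equivalent(x, y))
```

In both cases the raw comparison fails while the normalized comparison succeeds, which is the reason to normalize text before comparing or searching it.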