From: Kent K. <ken...@te...> - 2010-10-21 19:32:39
|
See UTN 34, http://www.unicode.org/notes/tn34, in particular section 2.5 but regarding SPACE as field separator. The UTN does not give any code, it just describes the principles for handling certain collation issues. In terms of keys for dealing with fields, and considering only two levels (for simplicity in this email), the key for a two-field item would be constructed as follows: field1-level1, field1-level2, 0, field2-level1, field2-level2 This way level 2 of the first field is of more weight than any level of field 2. /Kent K Den 2010-10-21 20:28, skrev "Ken Zook" <ken...@si...>: > Is it possible to use ICU rules to sort Vietnamese as follows, and if so how? > > ca > ca sĩ > cà phê > cả > calo > cam > cam kết > cảm > > We need to use secondary sorting for the diacritics, but it needs to sort > everything up to the first space first, then sort any remaining characters up > to the next space, etc. > > The rules for the vi locale are > [normalization on ] > > &̀<<̉<<̃<<́<<̣ > &a<ă<<<Ă<â<<<Â > &d<đ<<<Đ > &e<ê<<<Ê > &o<ô<<<Ô<ơ<<<Ơ > &u<ư<<<Ư > > Unfortunately, they do not give the above order. > > Thanks for any help you can give, > Ken > > > ------------------------------------------------------------------------------ > Nokia and AT&T present the 2010 Calling All Innovators-North America contest > Create new apps & games for the Nokia N8 for consumers in U.S. and Canada > $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing > Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store > http://p.sf.net/sfu/nokia-dev2dev > > _______________________________________________ > icu-support mailing list - icu...@li... > To Un/Subscribe: https://lists.sourceforge.net/lists/listinfo/icu-support |