From: Aleksey C. <vl...@gm...> - 2003-12-15 13:50:11
Michael Bunk <bu...@im...> writes:

> Why we can't use "C" locale, ie. byte order?

The .index file for utf-8 dictionaries is sorted by "sort" using C locale,
i.e. byte order.

> I think it would be
> much easier. Do we need some utf8 locale to correctly identify &
> remove punctuation characters from queries & indices?

Yes. This is what iswspace and iswalnum functions are needed for.

-- 
Best regards, Aleksey Cheusov.
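To illustrate the two points in this reply, here is a minimal sketch in C. It is not code from the project: the names keep_alnum_space and byte_order_cmp are hypothetical and used only for illustration. The first function drops everything that iswalnum and iswspace do not accept, which is one way punctuation could be removed from a query or index entry. The second compares two strings byte by byte, which is the order "sort" produces under the C locale.

```c
#include <stddef.h>
#include <string.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical helper (not from the original code base): copy `in` to
 * `out`, keeping only characters that the current LC_CTYPE locale
 * classifies as alphanumeric or whitespace. `cap` is the capacity of
 * `out` in wide characters, including the terminator. Returns the
 * number of wide characters written, not counting the terminator. */
size_t keep_alnum_space(const wchar_t *in, wchar_t *out, size_t cap)
{
    size_t n = 0;

    if (cap == 0)
        return 0;

    for (; *in != L'\0' && n + 1 < cap; ++in) {
        if (iswalnum((wint_t)*in) || iswspace((wint_t)*in))
            out[n++] = *in;
    }
    out[n] = L'\0';
    return n;
}

/* Hypothetical helper: byte-order comparison. strcmp compares the
 * strings as sequences of unsigned char, so multi-byte UTF-8
 * sequences sort after all ASCII characters. */
int byte_order_cmp(const char *a, const char *b)
{
    return strcmp(a, b);
}
```

For non-ASCII input the classification depends on the locale, so a caller would presumably select a UTF-8 locale with setlocale(LC_CTYPE, ...) and convert the UTF-8 bytes to wide characters (for example with mbstowcs) before calling keep_alnum_space. The sort order itself needs no locale at all, since it is plain byte order.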