From: David H. <df...@gm...> - 2016-01-09 17:02:50
Sorting a set of UTF-8 encoded strings as strings of unsigned bytes yields the same order as sorting the corresponding Unicode strings lexicographically <file:///wiki/Lexicographical_order> by codepoint.

On Saturday, 9 January 2016, Karl Kleinpaste <ka...@kl...> wrote:

> Unfortunately, that approaches the problem from exactly the wrong
> direction. I have UTF-8 strings that need to be sorted as UTF-8 strings
> to display in a UTF-8-enabled application, not converted to UTF-16 so that
> the (broken) sort comparator is accommodated. There seems to be no UTF-8
> equivalent offered. But I do appreciate the efforts to help find a
> solution.

--
Best regards,

David F. Haslam
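
P.S. A minimal sketch of the byte-order property quoted at the top of this message, assuming Python 3. The function names and the sample strings are my own, chosen only for illustration; they are not taken from any library discussed in this thread. It sorts the same strings three ways: by code point, by raw UTF-8 bytes, and by UTF-16 code units.

```python
# Illustrative only: the names and sample data below are hypothetical.

SAMPLE = ["z", "\u00e9", "a", "\U0001F600", "\uff21", "ab"]


def codepoint_order(strings):
    """Sort by Unicode code point (Python compares str this way)."""
    return sorted(strings)


def utf8_byte_order(strings):
    """Sort by the unsigned bytes of each string's UTF-8 encoding."""
    return sorted(strings, key=lambda s: s.encode("utf-8"))


def utf16_unit_order(strings):
    """Sort by UTF-16 code units.

    Big-endian bytes are used so that comparing the byte strings is the
    same as comparing the sequences of 16-bit code units.
    """
    return sorted(strings, key=lambda s: s.encode("utf-16-be"))


if __name__ == "__main__":
    print(utf8_byte_order(SAMPLE) == codepoint_order(SAMPLE))
    print(utf16_unit_order(SAMPLE) == codepoint_order(SAMPLE))
```

With this sample the UTF-8 byte sort agrees with the code point sort, while the UTF-16 sort does not: U+1F600 is stored as a surrogate pair starting with 0xD83D, which compares lower than the single code unit 0xFF21. Note that this is code point order only, not locale-aware collation.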