From: David O. <da...@qc...> - 2012-10-31 09:22:57
|
On 30 October 2012 20:39, Stephen Deasey <sd...@gm...> wrote: > On Tue, Oct 30, 2012 at 7:59 PM, Stephen Deasey <sd...@gm...> wrote: > > > > But the code points of iso88591 are a subset of utf8... > > Actually, this doesn't make sense. The byte encoding of code points > above 128 uses two bytes for utf8, but only one byte for iso88591. > Yes this is exactly it Stephen. We're communicating with an external system which uses iso8859-1. So for the extended character set the results are different. The external system gets confused if it tries to decode something we encoded using utf-8 eg: nscp 52> ns_urlencode -charset iso8859-1 ú %fa nscp 53> ns_urlencode -charset utf-8 ú %c3%ba nscp 54> ns_urldecode -charset iso8859-1 %c3%ba ú Thanks very much Gustaf for making this change! |