From: Loren P. <pe...@ne...> - 2000-07-24 08:05:55
On Sat, 22 Jul 2000, Aaron Davies wrote:

> I've discovered while messing around with the strings that the XML
> parser in A1 chokes on certain high-ASCII characters, particularly "*"
> (opt-d). As far as I can tell from the W3C's definition of XML, any
> character in unicode is legal for XML.

That may be some kind of problem with "expat", the XML parser that I use. I had set it to run in single-byte mode, because I want to read from ASCII files, and Unicode normally runs in double-byte mode. However, there is an encoding of Unicode called UTF-8 that represents each standard low-half ASCII character as a single 8-bit byte. But I don't really know much about UTF-8 or the canonical translations of MacOS high-half character values into UTF-8. Those would most likely be done with multi-byte values.

Maybe we can find some Unicode expert somewhere to help us out...

Loren Petrich                   Happiness is a fast Macintosh
pe...@ne...                     And a fast train
My home page: http://www.petrich.com/home.html
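
A rough sketch of the translation discussed above, for anyone who wants to experiment. It assumes the strings file is in the classic MacRoman encoding and uses Python's standard-library binding to expat rather than the C API the game itself would call. The helper names mac_roman_to_utf8 and parse_text are made up for illustration. The idea is to transcode the bytes to UTF-8 before handing them to the parser: low-half ASCII passes through unchanged, while high-half characters become multi-byte sequences.

```python
import xml.parsers.expat


def mac_roman_to_utf8(raw: bytes) -> bytes:
    """Transcode MacRoman bytes to UTF-8 (hypothetical helper)."""
    return raw.decode("mac_roman").encode("utf-8")


def parse_text(data: bytes) -> str:
    """Parse an XML document with expat and return its character data."""
    chunks = []
    parser = xml.parsers.expat.ParserCreate()
    # expat may deliver text in several pieces, so collect and join them.
    parser.CharacterDataHandler = chunks.append
    parser.Parse(data, True)
    return "".join(chunks)


# 0xB6 is the opt-d character in MacRoman (assumed source encoding).
raw = b"<s>\xb6</s>"
utf8 = mac_roman_to_utf8(raw)
print(utf8)

# With no encoding declaration expat assumes UTF-8, so the raw
# MacRoman byte is expected to be rejected as malformed input.
try:
    parse_text(raw)
except xml.parsers.expat.ExpatError as err:
    print("raw MacRoman rejected:", err)

# The transcoded document parses; ascii() keeps the output console-safe.
print(ascii(parse_text(utf8)))
```

Note that the opt-d character comes out as three bytes in UTF-8, not two, so a fixed 2-byte assumption would not cover the whole MacRoman high half.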