Re: [ssax-sxml] More-than-ASCII chars
Brought to you by:
oleg
From: Shiro K. <sh...@la...> - 2003-10-18 07:30:40
|
From: ol...@po... Subject: [ssax-sxml] More-than-ASCII chars Date: Fri, 17 Oct 2003 11:15:01 -0700 (PDT) > Any Java/JVM-based Scheme system should have no problem with implementing > these functions. Ah, I'm afraid not. How about ⠾ ---if the implementation uses utf-16, it has to become two characters, one's code is #xd800 and another's is #xdf02. (Or is there a JVM-based Scheme that does handle surrogate pair as a single character?) > Perhaps some of the readers of this list would like to take upon that SRFI? I put my presentation on ILC2003 about multilingualization (m17n) support on Scheme on web, in which I suggested several operations that would be useful to write a portable Scheme programs across various implementations. http://www.shiro.dreamhost.com/scheme/docs/ILC2003.html More discussion is necessary to nail down what operations are essential, before start thinking about m17n SRFI, though. If anybody has ideas/optinions on it, I'm eager to hear it. > Perhaps on some systems that handle Unicode/UCS2 > char-alphabetic? can do all that. If it can, SSAX can handle XML > documents where _tag_ names themselves (not the character content!) > are written in Greek, in Kana, or Kanji. Indeed, according to the XML > Recommendation, all Kanji are considered letters and may appear > wherever an ASCII letter is allowed. That's nice. Hmm, I need a rather large table to do that, though... --shiro |