On Saturday, Oct 11, 2003, at 23:41 Australia/Sydney,
Bernard
Desgraupes wrote:
> While I was working with the AppleScript mode I
noticed the following
> (certainly known but) annoying issue with the [glob]
command. In order
> to
> test filenames with accented letters I have a
> file named "יטאשמ".
> If I run [glob] over the directory containing this
file, it comes out
> as
> e?e?a?u?i? with AlphaX (just the same with Wish, only
little squares
> instead of question marks).
> It looks like [glob] is not yet correctly
> implemented on OSX (?):
there is nothing really incorrect about the result you
get, it's just
in decomposed unicode, but the glob matches
sucessfully. The decomposed
name is strictly equivalent to the composed one...
> I've heard about some problem with the
> Finder returning decomposed Unicode names but I'd
just like to know if
> there is currently a workaround so that I can handle
these files.
what do you need to do with these files?
tcl should handle decomposed filenames fine, the only
thing that
doesn't work is [string compare] type operations that
are not
decomposed-unicode aware
until somebody (me?) implements unicode
composition/decomposition in
tcl, there is not much more you can do unfortunately.
We could
potentially add a tcl interface to my unicode
composition/decomposition
routines in osxMacTcl.c for use by AlphaTcl in the
meantime?
Logged In: YES
user_id=90580
also see duplicate bug 956433:
http://sourceforge.net/tracker/?
func=detail&aid=956433&group_id=10894&atid=110894
my last comment there is still current as far as my efforts go:
> My effort to implement this has currently
> stalled, somebody else is most welcome to have a go; otherwise
> I'll get back to it at some point, it's on the todo list...
> It might be worthwile for somebody to devise a general solution
> to how tcl deals with unicode normlizatoin, including all
> normalization forms, not just NFC and NFD like we need in this
> context.