From: Tor L. <tm...@ik...> - 2009-07-09 10:22:22
> Well, yes, ..., but software that uses the 8-bit calls is broken
> anyway in the sense that it stops working the moment the codepage
> changes.

I don't know what you mean with "the moment the codepage changes". You
make the situation sound worse than it is. The system codepage of a
machine does not change without overwriting the Windows installation
with a different language edition of Windows, as far as I know.

Code that is written to use the "normal" C library (plain "char") APIs,
and the A-suffixed versions of Win32 APIs (that is, without any suffix
at all, assuming UNICODE is not defined), for instance code ported
straight from Unix, does work fine in most cases on various language
editions of Windows with different system codepages, and is able to
handle non-ASCII file names in the system codepage in question. You
don't need to write such code to work in just one particular system
codepage. (In fact, it would be hard to do that intentionally.)

"Narrow char" code will usually, to the best of my knowledge, work fine
on a Western Windows installation, a Greek one, an Arabic one, a Hebrew
one etc. without recompilation, and will handle files with names in
those codepages (all of which include plain ASCII in the 7-bit half).

(On systems with East Asian double-byte system codepages, such "plain"
code will also work mostly fine, except that doing things like
strchr(filename, '\\') to find directory separators will break, as some
double-byte characters have '\\' as the second byte. Ditto for '/'. To
properly handle strings encoded in double-byte system codepages, one
should use the multi-byte string functions like _mbschr().)

It is only the case where a system has files with names containing
characters not in the system codepage that absolutely *requires* using
Unicode, i.e. wide character strings and wide character APIs, to handle
such files. I have no idea how common or rare such situations are, but
they might be quite common in some parts of the world, or in
institutions that regularly handle files from different parts of the
world. In my personal opinion, it is important to be prepared for such
situations. That is why I tend to bring up the issue of being Unicode
aware.

--tml
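
(To illustrate the strchr() / _mbschr() point above, here is a minimal
sketch; it is not from the original mail, the file name is
hypothetical, and it assumes the Windows CRT. In Shift-JIS, the
character 能 is the byte pair 0x94 0x5C, and 0x5C is also the byte
value of '\\'.)

#include <cstring>
#include <cstdio>
#include <mbstring.h>   // _mbschr(): Windows CRT only
#include <mbctype.h>    // _setmbcp()

int main()
{
    _setmbcp(932);                      // CRT multibyte codepage: Shift-JIS
    const char* name = "\x94\x5C.mp3";  // "能.mp3" in Shift-JIS

    // strchr() matches the trail byte 0x5C of 能 and reports a bogus
    // directory separator; _mbschr() steps over whole double-byte
    // characters and correctly finds none.
    const char* bad = std::strchr(name, '\\');
    const unsigned char* good = _mbschr((const unsigned char*)name, '\\');

    std::printf("strchr: %s, _mbschr: %s\n",
                bad  ? "bogus separator found" : "none found",
                good ? "separator found"       : "none found");
    return 0;
}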
From: Marian C. <ci...@in...> - 2009-07-09 19:07:05
>> Well, yes, ..., but software that uses the 8-bit calls is broken
>> anyway in the sense that it stops working the moment the codepage
>> changes.
>
> I don't know what you mean with "the moment the codepage changes". You
> make the situation sound worse than it is. The system codepage of a
> machine does not change without overwriting the Windows installation
> with a different language edition of Windows, as far as I know.

I thought you could change it from Control Panel, or at least from
RegEdit. Anyway, that's not what I meant; even if you can do it, it's
not a normal use scenario. I was thinking more in terms of transferring
data between computers with different codepages, e.g. unpacking an
archive that your "different codepage" friend sent you.

As for making the situation sound worse than it is, you're right. Most
applications work fine with 8-bit calls and local codepages. My
personal case is an MP3 diagnosis and correction tool (called MP3
Diags), which I wrote on Linux and then ported to Windows. It seems to
work fine as long as you stay within ASCII, but it doesn't see anything
above that, because of a mix of UTF-8 and local-codepage calls. I can
improve it to always use the local codepage, but that doesn't really
fix it, because on my computer I have files whose names don't fit in
the local codepage (and if I change the codepage, then other names
won't fit).

If you go beyond Western Europe and North America, the chance of people
having MP3 files whose names fall outside the local codepage increases
sharply. Besides being invisible (for now) to my program, such files
can't be copied, opened, compared, played ... by tools that only
understand the local codepage. I think MP3 files are more likely than
others to have "wrong" names: while users have been taught to stick to
ASCII when creating files if they want to stay out of trouble, CD
rippers can use whatever characters they please.

So this sort of brings me to my main point: we're stuck in a situation
where users would like to use more characters in file names, but they
don't, because many tools can't deal with such names; the tools, on the
other hand, have little incentive to change, because users have learnt
their lesson and just use ASCII, and perhaps the local codepage. So my
suggestion is that the tools should change.

Then why don't *I* start making the change? Well, I'm already spending
a lot of time on the freely available MP3 tool, and there are other
things that I need or want to do. Also, it would take me a lot more
time to make, say, MinGW UTF-8-aware than it would take somebody who is
already familiar with it.
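
(A small sketch of what "don't fit in the local codepage" means in
practice; it is not from the original mail, the Greek file name is
hypothetical, and it assumes a Western ANSI codepage such as 1252.
Converting such a name to the narrow form loses information, which
Windows reports through the "used default char" flag:)

#include <windows.h>
#include <cstdio>

int main()
{
    // Hypothetical name; on a cp1252 system the Greek letters have no
    // narrow representation.
    const wchar_t* wname = L"\u03bc\u03bf\u03c5\u03c3\u03b9\u03ba\u03ae.mp3"; // μουσική.mp3

    char narrow[MAX_PATH];
    BOOL lost = FALSE;
    WideCharToMultiByte(CP_ACP, 0, wname, -1,
                        narrow, sizeof narrow, NULL, &lost);

    // If "lost" is TRUE, the narrow name is already wrong: fopen() or
    // CreateFileA() on it would look for "???????.mp3", while
    // CreateFileW(wname, ...) opens the real file.
    std::printf("ANSI view: %s (information lost: %s)\n",
                narrow, lost ? "yes" : "no");
    return 0;
}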
From: Marian C. <ci...@in...> - 2009-07-17 07:13:43
> So this sort of brings me to my main point: we're stuck in a situation
> where users would like to use more characters in file names, but they
> don't, because many tools can't deal with such names; the tools, on
> the other hand, have little incentive to change, because users have
> learnt their lesson and just use ASCII, and perhaps the local
> codepage. So my suggestion is that the tools should change. Then why
> don't *I* start making the change? Well, I'm already spending a lot of
> time on the freely available MP3 tool, and there are other things that
> I need or want to do. Also, it would take me a lot more time to make,
> say, MinGW UTF-8-aware than it would take somebody who is already
> familiar with it.

In case somebody stumbles upon this thread looking for a solution to
the same problem, here's what I did: I created drop-in replacement
classes for fstream / ifstream / ofstream, which take Unicode names
(UTF-8 or UTF-16) in their constructors and in their open() methods.

The code is available for download from the MP3 Diags project. You need
the files fstream_unicode.h and fstream_unicode.cpp. You can also take
a look at them at
http://mp3diags.svn.sourceforge.net/viewvc/mp3diags/src/fstream_unicode.h?view=markup
and
http://mp3diags.svn.sourceforge.net/viewvc/mp3diags/src/fstream_unicode.cpp?view=markup

They haven't been heavily tested, but they seem to work fine for me.
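
(For readers who only want the general shape of such a wrapper, here is
a rough sketch of the underlying technique. This is not the
fstream_unicode code, the helper name is made up, and it assumes
MinGW/libstdc++, whose fstream has no wide-character open:)

#include <windows.h>
#include <cstdio>
#include <istream>
#include <string>
#include <vector>
#include <ext/stdio_filebuf.h>   // libstdc++ extension

// Hypothetical helper: UTF-8 -> UTF-16.
std::wstring utf8ToUtf16(const std::string& s)
{
    int n = MultiByteToWideChar(CP_UTF8, 0, s.c_str(), -1, NULL, 0);
    std::vector<wchar_t> buf(n);
    MultiByteToWideChar(CP_UTF8, 0, s.c_str(), -1, &buf[0], n);
    return std::wstring(&buf[0]);
}

int main()
{
    std::wstring wname = utf8ToUtf16("\xC3\xA9t\xC3\xA9.mp3"); // UTF-8 "été.mp3"
    FILE* f = _wfopen(wname.c_str(), L"rb");   // wide-char open: any name works
    if (!f)
        return 1;
    {
        // Wrap the FILE* in a filebuf; from here on it is an ordinary
        // istream, regardless of what the file is called.
        __gnu_cxx::stdio_filebuf<char> fbuf(f, std::ios::in | std::ios::binary);
        std::istream in(&fbuf);
        // ... read from "in" as usual ...
    }
    std::fclose(f);
    return 0;
}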
From: Tor L. <tm...@ik...> - 2009-07-10 09:15:04
> So this sort of brings me to my main point: we're stuck in a situation
> where users would like to use more characters in file names, but they
> don't, because many tools can't deal with such names;

Thus it is a good thing that modern programming environments like Java
and C# use Unicode for file names (for all strings, in fact). Just
write your code in Java or C# and there is no problem with arbitrary
file names on Windows.

Also, for instance, the GTK+ stack API uses Unicode (UTF-8) for file
names on Windows. I don't know what other comparable toolkits like Qt
do; hopefully the same. So if you for some reason don't want to use
Java or C#, use C (or C++), but with a library / toolkit that provides
a UTF-8 view of the file system.

Writing application code in plain C, in this century, without any
higher-level toolkit than the C library or basic Win32 APIs, sounds a
bit odd to me.

--tml
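
(As an illustration of the "UTF-8 view of the file system" that GLib,
the base of the GTK+ stack, provides; the file name is hypothetical.
g_fopen() from <glib/gstdio.h> takes a UTF-8 name and, on Windows,
converts it to UTF-16 and calls the wide-char CRT internally, so the
same source line handles any name:)

#include <glib/gstdio.h>   // g_fopen(): UTF-8 file names on all platforms
#include <cstdio>

int main()
{
    // UTF-8 bytes of the hypothetical name "μουσική.mp3".
    FILE* f = g_fopen("\xCE\xBC\xCE\xBF\xCF\x85\xCF\x83"
                      "\xCE\xB9\xCE\xBA\xCE\xAE.mp3", "rb");
    if (f)
    {
        // ... read the file ...
        std::fclose(f);
    }
    return 0;
}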
From: Mark <mar...@gm...> - 2009-07-12 07:47:42
Tor Lillqvist wrote:
> Writing application code in plain C, in this century, without any
> higher-level toolkit than the C library or basic Win32 APIs, sounds a
> bit odd to me.
>
> --tml

I can't let that pass, really :-) There may be good reasons; for
instance, my current project is a Windows port of a highly portable
browser. Although its browser function needs a few libs, it makes sense
for its GUI to be as unbloated as possible, as well as [hopefully, when
I get it right :-)] working compatibly in as many different flavours of
Windows as possible; hence w32api / gdi.

Best,
Mark

http://www.halloit.com
Key ID 046B65CF