|
From: Shigemichi Y. <ya...@gl...> - 2001-07-05 19:26:24
|
At Thu, 5 Jul 2001 09:50:59 -0500 ,
Johnson, Clay <cla...@vi...> wrote:
> Technically, ISO-8859-1 does not assign code points 0x80-9f. I suppose an
> eager implementation may refuse to map these code points, although I would
> expect it to substitute 0x3f ('?') rather than return nothing.
I don't have a copy of ISO standard, but Roman Czyborra states
"... the positions 128 to 159 hold some less used control characters:
the so-called C1 set from ISO 6429."
(http://czyborra.com/charsets/iso8859.html)
Also, Unicode mapping table
(http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-1.TXT) assigns
0x80-0x9f to the corresponding Unicode control characters. So my
understanding is that there is no unassigned code point in ISO-8859-1
code space.
I agree with you that if the conversion fails Java returns REPLACEMENT
CHARACTER rather than a null string.
-------------------
Shigemichi Yazawa
ya...@gl...
|