From: <no...@so...> - 2002-03-27 13:19:14
|
Feature Requests item #535705, was opened at 2002-03-27 05:19 You can respond by visiting: http://sourceforge.net/tracker/?func=detail&atid=360894&aid=535705&group_id=10894 Category: 15. Commands A-H Group: None Status: Open Resolution: None Priority: 5 Submitted By: Nobody/Anonymous (nobody) Assigned to: Donal K. Fellows (dkf) Summary: Tcl interface to stop on encoding errors Initial Comment: When [encoding convertto] encounters a character that cannot be represented in the target encoding, it silently substitutes a default instead. In the C interface there is a flag TCL_ENCODING_STOPONERROR which instead causes the underlying function to return with an error when this condition occurs. It would be useful to have a switch for [encoding convertto] to get the same behaviour there, especially for cross-platform installation programs (where encoding conversions are common and unnoticed character substitutions can be very dangerous). PS: While I'm at [encoding] anyway, I might as well point out that the man page for the [encoding] command (of Tcl8.4 as currently on tcl.activestate.com) in the Example section incorrectly claims that "because the source command always reads files using the ISO8859-1 encoding, Tcl will treat each byte in the file as a separate character that maps to the 00 page in Unicode". I believe the truth is rather that [source] uses the system encoding. The example should work, since \xHH produces character in said Unicode page, but including the corresponding bytes as explicit characters in the source probably won't work cross-platform. ---------------------------------------------------------------------- You can respond by visiting: http://sourceforge.net/tracker/?func=detail&atid=360894&aid=535705&group_id=10894 |
From: <no...@so...> - 2002-04-05 13:23:53
|
Feature Requests item #535705, was opened at 2002-03-27 13:19 You can respond by visiting: http://sourceforge.net/tracker/?func=detail&atid=360894&aid=535705&group_id=10894 >Category: 42. UTF-8 Strings Group: None Status: Open Resolution: None Priority: 5 Submitted By: Nobody/Anonymous (nobody) >Assigned to: Jeffrey Hobbs (hobbs) Summary: Tcl interface to stop on encoding errors Initial Comment: When [encoding convertto] encounters a character that cannot be represented in the target encoding, it silently substitutes a default instead. In the C interface there is a flag TCL_ENCODING_STOPONERROR which instead causes the underlying function to return with an error when this condition occurs. It would be useful to have a switch for [encoding convertto] to get the same behaviour there, especially for cross-platform installation programs (where encoding conversions are common and unnoticed character substitutions can be very dangerous). PS: While I'm at [encoding] anyway, I might as well point out that the man page for the [encoding] command (of Tcl8.4 as currently on tcl.activestate.com) in the Example section incorrectly claims that "because the source command always reads files using the ISO8859-1 encoding, Tcl will treat each byte in the file as a separate character that maps to the 00 page in Unicode". I believe the truth is rather that [source] uses the system encoding. The example should work, since \xHH produces character in said Unicode page, but including the corresponding bytes as explicit characters in the source probably won't work cross-platform. ---------------------------------------------------------------------- You can respond by visiting: http://sourceforge.net/tracker/?func=detail&atid=360894&aid=535705&group_id=10894 |