It's this one:

Convert all non-ASCII and Tcl-significant characters into \u escape sequences by using regsub and subst in combination:

# This RE is just a character class for everything "bad"
set RE {[][{};#\\\$\s\u0080-\uffff]}

# We will substitute with a fragment of Tcl script in brackets
set substitution {[format \\\\u%04x [scan "\\&" %c]]}

# Now we apply the substitution to get a subst-string that
# will perform the computational parts of the conversion.
set quoted [subst [regsub -all $RE $string $substitution]]

===========================================
The RE for "bad" characters includes \s which matches all whitespace characters including newline. However, inserting \ before newline before calling subst on a string containing it does not preserve the newline, it causes it to be replaced by a space, so the whole procedure replaces newlines with \u0020

Not sure how to get this to work properly! Little help?

Discussion

Donal K. Fellows - 2010-09-10

Good catch. Newlines need to be handled specially. :-(

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Donal K. Fellows - 2010-09-10

assigned_to: pvgoran --> dkf
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Donal K. Fellows - 2010-09-10

Updated example text is as below.

# This RE is just a character class for almost everything "bad"
set RE {[][{};#\\\$ \r\t\u0080‐\uffff]}

# We will substitute with a fragment of Tcl script in brackets
set substitution {[format \\\\u%04x [scan "\\&" %c]]}

# Now we apply the substitution to get a subst‐string that
# will perform the computational parts of the conversion. Note
# that newline is handled specially through string map since
# backslash‐newline is a special sequence.
set quoted [subst [string map {\n {\\u000a}} \ [regsub -all $RE $string $substitution]]]

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Donal K. Fellows - 2010-09-10

Fixed in HEAD and 8.5 branch (alas, just missed the train for 8.5.9...)

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Donal K. Fellows - 2010-09-10

status: open --> closed-fixed
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

regsub example doesn't perform as advertized

The Tool Command Language implementation

Group

Searches

Help

#4711 regsub example doesn't perform as advertized

It's this one:

Discussion