Clark C . Evans [mailto:cce@...] wrote:
> | About the rest (NEL, PS and LS) - there's a lovely little
> | document in
> | http://www.unicode.org/unicode/reports/tr13/ which
> | describes what we (or
> | anyone else) should be doing
> Great. So CR, LF, CRLF, and NEL are all normalized to LF.
> | Outside scalars we throw away the line break characters anyway
> | so there's no isse of what PS/LS should map to.
> | Inside text scalars I suggest that never convert PS/LS into
> | LF or fold them
> | into a space. If someone is using them presumably he has a
> | good reason to,
> | and he's aware that notepad wouldn't handle it well.
> Off hand, I think we should normalize PS/LS just like
> the others, unless, of course, they are escaped.
I think that would be an unnecessary incompatibility with Unicode.
> The problem with the scalar treatment is the edge
> case. (PS = line ended using PS instead of CR/LF/CRLF/NEL)
I didn't understand the problem:
> one: bing PS
The value is "one".
> two: \\PS
> bop PS
> foo PS
The value is "bop<PS>foo". The first and last PS-es aren't part of the
escaped scalar value. If you want "<PS>bop<PS>foo<PS>" you have to write:
Where "eol" is any line break form (PS included).
> three: bar PS
The value is "bar".
I don't see the problem...