From: <hp...@we...> - 2022-06-19 11:00:21
Hello Leslie!

First -- I have to confess that the 'csvcln' stage I showed recently is an example from VM/CMS running on codepage 273, and it was not tested with double-byte characters. The goal was to get rid of all superfluous extras so that CSV tables become comparable, whatever their provenance. Thus my suggestion of masking word separators might not work with today's UTF-8 or UTF-16 content.

Next -- the following remarks are neither NetRexx- nor Java-specific.

Your succession of change stages makes me shiver ...

> change /\n"/ /\n“/ |
> change / "/ / “/ |
> change /"\n/ /”\n/ |

... because it uses arguments which /could/ be subject to interpretation by a "funny" editor or other "helpful" tools I don't know. You may consider it paranoid, but my suggestion is to use only hex values as arguments for Change (I didn't do so in my csvcln, but that was for a rather stable OS).

> loop while execOptions.words > 0
> parse execOptions execOption execOptions
> select
> when 'help'.abbrev(execOptions.lower,1) then

The variable execOption is very close to execOptions, which is pretty confusion-prone. (BTW, there is a programming language which allows looong variable names, but the compiler uses only the first eight letters.)

As for the first When condition, I'm not sure: could there be a little 's' too much?

Best,
M.

On 19.06.2022 at 10:12, J Leslie Turriff wrote:
> Hi,
> I've written a little routine to replace plain quotes and apostrophes (' ") with the
> [...]
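
To illustrate the hex-only suggestion for the Change arguments, here is a minimal sketch in Java rather than in pipeline syntax. The class name HexQuotes and the method curlQuotes are hypothetical, and the sketch assumes the third change was meant to produce the closing quotation mark U+201D. Every character is built from its hex code point, so the source contains no literal typographic quotes that an editor or transfer tool could reinterpret.

```java
// Hypothetical illustration: none of these names come from the original routine.
final class HexQuotes {
    // All characters are given as hex code points, never as literals.
    static final String PLAIN = String.valueOf((char) 0x22);   // plain double quote
    static final String OPEN  = String.valueOf((char) 0x201C); // left double quotation mark
    static final String CLOSE = String.valueOf((char) 0x201D); // right double quotation mark (assumed target)
    static final String NL    = String.valueOf((char) 0x0A);   // line feed
    static final String BLANK = String.valueOf((char) 0x20);   // space

    // Mirrors the three quoted change stages, applied in the same order.
    static String curlQuotes(String text) {
        return text
            .replace(NL + PLAIN, NL + OPEN)        // quote at start of a line -> opening quote
            .replace(BLANK + PLAIN, BLANK + OPEN)  // quote after a blank      -> opening quote
            .replace(PLAIN + NL, CLOSE + NL);      // quote at end of a line   -> closing quote
    }

    public static void main(String[] args) {
        String sample = "say " + PLAIN + "hello" + PLAIN + NL;
        // Print code points in hex so the result does not depend on the console encoding.
        curlQuotes(sample).chars()
            .forEach(c -> System.out.print(Integer.toHexString(c) + " "));
        System.out.println();
    }
}
```

Like the quoted stages, this leaves a plain quote untouched when it is neither preceded by a blank or line end nor followed by a line end; it is only meant to show the argument style, not a complete quote converter.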