From: <pl...@pi...> - 2011-01-27 16:10:08
|
On 01/27/11 14:46, mw...@gm... wrote: > * Having an optional 3 byte sequence at the beginning of a file complicates > things a lot. I think a script to "fix" damaged utf-8 files is probably the > best solution: > > awk '{if(NR==1)sub(/^\xef\xbb\xbf/,"");print}' text.txt > #http://www.linuxask.com/questions/how-to-remove-bom-from-utf-8 > Hi, thanks for the script, that is what I suggested dong a couple of days ago but I now find I sent from the wrong account so the list apparently dropped it. (Didn't it used to send a warning for that ??) Since it appears that this BOM is a valid uft-8 white space character isn't it conceivable that try to dance around MS non-standard stupidity could mess up interpretation of a valid input file or gnuplot script? regards |