From: Eric B. <er...@go...> - 2007-11-11 16:50:55
|
Colin Paul Adams wrote: > The re @.@ matches two @ characters separated by a single character. > > If that character is a non-ascii Unicode character, then the trick > Franck suggested of replacing both the data and the re with their > utf-8 byte sequences doesn't work, as the . will match only one byte. You'll have to replace '.' by it's uft-8 equivalent. Something like that: ([\0-\9]|[\11-\x7F]|([\x80-\xFF][\0-\x7F])|......) I guess you see what I mean. -- Eric Bezault mailto:er...@go... http://www.gobosoft.com |