From: Colin P. A. <co...@co...> - 2007-02-05 09:32:37
|
>>>>> "Eric" == Eric Bezault <er...@go...> writes: Eric> UTF-8 byte representation? Even if it's a multibyte, you can Eric> replace [\i:] by (multibyte1|...|multibyten|:) and likewise Eric> for similar regexp constructs. While that is possible, I think the resultant string for \i, \c and similar properties will be of the order of 20KB. I don't know how the regular expression engine works, but if it needs to compare a space character (for instance) with each of 20000 characters in order to reject a test, I think it will be far too inefficient. -- Colin Adams Preston Lancashire |