CHET-C.js does not encode parentheses and angle brackets (<, >) in Greek text. According to Hugh, "the javascript regular expression implementation does not take Unicode into account when evaluating \w".
Logged In: YES user_id=596146
Fixed by substituting a character group of Unicode ranges for \w during the parsing of the replacements file.
Any reason this shouldn't be closed?
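For anyone reading this later, here is a minimal sketch of the kind of substitution described in the fix. It is not the actual CHET-C.js code: the names WORD_CLASS and expandWordClass are hypothetical, and the Greek ranges chosen (Greek and Coptic, Greek Extended) are an assumption about what the replacements file needs.

```javascript
// Hypothetical sketch, not taken from CHET-C.js.
// \w in JavaScript matches only [A-Za-z0-9_], so Greek letters are not
// treated as word characters. This explicit class adds two assumed ranges:
// Greek and Coptic (U+0370-U+03FF) and Greek Extended (U+1F00-U+1FFF).
var WORD_CLASS = "[A-Za-z0-9_\\u0370-\\u03FF\\u1F00-\\u1FFF]";

// Replace every literal \w in a pattern source string with the explicit class.
// Limitation: a \w that already sits inside [...] would end up with nested
// brackets, so this sketch assumes \w only appears outside character classes.
function expandWordClass(patternSource) {
  return patternSource.replace(/\\w/g, WORD_CLASS);
}

var greekWord = "\u03BB\u03CC\u03B3\u03BF\u03C2"; // the Greek word "logos"

var plain = new RegExp("^\\w+$");
var expanded = new RegExp(expandWordClass("^\\w+$"));

console.log(plain.test(greekWord));    // false
console.log(expanded.test(greekWord)); // true
```

Doing the rewrite once, while the replacements file is parsed, keeps the patterns in that file readable, since they can go on using \w.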