Menu

#20 UTF comment support

open
nobody
None
5
2009-10-30
2009-10-30
Anonymous
No

please support non english comments, and use \s insted of \w in CloudeCommitTableRepo file:

line 106:
- final String[] split = comment.split("\\W+");
+ final String[] split = comment.split("\\s+");

because the "W" means only a-zA-Z and _ caracters and it excludes non english characters, but "s" split on whitespaces...

Discussion

  • Anonymous

    Anonymous - 2009-10-30
    • summary: utf comment support --> UTF comment support
     
  • Anonymous

    Anonymous - 2009-10-30

    sorry... the correct meanings:

    \s A whitespace character: [ \t\n\x0B\f\r]
    \w A word character: [a-zA-Z_0-9]
    \W A non-word character: [^\w]

    from: http://java.sun.com/j2se/1.4.2/docs/api/java/util/regex/Pattern.html

     

    Last edit: Anonymous 2013-11-01

Log in to post a comment.