Work at SourceForge, help us to make it a better place! We have an immediate need for a Support Technician in our San Francisco or Denver office.

Close

#3813 Word definition not honoured in word movements

normal bug
open
nobody
5
2013-07-21
2013-07-21
Edward Diener
No

According to the documentation a word is "a sequence of alphanumerics" along with whatever is specified as "Global Options - Editing - Extra Word Characters". Yet when using the "Go to Next Word" and "Go to Previous Word" functionality a word is always treated in a hard-coded manner as a set of non-space characters. Similarly the "Select Next Word" and "Select Previous Word" always treats a word in the same manner as a set of non-space characters.

Discussion

  • Mike Maxwell
    Mike Maxwell
    2013-07-22

    I believe "alphanumerics" needs to be defined (at least) on a character encoding specific basis. In Unicode, for example, it should probably default to the regular expression
    [\p{Letter}|\p{Mark}|\p{Number}]
    I'm not sure whether ZWJ and ZWNJ would count as word-forming

    The definition in some of the ISO 8859-N encodings would differ as well, and presumably Big5 etc. etc.

     
  • Word chars are defined differently in different languages, in most of them it is alphanumeric, but it can also include _ and other chars. Did you encounter a problem ?