#962 Perl syntax highlighter breaks with umlaut


The Perl syntax highlighter doesn't recognize the end of a string when the string contains an umlaut. The Python, Ruby and Lua highlighters don't have this problem, and SciTE 3.3.2 also highlights the file correctly. (Geany 1.24 git >= 72c809f compiled May 21 2013; Windows 7/64)
See the attached file geany.pl.


Bugs: #995


  • Lex Trotman

    Lex Trotman - 2013-05-31
    • status: open --> open-accepted
  • Lex Trotman

    Lex Trotman - 2013-05-31

    Confirmed with the latest git, and confirmed that it works in SciTE.

  • Heiko

    Heiko - 2013-06-12

    I want to add:
    1. Geany 1.24 (git >= c1b511d) compiled 2013-06-12 -> broken
    2. SciTE 3.3.3 -> OK
    3. I double-checked that in all cases the file encoding was set to CP1252 and NOT UTF-8 (Geany: document/encoding/western/WINDOWS 1252; SciTE: file/encoding/Code Page Property)

  • Colomban Wendling

    @Heiko Geany always holds text in UTF-8 internally. If the file uses any other encoding, it gets converted upon loading and saving. SciTE, on the other hand, loads the file without any conversion, so it keeps the original encoding in memory.

    Anyway, this is a Scintilla bug. It is indeed a byte vs. character issue, so it only shows up when a multi-byte character is involved, and that's all there is to it. Try saving your file as UTF-8 and then loading it in SciTE (as UTF-8, of course): you'll see both break in exactly the same way.
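
To illustrate the byte vs. character distinction described above, here is a minimal Python sketch. It is not the Scintilla lexer code, and the sample line is hypothetical rather than taken from the attached geany.pl; the helper name `closing_quote_offset` is made up for this example. It only shows that the same umlaut occupies one byte in CP1252 and two bytes in UTF-8, so a position counted in characters no longer lines up with the byte buffer.

```python
# Hypothetical Perl source line containing an umlaut (U+00E4, "a" with diaeresis).
line = 'my $s = "K\u00e4se"; # comment'

# The same text as it would sit in memory in each editor, per the comment above:
cp1252 = line.encode("cp1252")  # SciTE with a CP1252 file: one byte per character
utf8 = line.encode("utf-8")     # Geany (always UTF-8): the umlaut takes two bytes


def closing_quote_offset(buf: bytes) -> int:
    """Return the byte offset of the second double quote in buf."""
    first = buf.index(b'"')
    return buf.index(b'"', first + 1)


# Position of the closing quote counted in characters.
close_char = line.index('"', line.index('"') + 1)

# In CP1252 the character index and the byte offset agree.
same_in_cp1252 = closing_quote_offset(cp1252) == close_char

# In UTF-8 the byte offset is shifted by the extra byte of the umlaut,
# so code that mixes the two units looks at the wrong byte.
shift_in_utf8 = closing_quote_offset(utf8) - close_char
byte_at_char_index = utf8[close_char:close_char + 1]

print(same_in_cp1252, shift_in_utf8, byte_at_char_index)
```

Any code that mixes the two units ends up one byte short of the closing quote for every two-byte character in the string, which would match the symptom reported here, assuming the lexer confuses the two in a similar way.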

  • Heiko

    Heiko - 2013-06-12

    I understand. So I made a copy of geany.pl encoded as UTF-8 with a BOM (see attachment).
    Now SciTE 3.3.3 breaks the syntax highlighting as well.
    Also: Geany 1.24 (git >= ea87576) compiled 2013-04-24 highlights both versions of the file (UTF-8 and CP1252) just fine, but I bet you already knew that.

  • Colomban Wendling

    • status: open-accepted --> closed-fixed
    • Group: v1.23.1 --> v1.24
