#1393 Unicode characters sometimes displayed as boxes

Scintilla (796)
Dan Weiss

Some Unicode characters are not displayed properly, unless they are followed by another character of a similar script.
For example, U+25C6 (◆ black diamond) turns into a box unless it is followed by a Japanese character.
So "◆Text" displays the diamond as a box, but "◆あ" displays the diamond correctly.
Another example: "ↂℵ✁Text text" displays boxes instead of the three characters, but if you put a Japanese character at the end ("ↂℵ✁Text textあ"), it suddenly displays correctly.
I am on Windows XP, system locale is set to Japanese, and I'm using the Courier New font.
The problem seems to depend on which font is selected. If I select a font like Arial, Times New Roman, Comic Sans MS, or Courier New, I get boxes. But if I select a font like Microsoft Sans Serif, Meiryo UI, Lucida Sans Unicode, or DejaVu Sans, there are no problems.
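
For anyone trying to reproduce this, the two contrasting strings from the report can be built as UTF-8 with a small helper. This is only a sketch: encodeUtf8, failingSample, and workingSample are hypothetical names that are not part of Scintilla, and it assumes the control is in UTF-8 mode so that the bytes can be passed as ordinary document text.

```cpp
#include <string>

// Hypothetical helper (not a Scintilla API): encode one code point as UTF-8.
// No validation is done for surrogates or values above U+10FFFF.
std::string encodeUtf8(char32_t cp) {
    std::string out;
    if (cp < 0x80) {
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp < 0x10000) {
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        out += static_cast<char>(0xF0 | (cp >> 18));
        out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
    return out;
}

// U+25C6 followed by Latin text: the case reported to show a box.
std::string failingSample() {
    return encodeUtf8(0x25C6) + "Text";
}

// U+25C6 followed by U+3042 (hiragana "a"): the case reported to display correctly.
std::string workingSample() {
    return encodeUtf8(0x25C6) + encodeUtf8(0x3042);
}
```

Loading each string into the editor with one of the affected fonts selected should show the difference described above.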


    Neil Hodgson - 2012-07-27

    Windows substitutes fonts when it finds a character that cannot be displayed in the current font, but this feature does not always work. Whether it works can depend on the fonts being used as well as the character set. You could try using SCI_STYLESETCHARACTERSET to change the character set.
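
A rough sketch of how the SCI_STYLESETCHARACTERSET suggestion might be tried is below. The numeric constants are copied from Scintilla.h so the snippet compiles on its own; in a real build you would include Scintilla.h instead. SciMessage and japaneseCharsetMessages are hypothetical names used only for illustration, and choosing SC_CHARSET_SHIFTJIS for STYLE_DEFAULT is an assumption based on the reporter's Japanese locale. Nothing in this thread confirms that it resolves the problem.

```cpp
#include <vector>

// Values as defined in Scintilla.h, repeated here so the sketch is standalone.
constexpr unsigned int SCI_SETCODEPAGE = 2037;
constexpr unsigned int SCI_STYLECLEARALL = 2050;
constexpr unsigned int SCI_STYLESETCHARACTERSET = 2066;
constexpr long STYLE_DEFAULT = 32;
constexpr long SC_CP_UTF8 = 65001;
constexpr long SC_CHARSET_SHIFTJIS = 128;

// Hypothetical container for one message to send to the Scintilla control.
struct SciMessage {
    unsigned int msg;
    long wParam;
    long lParam;
};

// Hypothetical helper: the messages to send, in order.
std::vector<SciMessage> japaneseCharsetMessages() {
    return {
        // Keep the document in UTF-8 so characters such as U+25C6 can be stored.
        {SCI_SETCODEPAGE, SC_CP_UTF8, 0},
        // Ask for the Japanese character set on the default style.
        {SCI_STYLESETCHARACTERSET, STYLE_DEFAULT, SC_CHARSET_SHIFTJIS},
        // Copy the default style's attributes to all other styles.
        {SCI_STYLECLEARALL, 0, 0},
    };
}
```

On Win32, each entry would be delivered with something like SendMessage(hwndScintilla, m.msg, m.wParam, m.lParam), where hwndScintilla is the handle of the Scintilla window.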

    Here is a similar issue for Korean:

    Neil Hodgson - 2012-07-27
    • assigned_to: nobody --> nyamatongwe
    • priority: 5 --> 3
    • status: open --> open-accepted
