Menu

#757 Display error using font "Dialog Input"

3.1
closed-works-for-me
nobody
5
2015-08-17
2015-08-01
eraser
No

OmegaT does not display the German Umlaute (öäü) correctly when using font "Dialog Input" (see screenshot attached). If I switch so another font, erverything looks fine.

A also attached the whole project for reproduction.

If it's a geneal issue with Dialog Input, this font should probably be removed from the list.

The strange thing is, that I am using this font for quite some time and never had this trouble.

Using OmegaT 3.1.9_1

1 Attachments

Related

Bugs: #758

Discussion

  • Aaron Madlon-Kay

    Dialog Input is a logical font defined by your JRE's fontconfig files.

    Not all fonts contain all glyphs; it is not reasonable to remove some fonts, especially JRE-defined ones, just become some glyphs aren't present.

    You can adjust your fontconfigs if you like, or simply use a different font.

     
  • Aaron Madlon-Kay

    Since you say you were using this font without problem before, I suggest the issue is with your JRE or the fonts on your system.

     
  • eraser

    eraser - 2015-08-01

    Are you able to reproduce the behavior? The strange thing is, that eg. "ä" is displayed correctly, while "ü" is not. Both are in the same set of chars (ANSI). That's why I am asking myself if it's a bug.

     
  • Aaron Madlon-Kay

    No, I'm not able to reproduce the issue on my system (OS X 10.10.4, Java 1.8, OmegaT 3.5.1 trunk).

     
  • Didier Briel

    Didier Briel - 2015-08-01

    I cannot reproduce it either: Windows 7, Java 1.8, OmegaT 3.5 update 1. I close as "works for me".

    Didier

     
  • Didier Briel

    Didier Briel - 2015-08-01
    • status: open --> closed-works-for-me
     
  • Yu Tang

    Yu Tang - 2015-08-01

    I can reproduce here.
    Windows 8.1 (Japanese) + Oracle Java 1.7.0_51 + OmegaT 3.5u1.

     
    • Jean-Christophe Helary

      Could it be related to the source format ? I can't reproduce with the HTML source here:
      https://de.wikipedia.org/wiki/Der_Spiegel

       

      Last edit: Aaron Madlon-Kay 2015-08-02
      • Aaron Madlon-Kay

        No. The only way it could have anything to do with the source format is if OmegaT was guessing the encoding wrong, but the provided test case is a .docx which requires no guessing.

         
      • Didier Briel

        Didier Briel - 2015-08-03

        Based on Aaron's findings, my understanding is that it doesn't happen with "real" characters with umlauts, such as the ones you have in your source documents or the ones I type on my Azerty keyboard, but with "composed" ones.

         
  • Aaron Madlon-Kay

    I can reproduce on Windows 8.1 (Java 1.8, OmegaT 3.5).

    The Windows JRE defines DialogInput as dialoginput.plain.alphabetic=Courier New so the issue is really with Courier New. Checking other apps, I find that Courier New will correctly show ü (copied from the provided source) in most programs, but in jEdit we see the same issue. Thus it is likely Java-specific.

    Looking at the source document internals, the difference between the ä and the ü is that the former is U+00E4 LATIN SMALL LETTER A WITH DIAERESIS while the latter is <U+0075 LATIN SMALL LETTER U U+0308 COMBINING DIAERESIS>. For some reason the latter is getting split up, but only with this font.

    I have no idea why Java and Courier New are misbehaving on Windows. But the solutions are:

    • Make sure your source text is NFC Normalized, or
    • Use a different font

    There may be a case for having OmegaT normalize the text on parse as well.

     

    Last edit: Aaron Madlon-Kay 2015-08-02
  • Aaron Madlon-Kay

    I created a new ticket about the normalization issue here: [#758]

     

    Related

    Bugs: #758


    Last edit: Aaron Madlon-Kay 2018-02-27
  • Aaron Madlon-Kay

    The "real" bug here (decomposed characters not rendered correctly in Courier New in Java on Windows) cannot be addressed in OmegaT, but the fix for [#758] effectively works around this particular issue.

     

    Related

    Bugs: #758


    Last edit: Aaron Madlon-Kay 2018-02-27

Log in to post a comment.

MongoDB Logo MongoDB