Menu

#644 Accept UTF-16 text files for glossaries

3.1
closed-fixed
Yu Tang
5
2014-10-21
2010-10-12
No

Excel exports encoding seem to respect the data character set only when the format is "Unicode-16 encoded text file".
The result is a tab separated text file in Unicode 16 with the .txt extension. There does not seem to be a way to set the encoding to UTF-8 (at least not on Excel 2008 for Mac).

It would be easier for users if glossaries could be directly exported from Excel by using that format instead of having to later open the file in a text editor to convert that to UTF-8, since the file will then not open correctly in Excel...

Discussion

  • Didier Briel

    Didier Briel - 2010-10-12

    The .txt extension has been assigned to UTF-8 files on user request.

    To be able to accept also UTF-16 with the same extension, we have to do BOM detection.

    Didier

     
  • Didier Briel

    Didier Briel - 2010-10-12
    • milestone: 1269173 -->
     
  • Jean-Christophe Helary

    I tested today on Excel 2008 to reply to Patricia on the user group and it seems Excel has problems exporting when multiple character sets are involved. It would be nice if Excel users had an "out of the box" way to create glossaries, hence the .txt in utf-16 encoding. I understand that it involves BOM detection too since we are using the same extension.

     
  • Yu Tang

    Yu Tang - 2014-09-14

    I implemented in trunk.
    It supports only UTF-16LE. I think it is enough for Microsoft Excel.
    If UTF-16BE should be supported or I missed something, please let me know that.

     
  • Didier Briel

    Didier Briel - 2014-09-14

    Implemented in SVN (/trunk).

    It only supports UTF-16LE.

    Didier

     
  • Didier Briel

    Didier Briel - 2014-09-14
    • status: open --> open-fixed
    • assigned_to: Yu Tang
    • Group: --> 2.6
     
  • Didier Briel

    Didier Briel - 2014-10-11
    • Group: 2.6 --> 3.1
     
  • Didier Briel

    Didier Briel - 2014-10-21

    Implemented in the released version 3.1.7 of OmegaT.

    Didier

     
  • Didier Briel

    Didier Briel - 2014-10-21
    • status: open-fixed --> closed-fixed
     

Log in to post a comment.