Menu

Special characters not converted correctly

Nick Bart
2012-12-14
2012-12-15
  • Nick Bart

    Nick Bart - 2012-12-14

    Some special characters do not seem to be converted correctly.

    biblatex file (utf-8):

        Title = {Testing em-dash —, en-dash –, d/s curly quotes “” ‘’, ellipsis …, d/s guillemets «» ‹›, and a few accented chars áàâäåæ},
    

    biblatex2xml output:

    <titleInfo>
        <title>Testing em-dash —, en-dash –, d/s curly quotes “” ‘’, ellipsis 
, d/s guillemets «» ‹›, and a few accented chars áàâÀåÊ</title>
    </titleInfo>
    
     
  • Chris Putnam

    Chris Putnam - 2012-12-14

    The current default input character set is iso8859-1. If your input file is utf8-encoded unicode, then you need to use the "-i unicode" command line switch.

    The default output for MODS is utf8-encoded unicode, and the default output for the xml2XXX converters is iso8859-1.

    The defaults will switch to all utf8-encoded unicode pretty soon, but not in the 4.16 timeframe. When that happens I will be bumping the major version number to 5 to reflect the change in the user interface.

     
  • Nick Bart

    Nick Bart - 2012-12-15

    Thank you. My mistake assuming everything is in utf8 today.

     

Log in to post a comment.