#13 FIX AVAILABLE: Cannot parse http://rss.orf.at/futurezone.xml

closed-accepted
Morbus Iff
None
5
2003-12-12
2003-07-04
No

Cannot parse http://rss.orf.at/futurezone.xml -- I
suspect the problem is the encoding iso-8859-15 but not
sure. The feed does validate.

Discussion

  • Morbus Iff
    Morbus Iff
    2003-07-04

    • assigned_to: nobody --> morbus
     
  • Morbus Iff
    Morbus Iff
    2003-07-04

    Logged In: YES
    user_id=69804

    Correct. The next version of AmphetaDesk will include a few more
    encodings (from XML::Parser::Encodings), but -15 won't be one of
    them, since an encoding map has never been made for it. In
    asking around on various lists a month or so ago, it appears we
    can make our own by using some of the tools already provided
    with XML::Parser::Encoding. In particular, this reply from Steve
    Hay (steve.hay@uk.radan.com) seemed most relevant:

    -----
    There's a directory of mappings at
    ftp://ftp.unicode.org/Public/MAPPINGS/ISO8859/
    including iso-8859-15.

    I think you can build the binary encoding file necessary for
    XML::Parser from those files using make_encmap and
    compile_encoding, which come with XML::Encoding.
    -----

     
  • Logged In: YES
    user_id=365576

    Now if I attached the iso-8859-15 mapping might you
    consider adding it to the next distribution? Thanks for the
    pointer to the MAPPINGS, building the encoding file with this
    was pretty simple (in case someone wants to build another
    encoding):

    * Download encoding.txt from
    ftp://ftp.unicode.org/Public/MAPPINGS/
    * Run make_encmap encoding encoding.txt >
    encoding.encmap
    * Add expat="yes" on the document element in
    encoding.encmap
    * Run compile_encoding -o encoding.enc encoding.encmap

     
  •  
    Attachments
  • Morbus Iff
    Morbus Iff
    2003-07-05

    Logged In: YES
    user_id=69804

    I'd be most inclined to add it to the next version, yes. Have you
    (or do you know how) to test the encoding map you've created? If
    not, here's how you would:

    a) close AmphetaDesk.
    b) go to AmphetaDesk/lib/XML.
    c) add a new directory called "Parser".
    d) Within the "Parser" directory,
    add a new directory called "Encodings".
    e) Name your encoding iso-8859-15.enc.
    f) Place it into this directory.
    g) Restart Ampheta, try the feed again.

    Lemme know, eh? If it parses fine, either email me, or attach it to
    the ticket (I'd prefer both, if possible).

     
  • Logged In: YES
    user_id=365576

    The encoding is attached and working fine for ISO-8859-15 feeds.

     
  • Morbus Iff
    Morbus Iff
    2003-12-12

    • status: open --> closed-accepted
     
  • Morbus Iff
    Morbus Iff
    2003-12-12

    Logged In: YES
    user_id=69804

    The following encodings have been added to the AmphetaDesk
    CVS: from XML::Parser::Encodings: big5, euc-kr, iso-8859-2
    through -9, windows-1250 and 1252, x-euc-jp-jisx0221,
    x-euc-jp-unicode, x-sjis-cp932, x-sjis-jdk1117,
    x-sjis-jisx0221, x-sjis-unicode. from Klaus Johannes Rusch:
    iso-8859-15.