#555 Tidy not ready for Internationalized Domain Names (IDN)

open-invalid
5
2004-04-02
2004-03-28
Per Ångström
No

Tidy does not allow me to make links using
Internationalized Domain Names (IDN), like this:
<a href="http://www.räksmörgĺs.se/">A link</a>

Tidy converts this to:
<a
href="http://www.r%C3%A4ksm%C3%B6rg%C3%A5s.se/">A link</a>

The result is a link that cannot be loaded by Mozilla
or Opera.

My configuration is:
tidy-mark:no
markup:yes
char-encoding: latin1
enclose-text: yes
wrap: 0
quiet: true
output-xhtml: no
add-xml-decl: no
gnu-emacs: true

If I add "fix-uri:false" to my config, Tidy will output
a warning:
Warning: <a> improperly escaped URI reference

I find this warning inappropriate, since the URI is
valid. Tidy should not issue this warning for the
domain-name URI component if the domain is valid
according to the IDN rules.

I have a strict no-warning policy, so when Tidy issues
a warning it causes my publishing machinery to halt. If
I somehow suppress this particular warning I run the
risk of ignoring cases when other parts of a URI are
truly illegal.

Discussion

  • Per Ångström
    Per Ångström
    2004-03-28

    Test case

     
    Attachments
    • labels: --> HTML/XHTML Parser
    • status: open --> open-invalid
     
    • milestone: --> Current - all platforms
    • assigned_to: nobody --> hoehrmann