Emails and Web pages with non-ASCII Latin-1 characters are not filed by the software. This happens because they cause a MySQL error at insertion time, which is then caught by the try-except mechanism, which causes the filing logic to skip the problem record. Non-ASCII Latin-1 characters are invalid in URLs, but the software should be properly escaping and filing them if they are entered. The underlying technical problem is inconsistent use of XML escaping and encoding. For example, artist and author names are handled as follows: "data = XMLunescape(artist.firstChild.data.encode('iso-8859-1'))" while emails and URLs simply use "address = webpage.firstChild.data"
Anonymous
Diff:
Fixed in:
Installed in r2105-136 on 2015-09-15. Closing.