#11 XML-encoded characters conversion needed

open
nobody
None
5
2009-09-14
2009-09-14
Giulio Paci
No

Reading the anchor.csv I found that there are a few XML-encoded characters that should be converted to UTF-8 or replaced by other characters.
Here is an example line (from the Italian dump) with encoded characters: "18° Grande Prêmio do Brasil",65385,1,0

Another critical example is the following:
"Paratore Giuseppe",63203,4,0
"Paratore Giuseppe",63203,2,0

In my opinion   should be replaced with a normal space and these two entries should collapse into "Paratore Giuseppe",63203,6,0.

Discussion


Log in to post a comment.

Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:

JavaScript is required for this form.





No, thanks