Reading the anchor.csv I found that there are a few XML-encoded characters that should be converted to UTF-8 or replaced by other characters.
Here is an example line (from the Italian dump) with encoded characters: "18° Grande Prêmio do Brasil",65385,1,0
Another critical example is the following:
In my opinion should be replaced with a normal space and these two entries should collapse into "Paratore Giuseppe",63203,6,0.
Log in to post a comment.