-
I'm interested in helping with this. I have several years' experience parsing data from the English Wiktionary. Does DBpedia have an IRC channel?
2009-11-03 03:31:18 UTC in DBpedia - Wikipedia Data Extraction
-
Language infoboxes have language family fields fam1...fam15.
The order of these fields encodes the hierarchy of the language family tree.
DBpedia extracts the fields as multiple "fam" elements in an arbitrary order, with no sequence data.
For example, the article "Middle Welsh" has this:
|fam1=[[Indo-European languages|Indo-European]]
|fam2=[[Celtic languages|Celtic]]
|fam3=[[Insular...
2009-11-03 03:20:02 UTC in DBpedia - Wikipedia Data Extraction
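To illustrate what preserving the sequence could look like, here is a minimal sketch in Python. It is not DBpedia's extraction code: the function name `ordered_families` and both regular expressions are hypothetical, and it assumes each famN field sits on its own line in the infobox wikitext. It sorts the fields by their numeric suffix so that fam1 (the root of the family tree) comes first.

```python
import re

# Hypothetical patterns for this sketch; not taken from DBpedia's extractor.
# Matches lines such as "|fam2=[[Celtic languages|Celtic]]".
FAM_FIELD = re.compile(r"^\|[ \t]*fam(\d+)[ \t]*=[ \t]*(.*?)[ \t]*$", re.MULTILINE)
# Matches "[[Target|Label]]" or "[[Target]]" and captures the displayed text.
WIKILINK = re.compile(r"\[\[(?:[^\]|]*\|)?([^\]]*)\]\]")


def ordered_families(infobox_text):
    """Return family labels ordered from fam1 (root) downward."""
    fields = [(int(n), value) for n, value in FAM_FIELD.findall(infobox_text)]
    # Sort numerically so that fam10 comes after fam2, not before it.
    fields.sort(key=lambda pair: pair[0])
    return [WIKILINK.sub(r"\1", value) for _, value in fields]


# The two complete fields quoted from "Middle Welsh", deliberately shuffled.
sample = (
    "|fam2=[[Celtic languages|Celtic]]\n"
    "|fam1=[[Indo-European languages|Indo-European]]\n"
)
print(ordered_families(sample))
```

Keeping the numeric suffix alongside each extracted value, rather than discarding it, would be enough to let consumers rebuild the hierarchy.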
-
Data on the meanings and translations of words in many languages could be extracted from Wiktionary.
Pro: Wiktionary data is more structured than Wikipedia data.
Con: Each language edition of Wiktionary has its own format.
2009-10-29 03:12:44 UTC in DBpedia - Wikipedia Data Extraction