you can download the dump from: http://dumps.wikimedia.org/enwiki/20131001/enwiki-20131001-pages-articles.xml.bz2
and CSVs from: https://drive.google.com/file/d/0BzypdTlf-gkDNWVhTl8wNm00MWM/edit?usp=sharing
I have not done much experimenting with this, but I have noticed a problem with the getTranslations() function. In some cases instead of getting "a TreeMap associating language code with translated title for all available translations" I get a single link to the corresponding article in Wiktionary.
I asked you to do all the tests, the use of several machines in testing, how much memory each machine, thank you very much
single machine: OptiPlex 780
16 GB RAM
took about 3 days
Hello, I want to consulting you, you install in the Java API process, running the demo class WikipediaDefiner. Java a success? My abnormal, I OutOfMemoryError: Java heap space. My JVM configuration is as follows