When I run "ant build-database", I get a "Could not locate markup file" exception, even though the build is reported as successful. What is the markup file? I couldn't figure it out. Here is the relevant output:
Creating the database
Exception in thread "main" java.io.IOException: Could not locate markup file in /mnt/wikipedia-miner-1.2.0/data/tr_dataDir
at org.wikipedia.miner.db.WEnvironment.getMarkupDataFile(Unknown Source)
at org.wikipedia.miner.db.WEnvironment.buildEnvironment(Unknown Source)
at org.wikipedia.miner.util.EnvironmentBuilder.main(Unknown Source)
Java Result: 1
Total time: 8 seconds
OK, I've solved it: the name of the XML dump file is supposed to end with "-pages-articles.xml".
For me, just renaming the XML dump file so that it ends with "-pages-articles.xml" did not solve the problem. What did solve it was placing the XML dump file in the same directory as the summary files (i.e. articleParents.csv, categoryParents.csv, childArticles.csv, etc.).
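To check both conditions before running the build, something like the following sketch may help. It is not part of Wikipedia Miner and does not reproduce its lookup logic. It only tests the two things reported in this thread: that some file name ends with "-pages-articles.xml", and that the dump sits next to the summary CSV files. The function names and the list of CSV files (just the three named above) are assumptions made for illustration.

```python
from pathlib import Path

# Suffix the dump file name is reported to need (from this thread).
DUMP_SUFFIX = "-pages-articles.xml"

# Only the summary files named in this thread; the real set is larger.
EXAMPLE_SUMMARY_FILES = [
    "articleParents.csv",
    "categoryParents.csv",
    "childArticles.csv",
]


def check_file_names(names):
    """Return a list of problems found in a list of file names (hypothetical helper)."""
    problems = []
    if not any(name.endswith(DUMP_SUFFIX) for name in names):
        problems.append("no file ending with " + DUMP_SUFFIX)
    missing = [name for name in EXAMPLE_SUMMARY_FILES if name not in names]
    if missing:
        problems.append("missing summary files: " + ", ".join(missing))
    return problems


def check_data_dir(data_dir):
    """Apply check_file_names to the regular files directly inside data_dir."""
    names = [p.name for p in Path(data_dir).iterdir() if p.is_file()]
    return check_file_names(names)
```

For example, check_data_dir("/mnt/wikipedia-miner-1.2.0/data/tr_dataDir") returns an empty list when the dump and the three example CSV files are all in that directory, and a list of messages otherwise.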