I have not been able to find a Read Me file for the above dump which has many csv files.
Is there any such file and I am missing it ..
Thanks for any help
you can find from
firstly download and excract the file…
Read Me file name is Readme.htm
Thanks for that info. May be the Readme.htm should be available in the direct dump as well.
I had one more doubt. I am not able to find the generality.csv though the readme file has a summary about generality.
Any clues on which csv file, fields give this info.
Thanks a million for all the help.
Also, when I look at the csv files and the readme file, there are lot of fields appearing without any explanation.
For example, pagelink has only 2 fileds in the readme file, but the csv file has many fields. same with the page explanation on readme.
I kindly request anybody to clarify on this.
first of all, thanks for this nicely written software!
I'm also trying to understand the format of the new 1.2 csv files. Are they documented anywhere ?
Another question, I would like to get the contexts of the anchors. Are these also present on the csv? Or should I parse the xml dump and extract the contents from there ?
thank you in advance,