Readme file for enwiki-20110722-csv.tar

vin
2011-09-07
2013-05-30
  • vin
    2011-09-07

    Hi,

    I have not been able to find a README file for the above dump, which contains many CSV files.

    Is there such a file that I am missing?

    Thanks for any help

    Vin

     
  • vin
    2011-09-08

    Hi Görener,

    Thanks for that info. Maybe the Readme.htm should be included in the direct dump as well.
    I have one more question: I cannot find generality.csv, although the readme file has a summary about generality.

    Any clues as to which CSV file and fields provide this info?

    Thanks a million for all the help.

    Vin

     
  • vin
    2011-09-08

    Also, when I compare the CSV files with the readme file, a lot of fields appear without any explanation.
    For example, pagelink has only 2 fields in the readme file, but the CSV file has many fields. The same goes for the page description in the readme.
    Could anybody please clarify this?

    Thanks

    Vin

     
  • Aitor Soroa
    2011-10-10

    Hi,

    First of all, thanks for this nicely written software!

    I'm also trying to understand the format of the new 1.2 CSV files. Are they documented anywhere?

    Another question: I would like to get the contexts of the anchors. Are these also present in the CSV files, or should I parse the XML dump and extract the contents from there?

    thank you in advance,
    aitor