Dump extraction: csv files doubt

Jairo
2012-02-17
2013-05-30
  • Jairo

    Jairo - 2012-02-17

    Hello all!,

    I'm extracting spanish dump csv files in Wikipedia Miner 1.2 with hadoop and, in my page.csv file, the pages with type 4 contains depth -1 (it's means not found). I don't know that means type 4.

    Could someone tell me what this means?

    Thank you very much!,

    Jairo

     
  • David Milne

    David Milne - 2012-02-26

    Hi Jairo,

    A depth of -1 just means that the page cant be navigated to from the root category you chose when creating your language config file. This is normal for templates and redirects. As long as articles (type 0) and categories (type 1) have a depth, then everything is working properly.

    - Dave

     

Log in to post a comment.