#4 Memory thrashing in extraction scripts

closed-fixed
None
9
2008-12-02
2008-12-01
David Milne
No

Since the XML dumps have grown, the extraction scripts have started failing, or at least thrashing memory until they slow to a crawl.
The problem areas are in building the anchor_summary and page_links_in files.

Discussion

  • David Milne - 2008-12-01
    • status: open --> closed
     
  • David Milne - 2008-12-02
    • status: closed --> closed-fixed
     
  • Nobody/Anonymous

    I have exactly the same kind of problem, and it is not resolved in the latest version of patchWikipediaData.pl.

     
