#4 Memory thrashing in extraction scripts

closed-fixed
David Milne
None
9
2008-12-02
2008-12-01
David Milne
No

Since the xml dumps have gotten bigger, the extraction scripts have started failing, or at least thrashing around memory until they slow completely down.
The problem areas are in building the anchor_summary and page_links_in files

Discussion

  • David Milne
    David Milne
    2008-12-01

    • status: open --> closed
     
  • David Milne
    David Milne
    2008-12-02

    • status: closed --> closed-fixed
     
  • I have exactly the same kind of problem and it's not resolved in the last version of patchWikipediaData.pl