Menu

#201 Write a new cvterm OBO loader - maybe

eventually
open
nobody
None
3
2013-11-29
2013-07-06
No

The current GMOD method using Stag is quite slow - an hour or two for GO and FYPO. (Loading the cvtermpath table takes another 10-ish hours after that).

If no one has written a new loader, maybe we should.

Notes to self:
try reading all the current cvterm table contents into memory about then read all the OBO files into memory - we have lots of memory, let's use it.

needs to update cvterm_relationship too

don't use DBIx::Class

use the COPY FROM support in DBD::Pg to make loading fast: http://search.cpan.org/~turnstep/DBD-Pg/Pg.pm#pg_putcopydata

see also: http://tapoueh.org/blog/2013/03/15-batch-update

Discussion

  • Valerie Wood

    Valerie Wood - 2013-09-10
    • Priority: 5 --> 3
     
  • Kim Rutherford

    Kim Rutherford - 2013-11-29

    We should load the replaced_by tags from the OBO files for this ticket:

    https://sourceforge.net/p/pombase/chado/271/

     

Log in to post a comment.