From: Deborah P. <pi...@pc...> - 2005-07-08 17:15:58
|
LoadNRDB.pm is not the appropriate plugin to use to load BLAST results=20 into GUS. It is specifically designed to load defline and sequence=20 information from NCBI's non redundant protein database files. It uses=20 the gitax file to get taxon information for each of the gi numbers in=20 the defline so as to attach a taxon_id. Therefore, LoadTaxon would have=20 to be run with NCBI's taxonomy files prior to running LoadNRDB. There is at least one plugin that can be used to load BLAST results,=20 InsertBlastSimilarities.pm, which is one of the GUS/Supporterd plugins.=20 Take a look at that and see if it serves your purposes. = = =20 Debbie =20 Michael Saffitz wrote: >Hi Debbie, > >Can you help with this? I think we should start with an overview of wha= t >LoadNRDB is doing-- specifically how it handles the gitax file. We can = then >figure out if Fabricio's issue is in the data or code. > >Please include gusdev in your reply-- I think it will be useful informat= ion. > >--Mike > >------ Forwarded Message > =20 > >>From: Fabr=EDcio <fab...@de...> >>Date: Thu, 9 Jun 2005 16:38:44 -0300 >>To: <gus...@li...> >>Subject: [GUSDEV] Load a Blast result using LoadNRDB >> >>Hello all,=20 >> >>=20 >> >>We=B9re trying to insert a Blast result into Gus Schema using LoadNRDB >>plug-in. Our blast result was filtered and is in a Fasta format accordi= ng to >>the plug-in requirement. The problem is the load process takes many tim= es >>and our postgres server process crashes after some hours, thus the plug= -in >>never concludes. I would like to know why this plug-in is so slow, if t= he >>file we want to store is not so big. I think that it=B9s due to the >>=ADgitax=3Dgi_taxid_prot.dmp parameter that has a large file. >> >>=20 >> >>Does anyone could help us to understand this? >> >>=20 >> >>Thanks a lot, >> >>=20 >> >>Fabr=EDcio. >> >> =20 >> > >------ End of Forwarded Message > =20 > |