|
From: Deborah P. <pi...@pc...> - 2005-07-08 17:41:57
|
And finally I see your attachment and it looks like it is in the right=20
format to be loaded with LoadNRDB. However, if the defline only has a=20
single source_id, I think I would use LoadFastaSequences.pm to load your=20
sequences into dots.AASequence or one of its views. This plugin is=20
generic and loads your choice of sequence tables with source_id, etc=20
defined using regex. I think you will find that more satisfactory.
=
=20
Debbie
Deborah Pinney wrote:
> I reread your e-mail and am guessing that your file is a file of=20
> sequences (query or subject ?) with information on the defline=20
> pertaining to a BLAST analysis. Perhaps you can send an excerpt of=20
> your file and a bit more explanation. Perhaps we can make some better=20
> suggestions.
>
> =
=20
> Debbie
>
>
>
> Deborah Pinney wrote:
>
>> LoadNRDB.pm is not the appropriate plugin to use to load BLAST=20
>> results into GUS. It is specifically designed to load defline and=20
>> sequence information from NCBI's non redundant protein database=20
>> files. It uses the gitax file to get taxon information for each of=20
>> the gi numbers in the defline so as to attach a taxon_id. Therefore,=20
>> LoadTaxon would have to be run with NCBI's taxonomy files prior to=20
>> running LoadNRDB.
>>
>> There is at least one plugin that can be used to load BLAST results,=20
>> InsertBlastSimilarities.pm, which is one of the GUS/Supporterd=20
>> plugins. Take a look at that and see if it serves your purposes.
>> =
=
=20
>> Debbie
>>
>> =20
>>
>>
>> Michael Saffitz wrote:
>>
>>> Hi Debbie,
>>>
>>> Can you help with this? I think we should start with an overview of=20
>>> what
>>> LoadNRDB is doing-- specifically how it handles the gitax file. We=20
>>> can then
>>> figure out if Fabricio's issue is in the data or code.
>>>
>>> Please include gusdev in your reply-- I think it will be useful=20
>>> information.
>>>
>>> --Mike
>>>
>>> ------ Forwarded Message
>>> =20
>>>
>>>> From: Fabr=EDcio <fab...@de...>
>>>> Date: Thu, 9 Jun 2005 16:38:44 -0300
>>>> To: <gus...@li...>
>>>> Subject: [GUSDEV] Load a Blast result using LoadNRDB
>>>>
>>>> Hello all,
>>>>
>>>>
>>>> We=B9re trying to insert a Blast result into Gus Schema using LoadNR=
DB
>>>> plug-in. Our blast result was filtered and is in a Fasta format=20
>>>> according to
>>>> the plug-in requirement. The problem is the load process takes many=20
>>>> times
>>>> and our postgres server process crashes after some hours, thus the=20
>>>> plug-in
>>>> never concludes. I would like to know why this plug-in is so slow,=20
>>>> if the
>>>> file we want to store is not so big. I think that it=B9s due to the
>>>> =ADgitax=3Dgi_taxid_prot.dmp parameter that has a large file.
>>>>
>>>>
>>>>
>>>> Does anyone could help us to understand this?
>>>>
>>>>
>>>>
>>>> Thanks a lot,
>>>>
>>>>
>>>>
>>>> Fabr=EDcio.
>>>>
>>>> =20
>>>
>>>
>>>
>>> ------ End of Forwarded Message
>>> =20
>>>
>>
>>
>>
>> -------------------------------------------------------
>> This SF.Net email is sponsored by the 'Do More With Dual!' webinar=20
>> happening
>> July 14 at 8am PDT/11am EDT. We invite you to explore the latest in du=
al
>> core and dual graphics technology at this free one hour event hosted=20
>> by HP, AMD, and NVIDIA. To register visit=20
>> http://www.hp.com/go/dualwebinar
>> _______________________________________________
>> Gusdev-gusdev mailing list
>> Gus...@li...
>> https://lists.sourceforge.net/lists/listinfo/gusdev-gusdev
>
>
|