From: MICHAEL L. <lu...@cs...> - 2003-05-27 14:58:42
Hello -- I used the following script to get some data out of GUS:

    #!/usr/bin/perl
    use strict;
    use warnings;
    use GUS::ObjRelP::DbiDatabase;
    use GUS::Common::GusConfig;

    # Read the connection settings from the GUS properties file.
    my $gusconfig = GUS::Common::GusConfig->new("/home/luchtan/gus.properties");
    my $db = GUS::ObjRelP::DbiDatabase->new($gusconfig->getDbiDsn(),
                                            $gusconfig->getReadOnlyDatabaseLogin(),
                                            $gusconfig->getReadOnlyDatabasePassword(),
                                            1, 1, 1,
                                            $gusconfig->getCoreSchemaName());
    my $dbh = $db->getQueryHandle();

    # The first column is the FASTA defline ">na_sequence_id name" plus a newline.
    my $idStmt = $dbh->prepare("select '>'||na_sequence_id||' '||name||'\n' name_str,sequence from dots.externalnasequence");
    $idStmt->execute();

    open(TEMP, ">", "retrievedSeqForBLAST.tmp")
        or die "Cannot open retrievedSeqForBLAST.tmp: $!";
    while (my (@row) = $idStmt->fetchrow_array()) {
        # Join without a separator; "@row" would put a space before the sequence.
        print TEMP join('', @row), "\n";
    }
    close(TEMP);

and blasted the resulting retrievedSeqForBLAST file.

Now I want to load the results into GUS using the plugin
LoadBlastSimilaritiesPK. This plugin requires a subject_table and a
query_table. Perhaps it is a question of my unfamiliarity with the whole
biology aspect of what is going on here (I'm just a programmer), but I am
not sure what these tables are for. Can anybody help me out?

Michael Luchtan
http://www.cs.uga.edu/~luchtan

On Thu, 22 May 2003, Jonathan Crabtree wrote:

>
> Jessie-
>
> Jessica Kissinger wrote:
> > If memory serves me correctly, once sequences were loaded into GUS, we
> > then retrieved them along with their GUS ID to submit for blast searches.
>
> Yes, I think that's right.
>
> > When we retrieved the sequences, we created a custom format for the
> > header line, such that the blast results once generated for these
> > sequences could be easily parsed and loaded with the existing plug-in.
> >
> > Can someone tell me what the format of the fasta header should be,
> > i.e., is it ">GUSID, External_NA _sequence Name" or the other way around
> > and should there be any formatting, tabs, spaces etc. If I remember
> > correctly, the blast results were loaded by GUSID not "name", but I
> > don't remember.
>
> My recollection is that the defline started as you said, with ">GUSID ".
> I don't believe that the format is crucial, because (again, from what
> I remember) when you run the plugin to load the BLAST similarities you
> supply it with a regular expression that it uses to pick the GUSID
> (an na_sequence_id for most of the PlasmoDB searches) out of the defline.
> So as long as the regex matches the defline format, you should be OK,
> and I don't think that the plugin uses anything on the defline except
> for the GUSID.
>
> Jonathan
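
For illustration, the regex-based ID extraction Jonathan describes can be
sketched in a few lines of Perl. This is only a sketch under assumptions: the
pattern, the helper name extract_gus_id, and the sample deflines are all
hypothetical and are not taken from the plugin's source. The only thing taken
from the thread is that the defline begins with the na_sequence_id, as in the
script above.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical pattern: capture the digits at the start of the defline,
# allowing for an optional leading '>'. The plugin is given its regex by
# the user; this one is only an assumption matching ">GUSID name".
my $id_regex = qr/^>?(\d+)/;

# Return the captured ID, or undef when the defline does not match.
sub extract_gus_id {
    my ($defline, $regex) = @_;
    return $defline =~ $regex ? $1 : undef;
}

# Made-up deflines for illustration only.
my @deflines = (
    '>12345 some_sequence_name',
    '12345 some_sequence_name',     # same defline without the leading '>'
    '>some_sequence_name 12345',    # ID is not first, so the pattern fails
);

for my $defline (@deflines) {
    my $id = extract_gus_id($defline, $id_regex);
    printf "%-30s => %s\n", $defline, defined $id ? $id : '(no match)';
}
```

If the deflines had been written with the name first, the same approach would
still work with a different pattern, which is the point of supplying the regex
rather than fixing the header format.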