|
From: Ed R. <ero...@ug...> - 2006-05-03 20:41:20
|
I had to go back and check on this. Intron is not supported
by the Unflattener the way it is invoked by
InsertSequenceFeatures. When we invoke the unflattener,
because the exons in CryptoDB's gbfile were implicit, we set
the unflattener attribute use_magic to true. This creates
explicit representations of the exons implicitly present in
the encoding of our gene locations.
Unfortunately, use_magic also requires that we set the
structure type to "0" GENE->mRNA->exon
->CDS
To get the introns which are specifically encoded in your GB
file, you will need to set the structure type to 1 when the
unflattener is invoked. I made a number of changes to the
plugin for some colleagues a while back so they could
dynamically choose whether or not to use magic and which kind
of structure they wanted to load. These changes were never
checked into the official GUS release because they involved
significant changes to the commandline interface. I will dig
up these changes and show you where to make the necessary
modifications.
-ed
---- Original message ----
>Date: Tue, 2 May 2006 11:23:55 -0400
>From: "Kumar, Sanjeev" <San...@ng...>
>Subject: [GUSDEV] InsertSequenceFeature Issue in GUS3.5
>To: <gus...@li...>
>
> Hi Group,
> I am having little problem with
> InsertSequenceFeature in our environment.
> It is loading all features from the genBank file
> except "intron" feature. I tried to look into the
> code and found that it is all together skipping the
> "intron" feature from genBank file. Is that the
> issue with "BioPerl"?
>
> Did any one face this issue earlier?
>
> The version of bioperl in our environment is :1.5.0
> And I am using very first version of GUS3.5. Do we
> have any subsequent release like GUS3.5.X?
> And the file I am trying to load is related to
> MICROSPORIDIA organism (NC_003238.gbf).
> ------
> The genBank2gus.xml has entry for the intron like
> this:
> <feature name="intron" table="DoTS::IntronFeature"
> so="intron">
> <qualifier name="allele" lost="true"/>
> <qualifier name="citation"/>
> <qualifier name="cons_splice"/>
> <qualifier name="evidence"/>
> <qualifier name="function"/>
> <qualifier name="gene" specialcase="gene"/>
> <qualifier name="label"/>
> <qualifier name="locus_tag" column="source_id"/>
> <qualifier name="map"/>
> <qualifier name="note" specialcase="note"/>
> <qualifier name="number" column="num"/>
> <qualifier name="old_locus_tag" lost="true"/>
> <qualifier name="standard_name"/>
> <qualifier name="usedin"/>
> <qualifier name="db_xref" specialcase="dbxref"/>
> </feature>
>
> And the intron entry in genBank file looks like
> this:
> gene complement(52254..52444)
> /db_xref="GeneID:860376"
> /locus_tag="ECU09_0395"
> CDS
> join(complement(52254..52409),complement(52442..52444))
> /db_xref="GI:19173160"
> /db_xref="GeneID:860376"
> /locus_tag="ECU09_0395"
> /codon_start=1
> /protein_id="NP_596963.1"
>
> /translation="MGSRKTALVKTRLMRALKRNREIPAWKRMMKEHKGEYNRARRHW
> RSKKLKIY"
> /product="60S RIBOSOMAL PROTEIN
> L39"
> misc_feature
> join(complement(52260..52409),complement(52442..52444))
> /db_xref="CDD:11875"
> /locus_tag="ECU09_0395"
> /note="RPL39; Region: Ribosomal
> protein L39E
> [Translation, ribosomal
> structure and biogenesis]"
> intron complement(52410..52441)
> /locus_tag="ECU09_0395"
> /note="intron L39"
>
> Any help will be appreciated.
>
> Thanks
> Sanjeev Kumar
> Northrop Grumman
-----------------
Ed Robinson
Center for Tropical and Emerging Global Diseases
University of Georgia, Athens, GA 30602
ero...@ug.../(706)542.1447/254.8883
|