From: Aaron J. M. <am...@pc...> - 2005-07-08 14:00:54
|
On Jul 8, 2005, at 9:43 AM, Robert Olson wrote: > In the SEED, we support having multiple genomes installed for any > given taxon. For instance, we have a genome we tag as 630.1 which > is Yersinia enterocolitica from NCBI Refseq, and 630.2 which is > Sanger's version of that organism. It was discussed in the workshop > to use the external database version to tag the difference between > these in GUS, but that seems clumsy to me (since I may want that to > tag overall SEED data versions). I'm curious what the rank > attribute on DoTS.Taxon is intended to be used for; it seems > possible I could subvert that to use as the organism version > indicator. > You have multiple genomes for one organism. That one organism is in DoTS.Taxon, and it's rank is what the NCBI taxonomy database specifies as "genus", "phylum", "subspecies", etc. You want each version of the genome (i.e. the nasequence entries) "tagged" by the different database release versions. You don't have to further "tag" the associated data, as they are already tied to the nasequence entries, so it's not as clumsy as it may seem at first. -Aaron -- Aaron J. Mackey, Ph.D. Project Manager, ApiDB Bioinformatics Resource Center Penn Genomics Institute, University of Pennsylvania email: am...@pc... office: 215-898-1205 fax: 215-746-6697 postal: Penn Genomics Institute Goddard Labs 212 415 S. University Avenue Philadelphia, PA 19104-6017 |