Thread: [Gusdev-gusdev] LoadAnnotatedSeqs plugin underway

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

folks-

the UGA folks and CBIL folks have started collaborating on a new plugin 
called LoadAnnotatedSeqs.   It will use BioPerl to parse the input data.

We expect it to take annotated sequences (NA at first) in genbank, tigr 
xml and embl formats (plus any others supported by the bioPerl parser).

It will take an XML file that describes the mapping from the input 
features to GUS features, and SO features. 

It will also hard code special cases to handle qualifer data that is 
distributed to tables outside of the NAFeature tables.

For our projects we will be developing a mapping that unifies the 
semantics of the data we are getting from our different sources and 
formats.  

(we plan to work with the PSU folks to incorporate the knowledge they 
have acquired in their work to make an EMBL parser)

ideas and suggestions are encouraged.

steve

gusdev-gusdev