From: Steve F. <sfi...@pc...> - 2004-12-09 19:11:47
|
folks- the UGA folks and CBIL folks have started collaborating on a new plugin called LoadAnnotatedSeqs. It will use BioPerl to parse the input data. We expect it to take annotated sequences (NA at first) in genbank, tigr xml and embl formats (plus any others supported by the bioPerl parser). It will take an XML file that describes the mapping from the input features to GUS features, and SO features. It will also hard code special cases to handle qualifer data that is distributed to tables outside of the NAFeature tables. For our projects we will be developing a mapping that unifies the semantics of the data we are getting from our different sources and formats. (we plan to work with the PSU folks to incorporate the knowledge they have acquired in their work to make an EMBL parser) ideas and suggestions are encouraged. steve |