The RDP Classifier is a nave Bayesian classifier that can rapidly and accurately provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. More information can be found at http://rdp.cme.msu.edu/.
Be the first to post a text review of RDP Classifier. Rate and review a project by clicking thumbs up or thumbs down in the right column.
Previous release would classify sequences with no valid words. Changed to require new variable MIN_GOOD_WORDS and set it to be MIN_SEQ_LEN - Sequence.WORDSIZE.
The version 2.0 has only minor changes over the version 1.0. The minimum sequence length is reduced to 50 bases due to short reads produced by pyrosequencing technique. The Hierarchy is now based on the new phylogenetically consistent higher-order bacterial taxonomy (TOBA Release 7.7) proposed by Garrity et. al, with additional major rearrangements that have been proposed for the Firmicutes and Cyanobacteria. It also includes published informal classifications for well-defined lineage with few cultivated members, such as Acidobacteria, Verrucomicrobia and OP11. Additional information for the classification of chloroplasts, Korarchaeota, and Nanoarchaeum were taken from the NCBI taxonomy.
The RDP Classifier is a nave Bayesian classifier that can rapidly and accurately provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. More information can be found at http://rdp.cme.msu.edu/. Environmental gene library analysis has become a mainstream tool in microbial ecology due to the rapid improvements in high-throughput sequencing technology. The small-subunit rRNA gene is often targeted for library construction because of its uses as a phylogenetic marker and in bacterial identification. However, tools for simple and rapid identification of the large numbers of sequences have been lacking. Here we present a nave Bayesian classifier (RDP Classifier) which can rapidly and accurately classify bacterial 16s rRNA sequences into the new higher-order taxonomy proposed by Bergeys Trust. It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The RDP Classifier is suitable both for analysis of single rRNA sequences and for analysis of libraries of thousands of sequences. This Classifier is fast, does not require sequence alignment and works well with partial sequences. The Classifier and related software were written in Java (API v1.4.1) and have been tested on the Solaris (2.8), Linux (2.4.23) and Macintosh (OS 10.3) operating systems using Java virtual machines from Sun and Apple. The compiled RDP Classifier library includes the data from Bergey's bacterial taxonomy outline release 5.0 as the default trained data. To run the Classifier with the default settings, simply provide a file with one or more bacterial 16S rRNA sequences in FASTA, Genbank or EMBL format. The classification results will be written to an output file. The RDP Classifier is not limited to using the bacterial taxonomy proposed by the Bergey's editors. It worked equally well when trained on the NCBI taxonomy. Two files are needed to train the Classifier: one taxonomy file describing the relationships between the taxa, and another file containing trusted seqeunces, each labeled with taxonomic assignments, from domain to genus level. To continue improving RDP Classifier, we now need your help in testing them. Please download the package and try to use it. Please contact rdpstaff@msu.edu or (517) 432-4998 if you have any questions or suggestions. This research was supported by the Office of Science (BER), U.S. Department of Energy, Grant No. DE-FG02-99ER62848 and the National Science Foundation, Grant No. DBI-0328255.
Be the first person to add a text review.
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use
Thanks for your rating!
Would you also like to write a review?