| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| ncbi_refseq_complete_microbes.fa.gz | 2012-01-17 | 857.5 MB | |
| VPT.tre | 2012-01-16 | 50.5 kB | |
| README.txt | 2012-01-16 | 1.2 kB | |
| taxids.txt | 2012-01-16 | 320.9 kB | |
| ncbi_refseq_complete_viruses.fa.gz | 2012-01-16 | 19.3 MB | |
| ncbi_refseq_complete_protozoa.fa.gz | 2012-01-16 | 51.9 MB | |
| ITOL.tre | 2012-01-16 | 4.5 kB | |
| Totals: 7 Items | | 929.2 MB | 0 |
This directory contains files useful for GAAS (http://sourceforge.net/projects/gaas).

The *.fa.gz files are gzip-compressed FASTA nucleic acid sequence files obtained from NCBI RefSeq, 11/28/2008 (RefSeq Release 32) (ftp://ftp.ncbi.nih.gov/refseq/release/). All sequences whose title contained any of the following words were removed: shotgun, contig, partial, end, part. The resulting FASTA files therefore contain only complete nucleic acid sequences (from viruses, microbes and protozoa).

The taxids.txt file lists the accession number, taxon ID and taxon name of each sequence present in the FASTA files, based on the NCBI taxonomy (ftp://ftp.ncbi.nih.gov/pub/taxonomy/). The taxon IDs of the following types of sequences were removed unless the main genome was also present: plasmid, transposon, chloroplast, plastid, mitochondrion, apicoplast, macronuclear, cyanelle and kinetoplast.

The *.tre files are Newick files representing the Interactive Tree Of Life (ITOL) (http://itol.embl.de/) and the Viral Proteomic Tree (VPT), an updated version of the Phage Proteomic Tree (PTP) (http://phage.sdsu.edu/~rob/phage_tree/) that includes all viruses (phage and eukaryotic viruses). The node IDs are NCBI taxon IDs.
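
For illustration only, the title-based filtering described above (removing sequences whose title contains shotgun, contig, partial, end or part) could be sketched in Python as follows. This is not the script used to build these files. The function names are hypothetical, and matching the words as whole, case-insensitive words is an assumption, since the README does not say how the matching was done.

```python
import gzip
import re

# Words listed in the README. Treating them as whole, case-insensitive
# words is an assumption of this sketch, not a documented rule.
EXCLUDED_WORDS = ("shotgun", "contig", "partial", "end", "part")
_EXCLUDED_RE = re.compile(
    r"\b(?:" + "|".join(EXCLUDED_WORDS) + r")\b", re.IGNORECASE
)


def read_fasta(lines):
    """Yield (title, sequence) pairs from an iterable of FASTA lines."""
    title, chunks = None, []
    for line in lines:
        line = line.rstrip()
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if title is not None:
                yield title, "".join(chunks)
            title, chunks = line[1:], []
        elif title is not None:
            chunks.append(line.strip())
    if title is not None:
        yield title, "".join(chunks)


def keep_complete(records):
    """Drop records whose title contains any of the excluded words."""
    for title, seq in records:
        if not _EXCLUDED_RE.search(title):
            yield title, seq


def filter_fasta_gz(path):
    """Hypothetical helper: stream kept records from a gzipped FASTA file."""
    with gzip.open(path, "rt") as handle:
        yield from keep_complete(read_fasta(handle))
```

With whole-word matching, a title such as "Virus B partial sequence" is dropped, while a title containing "endosymbiont" is kept because "end" only appears inside a longer word. A plain substring match would behave differently, so the choice matters if you try to reproduce the files.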