Download Latest Version GAAS-0.17.tar.gz (162.1 kB)
Email in envelope

Get an email when there's a new version of GAAS

Home / gaas-data / 2009
Name Modified Size InfoDownloads / Week
Parent folder
ncbi_refseq_complete_microbes.fa.gz 2012-01-17 857.5 MB
VPT.tre 2012-01-16 50.5 kB
README.txt 2012-01-16 1.2 kB
taxids.txt 2012-01-16 320.9 kB
ncbi_refseq_complete_viruses.fa.gz 2012-01-16 19.3 MB
ncbi_refseq_complete_protozoa.fa.gz 2012-01-16 51.9 MB
ITOL.tre 2012-01-16 4.5 kB
Totals: 7 Items   929.2 MB 0
This directory contains files useful for GAAS (http://sourceforge.net/projects/gaas)

The *.fa files are FASTA nucleic sequence files obtained from NCBI Refseq
11/28/2008 (RefSeq Release 32) (ftp://ftp.ncbi.nih.gov/refseq/release/). All
sequences whose title contained the following words were removed: shotgun,
contig, partial, end, part. Thus the resulting FASTA files contain complete
nucleic sequences (from viruses, microbes and protozoa).

The taxids.txt file contains the accession number, taxon ID and taxon name of
the sequences present in the FASTA files based on the NCBI taxonomy
(ftp://ftp.ncbi.nih.gov/pub/taxonomy/). The taxon ID of the following type of 
sequences was removed unless the main genome was also present: plasmid,
transposon, chloroplast, plastid, mitochondrion, apicoplast, macronuclear,
cyanelle and kinetoplast.

The *.tre files are Newick files representing the Interactive
Tree Of Life (ITOL) (http://itol.embl.de/) and the Viral Proteomic Tree (VPT),
an updated version of the Phage Proteomic Tree (PTP)
(http://phage.sdsu.edu/~rob/phage_tree/) that includes all viruses (phage and
eukaryotic viruses). The node IDs are the NCBI taxon ID.



Source: README.txt, updated 2012-01-16