Genome Database - A tool to create a local database of reference genome sequences

Usage: java path/to/GenomeDatabase.jar [options]

By Marc Strous, 2016

This tool enables you to download fasta files of protein and RNA sequences encoded
in reference genomes at NCBI. You can select relevant genomes with a set of queries.
Each query has four fields, separated by comma's. Example of a queries are:

superkingdom,Bacteria,genus,ftp
superkingdom,Archaea,genus,ftp
superkingdom,Eukaryota,phylum,ftp
superkingdom,Viruses,family,elink

The first query would download (with ftp) all available reference genomes of the
superkingdom Bacteria, limited to one genome per genus. The second query would do
the same for superkingdom Archaea. The third would download all Eukaryotic genomes,
a single representative for each phylum. The fourth would download all available
viral genomes, one representative per family, using the ncbi elink tool.

Project Activity

See All Activity >

Follow GenomeDatabase

GenomeDatabase Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of GenomeDatabase!

Additional Project Details

Registered

2016-08-23