GenomeDownloader is a command-line Perl program to download genomic data (using wget) from NCBI. It has been recently (2017-10) completely rewritten to work with the "new" data organization structure at NCBI. Assembly completion level (i.e., Contig, Scaffold, Chromosome or Complete Genome) can also be selected as a criterion for downloading data.

Genomic data can be downloaded from all organisms belonging to a certain taxon (e.g., Mammalia or 40674), and downloads can be limited to certain kinds of files (e.g., faa or faa,gbff etc.). Search terms can also be used to further limit results.

This program runs in Linux but could be made to run on Mac OS (maybe with a few modifications, and provided that dependencies are met).

Features

  • Automatically search and retrieve genomic data from NCBI
  • Searches using NCBI's taxonomic information, either as a name or as a taxon identifier number
  • Retrieves either all info for each genome, or only files ending in user-defined extensions (e.g. to download FASTA genome and GenBank, use fna,gbff)
  • User provided list of search terms (e.g. Strep) further limits which genomes will be retrieved
  • Multiple search terms can be provided, one per line in a file, or one term can be given on the command-line
  • Can also limit download to certain assembly completion levels (Contig, Scaffold, Chromosome or Complete Genome)

Project Activity

See All Activity >

Follow Genome Downloader

Genome Downloader Web Site

Other Useful Business Software
$300 in Free Credit Towards Top Cloud Services Icon
$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
Get Started
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Genome Downloader!

Additional Project Details

Registered

2015-01-22