Download Latest Version GenomeRunner-v4.0-Setup.zip (2.3 MB)
Email in envelope

Get an email when there's a new version of GenomeRunner

Home / Examples
Name Modified Size InfoDownloads / Week
Parent folder
GPL6244-ncRNA_hg19 2013-11-04
H3K4me2 2013-11-04
SNPs_100_genes 2013-10-29
SNPs_40_cons46 2013-10-29
RandFOIweighted2 2013-10-29
RandFOIweighted1 2013-10-29
wgEncodeBroadHistoneGm12878H3k4me2StdPk.bed 2013-10-29 2.1 MB
SNPs_40_cons46.bed 2013-10-29 1.5 kB
GPL6244-ncRNA_hg19.bed 2013-10-29 124.2 kB
RandFOIweighted1.bed 2013-10-29 24.9 kB
RandFOIweighted2.bed 2013-10-29 24.9 kB
SNPs_100_genes.bed 2013-10-29 3.6 kB
GenomeRunner-v3.0-Supplemental.pdf 2012-01-30 209.3 kB
Totals: 13 Items   2.5 MB 0

GenomeRunner - Annotation and Enrichment analysis of Next-Gen Sequencing data

GenomeRunner is a program for annotation and enrichment analyses of user-provided genomic features of interest (FOIs) against (epi)genome annotation features (GFs) available from UCSC genome browser. GenomeRunner is a "high-level" genome data analysis and interpretation tool, that is, it helps to interpret a set of genomic regions of interest within genome annotation context.

GenomeRunner accepts FOIs from a tab-delimited .BED format, and runs annotation/enrichment analysis against >6,000 GFs (genes, alternative splicing sites, transcription factor binding sites etc.) Currently, GenomeRunner works with NCBI36/hg18 and GRCh37/hg19 human genome assemblies, and NCBI/mm9 mouse genome assemblies. Genome annotation data are organized similar to the UCSC tracks. The data are split into "tiers" according to the UCSC notation.

Annotation analysis provides a used with detailed annotation of each FOI against corresponding GFs, e.g. a set of SNPs may be annotated for co-localization and closest transctiption factor binding sites. Enrichment analysis calculates if a set of FOIs co-localizes with genomic features more often that could happen by random chance.

Prerequisites

Windows XP and up. .NET 4 framework (will be installed automatically, if missing). At least one SQLite database, downloadable separately, e.g., hg19tier1.sqlite.

Database description

  • hg19tier1.sqlite - Human Genome 19 assembly, 69 genome annotation features of best quality
  • hg19tier2.sqlite - Human Genome 19 assembly, additional 125 genome annotation features
  • hg19tier100.sqlite - Human Genome 19 assembly, 380 tracks, transcription factor binding sites
  • mm9.sqlite - Mouse Genome 9 assembly, 267 genome annotation features
  • For more data, contact me, as SourceForge has limits for storing large files

How to use

  1. Download and install GenomeRunner using setup.exe
  2. Download one of the databases, e.g., hg19tier1.sqlite. When first starting GenomeRunner, browse to the database file. GenomeRunner will remember the database location for the future runs.
  3. Download examples from the Example folder, and follow the tutorial in GenomeRunner-v3.0-Supplemental.pdf

Questions/bugs reporting/features request

I would be happy to help, or just hear from you. Mikhail Dozmorov <mikhail dot dozmorov at gmail dot com>

What's new

06/28/2013 v. 4.0.0.0 GUI ! Major update - Database format change. GenomeRunner is now independent of Internet connection and uses SQLite databases, downloadable separately. ! Minor fixes: GUI update to load local SQLite databases, interface enhancements.

1/25/2012 v. 3.1.0.0 GUI + Adding ability to load any GF as spot background, generate weighted random regions, hide unused/unfinished menu items, small fixes in menu items. Now, if one wants to use all genes to be used as a spot background, one should add gene table into "Genomic features that will be run" windos and select "File/Load selected Genomic Feature as spot background". Another way to do the same thing is to export gene table into a .bed file, and load it via "File/Open background file - Spot" + Use any SNP database as spot background for random sampling. This allows analysis of a set of SNPs against random set sampled from all SNPs in the table, not from the whole genome. Implementation is not very straightforward, as it requires manual addition of autoincrementing numerical "id" column to a SNP table. Therefore, access to this feature is currently placed in "Tools/Enrichment Analysis with SNP table as Background". Note that selecting this feature will prompt for selection of SNP table and then directly process with enrichment analysis using current settings. + Output fraction of FOIs overlapped with a GF, as matrix _PersentObsTot.gr. This is non-statistical overview what fraction out of total number of FOIs overlaps with a GF. If 100 - all FOIs overlap with a GF, if 0 - none of the FOIs overlap with GF. ! Minor fixes: Menu items adjusted, annotation output for 'Run Enrichment for all names' fixed, better handling of database connection string, binomial distribution formulae re-checking and minor adjustments, log file format adjustment.

01/30/2012 v. 3.0.0.0 GUI + Using chromInfo table for genome-specific background + Better Tier organisation for each organism + Ability to output FOI-specific enrichment matrixes, or combined matrixes + Annotation analysis - if NoOverlap is encountered, output either closest single GF, or closest left/right GF + Database dumps download available at http://wren.omrf.org/GenomeRunner/GenomeRunner.aspx ! Small fixes to improve speed and output layout of the results v. 3.0.0.0 Command line + Using Settings from .xml file, this file copied into the folder with the results output + As console version processes one FOI file at a time, ability added to merge separate matrix files into one ! Rewrote help section, invoked when GenomeRunnerConsole.exe is called without parameters

v. 2.0.0.7 + Addition of human GRCh37/hg19 and mouse NCBI37/mm9 data tables, selection of proper genomic background for each database + Option for calculating p-values using traditional Monte-Carlo simulations ! Improved random chromosome selection ! More compact log file; transposed matrix of -log10 p-values (FOIs horizontal, GFs vertical) ! Progress bar label shows the number of current Monte-Carlo run ! MySQL database connection timeout is set to 10 min

Citation

If you find GenomeRunner useful, please, cite: Dozmorov MG, Cara LR, Giles CB, Wren JD. GenomeRunner: automating genome exploration. Bioinformatics. 2012 Feb 1;28(3):419-20. doi:10.1093/bioinformatics/btr666. Epub 2011 Dec 6. PubMed PMID: 22155868; PubMed Central PMCID: PMC3268239.

Authors

Mikhail G. Dozmorov <mikhail at dozmorov dot com> Lukas R. Cara <lks underscore cara at yahoo> Cory B. Giles <cory dot b dot giles at gmail> Krista Bean

License and Disclaimer

(C) Mikhail Dozmorov 2013, Licensed under the Non-Profit Open Software License version 3.0

Source: README.md, updated 2013-10-30