DNase2Hotspots README

Last updated: November 2, 2012

Current Maintainer
: Songjoon Baek baeks@mail.nih.gov

License
: GNU General Public License, v 3.0

Introduction

DNase2Hotspots is a software package for identifying tag-enriched genomic regions (hotspots) from DNase-seq data. The program reads a BAM (Binary Alignment Map) file or a tab-delimited text file as input and produces a list of hotspots and associated z-scores as output. Mappability profiles for the reference genome and repeat-masked regions are also required as input.

Reference

Baek S, Sung MH, Hager GL. Quantitative analysis of genome-wide chromatin remodeling. Methods Mol. Biol. 833: 433-41, 2012.

System Requirements

The software has been tested on Linux Ubuntu 11.10, Mac OS X 10.7, and Microsoft Windows 7. It requires:

1. At least 4 GB of RAM
2. BamTools (https://github.com/pezmaster31/bamtools)

Installation

1. Create a folder and unzip the DNase2Hotspots.zip file into it.
2. In 'Makefile', set BAMTOOL_INCLUDEDIR and BAMTOOL_LIBDIR to the BamTools 'include' and 'lib' directories.
3. Type 'make' in the command shell to compile 'dnase2hotspots'.

Usage

Example:
   $ dnase2hotspots example_1.cfg

'example_1.cfg' is a text file containing the locations of the data files and mappability files, along with the program options. Two example configuration files are included in the package.
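
When several samples need to be processed, the command above can be driven from a short script. The following Python sketch is only an illustration: the helper names build_command and run_configs are hypothetical and not part of the package, and it assumes 'dnase2hotspots' is on the PATH and takes the configuration file as its single argument, as in the example above.

```python
import subprocess


def build_command(config_path, executable="dnase2hotspots"):
    """Return the argument list for one run: the program name plus one config file."""
    return [executable, config_path]


def run_configs(config_paths, executable="dnase2hotspots"):
    """Run the program once per configuration file, in order.

    check=True raises CalledProcessError on a non-zero exit status,
    so the loop stops at the first failing run.
    """
    for config_path in config_paths:
        subprocess.run(build_command(config_path, executable), check=True)


# Example call (requires a compiled dnase2hotspots and the config file):
# run_configs(["example_1.cfg"])
```

Passing the arguments as a list avoids shell quoting problems when a configuration file path contains spaces.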

Mappability

A k-mer mappability file for a chromosome is a binary file in which the nth byte stores the occurrence frequency in the reference genome (a value between 0 and 255) of the 5'-directional k-mer that starts at position n of the chromosome. Mappability files can be generated with the mappability map program developed by the Gerstein lab at Yale University (gerstein.org) as part of the PeakSeq package (http://archive.gersteinlab.org/proj/PeakSeq/Mappability_Map/Code/). Each mappability profile must be named "Chromosome name"+"b.out", for example "chr1b.out", "chr2b.out", etc.
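
Because the format is one byte per chromosome position, a profile can be inspected with a few lines of Python. The sketch below is not part of the package; the function names are hypothetical, and it assumes that byte offset 0 corresponds to the first base of the chromosome, which this README does not state explicitly. It builds a tiny synthetic file for the demonstration rather than reading a real profile.

```python
import os
import tempfile


def mappability_filename(chrom):
    """Return the required profile name, e.g. 'chr1' -> 'chr1b.out'."""
    return chrom + "b.out"


def read_mappability(path, start, length=1):
    """Return `length` k-mer occurrence counts beginning at byte offset `start`.

    ASSUMPTION: offset 0 is the first base of the chromosome (0-based).
    Each byte is one count in the range 0..255.
    """
    with open(path, "rb") as fh:
        fh.seek(start)
        return list(fh.read(length))  # iterating bytes yields ints in Python 3


# Demonstration on a five-position synthetic profile.
demo_dir = tempfile.mkdtemp()
demo_path = os.path.join(demo_dir, mappability_filename("chr1"))
with open(demo_path, "wb") as fh:
    fh.write(bytes([1, 1, 3, 0, 255]))

counts = read_mappability(demo_path, 2, 3)
# A count of 1 means the k-mer starting there occurs exactly once in the genome.
unique = [c == 1 for c in read_mappability(demo_path, 0, 5)]
```

Seeking to the offset instead of loading the whole file keeps memory use small, since a profile for a large chromosome holds one byte for every base.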
Source: DNase2Hotspots_README.rtf, updated 2014-03-14