<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to TopHat</title><link>https://sourceforge.net/p/altanalyze/wiki/TopHat/</link><description>Recent changes to TopHat</description><atom:link href="https://sourceforge.net/p/altanalyze/wiki/TopHat/feed" rel="self"/><language>en</language><lastBuildDate>Sat, 01 Feb 2014 01:06:55 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/altanalyze/wiki/TopHat/feed" rel="self" type="application/rss+xml"/><item><title>TopHat modified by Anonymous</title><link>https://sourceforge.net/p/altanalyze/wiki/TopHat/</link><description>&lt;div class="markdown_content"&gt;&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;&lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; is a component of the Tuxedo suite of tools developed for the analysis of short sequence reads from various high-throughput sequencing platforms. &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; works in conjunction with the short read mapper Bowtie to align sequence reads to the genome, identify and quantify junction expression (known and novel). &lt;/p&gt;
&lt;h2 id="details"&gt;Details&lt;/h2&gt;
&lt;p&gt;Current version details on &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; can be found &lt;a class="" href="http://tophat.cbcb.umd.edu/" rel="nofollow"&gt;here&lt;/a&gt;. &lt;/p&gt;
&lt;h3 id="usage-with-altanalyze"&gt;Usage with &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; can import junction alignment data from &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; junction BED files and exon alignment data from the associated accepted_hit.bam file. See &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; documentation for the latest details on usage with the sequencing technology you using. Below is an example for Illumina single read data. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Analyze a single non-paired end sample FASTQ file&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Illumina single read&lt;/em&gt;&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;tophat&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;GTF&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;work&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Common&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Data&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Reference_RNASeq_Assembly_GTF&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Homo_sapiens&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;GRCh37&lt;/span&gt;&lt;span class="mf"&gt;.56&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;chr&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;keep&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;tmp&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;output&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;dir&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;home&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;user&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;tophat_data&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;NP902&lt;/span&gt; &lt;span class="n"&gt;hg19&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;home&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;user&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;hESC&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;NP&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;SRS011902&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;NP&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;fq&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;em&gt;Additional options: Paired-end data&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;--mate-inner-dist 104 --mate-std-dev 17 --solexa1.3-quals --segment-length 18&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Additional options: Exclude novel junctions&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;--no-novel-juncs&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Obtaining exon expression estimates from &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; BAM files &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Install &lt;a class="" href="/p/altanalyze/wiki/BEDTools/"&gt;BEDTools&lt;/a&gt; and call the utility bamToBed (recognized on Unix systems once BEDTools has been added to the local or global .bashrc file). The file accepted_hits.bam is produced with each &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; run in the same output directory as the junction BED file. &lt;/p&gt;
&lt;p&gt;In the below example, "hESC_differentiation_exons.bed" is produced by &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; prior to running BEDTools (&lt;a class="" href="/p/altanalyze/wiki/BAMtoBED/"&gt;see instructions here&lt;/a&gt;), containing all known mRNA exon region coordinates from Ensembl/UCSC and all novel exon coordinates indicated from the &lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; junction BED results. These methods should work equivalently for non-&lt;a class="" href="/p/altanalyze/wiki/TopHat/"&gt;TopHat&lt;/a&gt; produced BAM files, however, additional sorting of the BAM file may be required (e.g., SAMTools). &lt;/p&gt;
&lt;p&gt;&lt;em&gt;Build Exon BED file from BAM&lt;/em&gt;&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;bamToBed&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;i&lt;/span&gt; &lt;span class="n"&gt;accepted_hits&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;bam&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;split&lt;/span&gt;&lt;span class="o"&gt;|&lt;/span&gt; &lt;span class="n"&gt;coverageBed&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;stdin&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;b&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;home&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;user&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;BAMtoBED&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;hESC_differentiation_exons&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;bed&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;home&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;user&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;RNASeqStudy&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Sample1&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;day0_s1__exons&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;bed&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;strong&gt;File naming&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If one or multiple junction and exon alignment files are produced, specific naming conventions are needed to ensure that different files for the the same biological samples are properly matched up. Below is an example of multiple junction and exon files in a single directory for import into &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;&lt;code&gt;Cancer_s1__canonical-junction.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;Cancer_s1__noncanonical-junction.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;Cancer_s1__exon.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;Wt_s1__canonical-junction.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;Wt_s1__noncanonical-junction.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;&lt;code&gt;Wt_s1__exon.bed&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;When imported into &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt;, any file name proceeding a double underscore will be recognized as the same sample and merged with the other files. Exon and junction BED files are differentiated based on the number of columns present in the two files (12 for junction, 10 for exon). &lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Sat, 01 Feb 2014 01:06:55 -0000</pubDate><guid>https://sourceforge.net3d2a485648d6cbfe9fbdf605ebebbdbb586ffb7e</guid></item></channel></rss>