<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to FileFormats</title><link>https://sourceforge.net/p/tuque/wiki/FileFormats/</link><description>Recent changes to FileFormats</description><atom:link href="https://sourceforge.net/p/tuque/wiki/FileFormats/feed" rel="self"/><language>en</language><lastBuildDate>Wed, 10 Apr 2013 17:20:51 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/tuque/wiki/FileFormats/feed" rel="self" type="application/rss+xml"/><item><title>FileFormats modified by Ian Reid</title><link>https://sourceforge.net/p/tuque/wiki/FileFormats/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v1
+++ v2
@@ -1,17 +1,16 @@
 File formats
 ============

-The input RNA-Seq reads should be in Fastq format [http://maq.sourceforge.net/fastq.shtml]. tuqueSplice and tuqueMap
-determine the read length and quality value encoding automatically; these values should be constant within each reads file, but can differ between input files.
+The input RNA-Seq reads should be in __Fastq__ format \[l\]. tuqueSplice and tuqueMap determine the read length and quality value encoding automatically; these values should be constant within each reads file, but can differ between input files.

-The genome sequence file should contain the sequence of each chromosome (or scaffold or contig) in Fasta format
-[http://www.ncbi.nlm.nih.gov/BLAST/blastcgihelp.shtml].
+The genome sequence file should contain the sequence of each chromosome (or scaffold or contig) in __Fasta__ format
+\[\].

-Sequence feature annotations should be in GFF3 format [http://www.sequenceontology.org/gff3.shtml].
+Sequence feature annotations should be in __GFF3__ format \[\].

-Read mappings are output in BAM format [http://samtools.sourceforge.net/SAM1.pdf].
+Read mappings are output in __BAM__ format \[\].

-The .juncs format is as used in early versions of Tophat.
+The __.juncs__ format is as used in early versions of Tophat.
 It is a tab-delimited text format, with one line for each splice junction. Each line contains at least 4 fields
 separated by tab characters:
  1. Chromosome Id
@@ -28,4 +27,4 @@
 10. Right anchor - the maximum distance between the junction and the 3' end of a spanning read
 11. Class        - either regular, variant, or wrongway.

-The coverage.wig files are in bedGraph format [http://genome.ucsc.edu/goldenPath/help/bedgraph.html]
+The coverage.wig files are in __bedGraph__ format \[\]
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ian Reid</dc:creator><pubDate>Wed, 10 Apr 2013 17:20:51 -0000</pubDate><guid>https://sourceforge.netdf0f1c721c268eacbadbf3a26bf81c4a99aa6b92</guid></item><item><title>FileFormats modified by Ian Reid</title><link>https://sourceforge.net/p/tuque/wiki/FileFormats/</link><description>&lt;div class="markdown_content"&gt;&lt;h1 id="file-formats"&gt;File formats&lt;/h1&gt;
&lt;p&gt;The input RNA-Seq reads should be in Fastq format &lt;span&gt;[http://maq.sourceforge.net/fastq.shtml]&lt;/span&gt;. tuqueSplice and tuqueMap&lt;br /&gt;
determine the read length and quality value encoding automatically; these values should be constant within each reads file, but can differ between input files.&lt;/p&gt;
&lt;p&gt;The genome sequence file should contain the sequence of each chromosome (or scaffold or contig) in Fasta format&lt;br /&gt;
&lt;span&gt;[http://www.ncbi.nlm.nih.gov/BLAST/blastcgihelp.shtml]&lt;/span&gt;.&lt;/p&gt;
&lt;p&gt;Sequence feature annotations should be in GFF3 format &lt;span&gt;[http://www.sequenceontology.org/gff3.shtml]&lt;/span&gt;.&lt;/p&gt;
&lt;p&gt;Read mappings are output in BAM format &lt;span&gt;[http://samtools.sourceforge.net/SAM1.pdf]&lt;/span&gt;.&lt;/p&gt;
&lt;p&gt;The .juncs format is as used in early versions of Tophat.&lt;br /&gt;
It is a tab-delimited text format, with one line for each splice junction. Each line contains at least 4 fields&lt;br /&gt;
separated by tab characters:&lt;br /&gt;
 1. Chromosome Id&lt;br /&gt;
 2. Start  - the 0-based genomic coordinate of the first base that is spliced out&lt;br /&gt;
 3. End    - the 0-based genomic coordinate of the last base that is spliced out&lt;br /&gt;
 4. Strand - either + or -&lt;/p&gt;
&lt;p&gt;.juncs files produced by tuqueSplice contain additional fields:&lt;br /&gt;
 5. Read-through ratio - the ratio of the mean read coverage depth on the spliced-out bases to the coverage depth on the bases immediately before and immediately after the junction&lt;br /&gt;
 6. Multiplicity - the number of mapped reads that are spliced at this junction&lt;br /&gt;
 7. Diversity    - the number of distinct reads that are spliced at this junction&lt;br /&gt;
 8. Donor-acceptor pair of the spliced-out intron&lt;br /&gt;
 9. Left anchor  - the maximum distance between the 5' end of a spanning read and the junction&lt;br /&gt;
10. Right anchor - the maximum distance between the junction and the 3' end of a spanning read&lt;br /&gt;
11. Class        - either regular, variant, or wrongway.&lt;/p&gt;
&lt;p&gt;The coverage.wig files are in bedGraph format &lt;span&gt;[http://genome.ucsc.edu/goldenPath/help/bedgraph.html]&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ian Reid</dc:creator><pubDate>Wed, 10 Apr 2013 17:05:25 -0000</pubDate><guid>https://sourceforge.neta7b6a3dac342d835e14215b772c64d4f48d53198</guid></item></channel></rss>