<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Tutorial</title><link>https://sourceforge.net/p/gowinda/wiki/Tutorial/</link><description>Recent changes to Tutorial</description><atom:link href="https://sourceforge.net/p/gowinda/wiki/Tutorial/feed" rel="self"/><language>en</language><lastBuildDate>Wed, 18 Mar 2015 14:56:31 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/gowinda/wiki/Tutorial/feed" rel="self" type="application/rss+xml"/><item><title>Tutorial modified by Anonymous</title><link>https://sourceforge.net/p/gowinda/wiki/Tutorial/</link><description>&lt;div class="markdown_content"&gt;&lt;ul&gt;
&lt;li&gt;Introduction&lt;/li&gt;
&lt;li&gt;Data&lt;ul&gt;
&lt;li&gt;Tutorial Gowinda&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Tutorial Gowinda&lt;ul&gt;
&lt;li&gt;Example 1: Basic analysis&lt;/li&gt;
&lt;li&gt;Example 2: Including regulatory regions&lt;/li&gt;
&lt;li&gt;Example 3: high resolution GO term enrichment&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Convert a .gff file into a .gtf&lt;/li&gt;
&lt;li&gt;Obtain a GO association file from GoMiner&lt;/li&gt;
&lt;li&gt;Dealing with synonymous gene IDs&lt;ul&gt;
&lt;li&gt;Obtain a list of synonymous gene IDs from a 'gff' file&lt;/li&gt;
&lt;li&gt;Synchronize the gene IDs of the annotation with the gene sets&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h1 id="introduction"&gt;Introduction&lt;/h1&gt;
&lt;p&gt;The following section provides a quick and fast introduction into the analysis of gene set enrichment with Gowinda. For the sake of this tutorial we provide a small artificial data set from &lt;em&gt;Drosophila melanogaster&lt;/em&gt; chromosome 2R. In these sample data we identified 348.701 SNPs and a GWAS-like study showed that 279 of those are associated with the trait of interest, lets say body weight. Furthermore the dataset contains an annotation of &lt;em&gt;D. melanogaster&lt;/em&gt; from FlyBase and the GO associations from FuncAssociate2. In the following section we demonstrate how to test, whether our set of candidate SNPs shows a significant enrichment for any GO terms. &lt;/p&gt;
&lt;h1 id="data"&gt;Data&lt;/h1&gt;
&lt;h2 id="tutorial-gowinda"&gt;Tutorial Gowinda&lt;/h2&gt;
&lt;p&gt;The following sample data set contains SNPs, annotation and the GO associations for &lt;em&gt;D.melanogaster&lt;/em&gt; chromosome arm 2R. First, download the sample data for the Walkthrough: &lt;a href="http://gowinda.googlecode.com/files/data_gowinda_tutorial.zip" rel="nofollow"&gt;http://gowinda.googlecode.com/files/data_gowinda_tutorial.zip&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;After unzipping the archive you should have the following files: &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;annotation.gtf: Annotation of the genome in the .gtf file format &lt;/li&gt;
&lt;li&gt;annotation.gff: Annotation of the genome in the .gff file format &lt;/li&gt;
&lt;li&gt;goassociations_cg.txt: A GO annotation file obtained from the FuncAssociate2 web page &lt;/li&gt;
&lt;li&gt;total_snps.txt: the set of all SNPs used in the GWAS-like study &lt;/li&gt;
&lt;li&gt;cand_snps.txt: the candidate SNPs. Some GWAS-like study showed that these SNPs are associated with a trait of interest, for example wing length. &lt;/li&gt;
&lt;/ul&gt;
&lt;h1 id="tutorial-gowinda_1"&gt;Tutorial Gowinda&lt;/h1&gt;
&lt;p&gt;&lt;strong&gt;NOTE: Please follow the examples in the given order&lt;/strong&gt;&lt;/p&gt;
&lt;h2 id="example-1-basic-analysis"&gt;Example 1: Basic analysis&lt;/h2&gt;
&lt;p&gt;First, let's test whether the candidate SNPs show any enrichment for any GO category when counting every gene only once (&lt;code&gt;--mode gene&lt;/code&gt;). Furthermore we want a SNP to be associated with a gene if the SNP is overlapping with exons or introns (&lt;code&gt;--gene-definition gene&lt;/code&gt;). To speed up the tutorial let's use 8 threads for the simulations and perform 100.000 simulations. &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;java&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;Xmx4g&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt;&lt;span class="n"&gt;path&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;to&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;gowinda&lt;/span&gt;&lt;span class="o"&gt;&amp;gt;/&lt;/span&gt;&lt;span class="n"&gt;Gowinda&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;total_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;candidate&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;cand_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;set&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;goassociations_cg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;simulations&lt;/span&gt; &lt;span class="mi"&gt;100000&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;significance&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;definition&lt;/span&gt; &lt;span class="n"&gt;gene&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;threads&lt;/span&gt; &lt;span class="mi"&gt;8&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;output&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;results_gene_gene&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;mode&lt;/span&gt; &lt;span class="n"&gt;gene&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;genes&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;After approximately ~1-2 minutes the results should be available: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0045155&lt;/span&gt;      &lt;span class="mf"&gt;0.050&lt;/span&gt;   &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mf"&gt;0.0004800000&lt;/span&gt;    &lt;span class="mf"&gt;0.1648850000&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="n"&gt;electron&lt;/span&gt; &lt;span class="n"&gt;transporter&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;transferring&lt;/span&gt; &lt;span class="n"&gt;electrons&lt;/span&gt; &lt;span class="n"&gt;from&lt;/span&gt; &lt;span class="n"&gt;CoQH2&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;cytochrome&lt;/span&gt; &lt;span class="n"&gt;c&lt;/span&gt; &lt;span class="n"&gt;reductase&lt;/span&gt; &lt;span class="n"&gt;compl&lt;/span&gt;
&lt;span class="n"&gt;ex&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;cytochrome&lt;/span&gt; &lt;span class="n"&gt;c&lt;/span&gt; &lt;span class="n"&gt;oxidase&lt;/span&gt; &lt;span class="n"&gt;complex&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;        &lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17903&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0006119&lt;/span&gt;      &lt;span class="mf"&gt;0.050&lt;/span&gt;   &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mf"&gt;0.0004800000&lt;/span&gt;    &lt;span class="mf"&gt;0.1648850000&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="n"&gt;oxidative&lt;/span&gt; &lt;span class="n"&gt;phosphorylation&lt;/span&gt;       &lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17903&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0009112&lt;/span&gt;      &lt;span class="mf"&gt;0.066&lt;/span&gt;   &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mf"&gt;0.0009500000&lt;/span&gt;    &lt;span class="mf"&gt;0.2321100000&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;16&lt;/span&gt;      &lt;span class="n"&gt;nucleobase&lt;/span&gt; &lt;span class="n"&gt;metabolic&lt;/span&gt; &lt;span class="n"&gt;process&lt;/span&gt;    &lt;span class="n"&gt;cg7811&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg7171&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0006725&lt;/span&gt;      &lt;span class="mf"&gt;0.308&lt;/span&gt;   &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mf"&gt;0.0027700000&lt;/span&gt;    &lt;span class="mf"&gt;0.3813620000&lt;/span&gt;    &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mi"&gt;11&lt;/span&gt;      &lt;span class="mi"&gt;73&lt;/span&gt;      &lt;span class="n"&gt;cellular&lt;/span&gt; &lt;span class="n"&gt;aromatic&lt;/span&gt; &lt;span class="n"&gt;compound&lt;/span&gt; &lt;span class="n"&gt;metabolic&lt;/span&gt; &lt;span class="n"&gt;process&lt;/span&gt;    &lt;span class="n"&gt;cg7811&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg7171&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg10501&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0005759&lt;/span&gt;      &lt;span class="mf"&gt;0.348&lt;/span&gt;   &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mf"&gt;0.0031100000&lt;/span&gt;    &lt;span class="mf"&gt;0.3813620000&lt;/span&gt;    &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mi"&gt;9&lt;/span&gt;       &lt;span class="mi"&gt;55&lt;/span&gt;      &lt;span class="n"&gt;mitochondrial&lt;/span&gt; &lt;span class="n"&gt;matrix&lt;/span&gt;    &lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg7235&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17903&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Please see the manual for a detailed description of the output: &lt;a href="http://code.google.com/p/gowinda/wiki/Manual" rel="nofollow"&gt;http://code.google.com/p/gowinda/wiki/Manual&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;However, in this example, with an FDR &amp;lt;0.05, no GO categories show a significant enrichment. &lt;/p&gt;
&lt;h2 id="example-2-including-regulatory-regions"&gt;Example 2: Including regulatory regions&lt;/h2&gt;
&lt;p&gt;In the previous analysis only SNPs overlapping with exons or introns were associated with a gene. In order to account for regulatory regions, let's also include 2000 bp upstream and 2000 bp downstream of a gene (&lt;code&gt;--gene-definition updownstream2000&lt;/code&gt;), for the next analysis. &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;java&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;Xmx4g&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt;&lt;span class="n"&gt;path&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;to&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;gowinda&lt;/span&gt;&lt;span class="o"&gt;&amp;gt;/&lt;/span&gt;&lt;span class="n"&gt;Gowinda&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;total_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;candidate&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;cand_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;set&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;goassociations_cg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;simulations&lt;/span&gt; &lt;span class="mi"&gt;100000&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;significance&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;definition&lt;/span&gt; &lt;span class="n"&gt;updownstream2000&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;threads&lt;/span&gt; &lt;span class="mi"&gt;8&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;output&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;results_gene_2000ud&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;mode&lt;/span&gt; &lt;span class="n"&gt;gene&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;genes&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Following an example of the results. Again, no significantly enriched GO categories were found with an FDR &amp;lt;0.05. &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0005516&lt;/span&gt;      &lt;span class="mf"&gt;1.019&lt;/span&gt;   &lt;span class="mi"&gt;4&lt;/span&gt;       &lt;span class="mf"&gt;0.0027100000&lt;/span&gt;    &lt;span class="mf"&gt;0.8419150000&lt;/span&gt;    &lt;span class="mi"&gt;4&lt;/span&gt;       &lt;span class="mi"&gt;4&lt;/span&gt;       &lt;span class="mi"&gt;29&lt;/span&gt;      &lt;span class="n"&gt;calmodulin&lt;/span&gt; &lt;span class="n"&gt;binding&lt;/span&gt;      &lt;span class="n"&gt;cg5125&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg8475&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg14472&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg6713&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0016831&lt;/span&gt;      &lt;span class="mf"&gt;0.301&lt;/span&gt;   &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mf"&gt;0.0027100000&lt;/span&gt;    &lt;span class="mf"&gt;0.8419150000&lt;/span&gt;    &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mi"&gt;4&lt;/span&gt;       &lt;span class="mi"&gt;25&lt;/span&gt;      &lt;span class="n"&gt;carboxy&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;lyase&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;  &lt;span class="n"&gt;cg5029&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg7811&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg10501&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0022618&lt;/span&gt;      &lt;span class="mf"&gt;0.389&lt;/span&gt;   &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mf"&gt;0.0047600000&lt;/span&gt;    &lt;span class="mf"&gt;0.8419150000&lt;/span&gt;    &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mi"&gt;6&lt;/span&gt;       &lt;span class="mi"&gt;33&lt;/span&gt;      &lt;span class="n"&gt;ribonucleoprotein&lt;/span&gt; &lt;span class="n"&gt;complex&lt;/span&gt; &lt;span class="n"&gt;assembly&lt;/span&gt;      &lt;span class="n"&gt;cg7138&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg4602&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg6137&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0016830&lt;/span&gt;      &lt;span class="mf"&gt;0.370&lt;/span&gt;   &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mf"&gt;0.0048500000&lt;/span&gt;    &lt;span class="mf"&gt;0.8419150000&lt;/span&gt;    &lt;span class="mi"&gt;3&lt;/span&gt;       &lt;span class="mi"&gt;6&lt;/span&gt;       &lt;span class="mi"&gt;34&lt;/span&gt;      &lt;span class="n"&gt;carbon&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;carbon&lt;/span&gt; &lt;span class="n"&gt;lyase&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;    &lt;span class="n"&gt;cg5029&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg7811&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg10501&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0007349&lt;/span&gt;      &lt;span class="mf"&gt;3.040&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0048600000&lt;/span&gt;    &lt;span class="mf"&gt;0.8419150000&lt;/span&gt;    &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mi"&gt;24&lt;/span&gt;      &lt;span class="mi"&gt;91&lt;/span&gt;      &lt;span class="n"&gt;cellularization&lt;/span&gt; &lt;span class="n"&gt;cg9506&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg14472&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg10728&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg8049&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg5020&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17161&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg6137&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;h2 id="example-3-high-resolution-go-term-enrichment"&gt;Example 3: high resolution GO term enrichment&lt;/h2&gt;
&lt;p&gt;As we are getting quite desperate and we know that linkage disequilibrium is decaying quite rapidly in our organism we want to perform a higher resolution analysis in which genes may be counted several times, dependent on the number of candidate SNPs (&lt;code&gt;--mode snp&lt;/code&gt;). For example a gene containing 5 candidate SNPs is counted 5 times (with the simulations as well as with the candidate SNPs). &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;java&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;Xmx4g&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt;&lt;span class="n"&gt;path&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;to&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;gowinda&lt;/span&gt;&lt;span class="o"&gt;&amp;gt;/&lt;/span&gt;&lt;span class="n"&gt;Gowinda&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;jar&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;total_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;candidate&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;snp&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;cand_snps&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;set&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;goassociations_cg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;simulations&lt;/span&gt; &lt;span class="mi"&gt;100000&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;significance&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;definition&lt;/span&gt; &lt;span class="n"&gt;updownstream2000&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;threads&lt;/span&gt; &lt;span class="mi"&gt;8&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;output&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;file&lt;/span&gt; &lt;span class="n"&gt;results_snp_2000ud&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;mode&lt;/span&gt; &lt;span class="n"&gt;snp&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;min&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;genes&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;The results should look like in the following example: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0006497&lt;/span&gt;      &lt;span class="mf"&gt;0.606&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;7&lt;/span&gt;       &lt;span class="mi"&gt;45&lt;/span&gt;      &lt;span class="n"&gt;protein&lt;/span&gt; &lt;span class="n"&gt;lipidation&lt;/span&gt;      &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0043543&lt;/span&gt;      &lt;span class="mf"&gt;0.797&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mi"&gt;36&lt;/span&gt;      &lt;span class="n"&gt;protein&lt;/span&gt; &lt;span class="n"&gt;acylation&lt;/span&gt;       &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0006119&lt;/span&gt;      &lt;span class="mf"&gt;0.167&lt;/span&gt;   &lt;span class="mi"&gt;7&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="n"&gt;oxidative&lt;/span&gt; &lt;span class="n"&gt;phosphorylation&lt;/span&gt;       &lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17903&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0005794&lt;/span&gt;      &lt;span class="mf"&gt;1.865&lt;/span&gt;   &lt;span class="mi"&gt;12&lt;/span&gt;      &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;12&lt;/span&gt;      &lt;span class="mi"&gt;65&lt;/span&gt;      &lt;span class="n"&gt;Golgi&lt;/span&gt; &lt;span class="n"&gt;apparatus&lt;/span&gt; &lt;span class="n"&gt;cg18810&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg5020&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0045155&lt;/span&gt;      &lt;span class="mf"&gt;0.167&lt;/span&gt;   &lt;span class="mi"&gt;7&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="mi"&gt;2&lt;/span&gt;       &lt;span class="n"&gt;electron&lt;/span&gt; &lt;span class="n"&gt;transporter&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt; &lt;span class="n"&gt;transferring&lt;/span&gt; &lt;span class="n"&gt;electrons&lt;/span&gt; &lt;span class="n"&gt;from&lt;/span&gt; &lt;span class="n"&gt;CoQH2cytochrome&lt;/span&gt; &lt;span class="n"&gt;c&lt;/span&gt; &lt;span class="n"&gt;reductase&lt;/span&gt; &lt;span class="n"&gt;complex&lt;/span&gt; &lt;span class="n"&gt;and&lt;/span&gt; &lt;span class="n"&gt;cytochrome&lt;/span&gt; &lt;span class="n"&gt;c&lt;/span&gt; &lt;span class="n"&gt;oxidase&lt;/span&gt; &lt;span class="n"&gt;complex&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;        &lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17903&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0005212&lt;/span&gt;      &lt;span class="mf"&gt;0.118&lt;/span&gt;   &lt;span class="mi"&gt;4&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="n"&gt;structural&lt;/span&gt; &lt;span class="n"&gt;constituent&lt;/span&gt; &lt;span class="n"&gt;of&lt;/span&gt; &lt;span class="n"&gt;eye&lt;/span&gt; &lt;span class="n"&gt;lens&lt;/span&gt;      &lt;span class="n"&gt;cg16963&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0019707&lt;/span&gt;      &lt;span class="mf"&gt;0.464&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;5&lt;/span&gt;       &lt;span class="mi"&gt;21&lt;/span&gt;      &lt;span class="n"&gt;protein&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;cysteine&lt;/span&gt; &lt;span class="n"&gt;S&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;acyltransferase&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;     &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0019706&lt;/span&gt;      &lt;span class="mf"&gt;0.464&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;5&lt;/span&gt;       &lt;span class="mi"&gt;21&lt;/span&gt;      &lt;span class="n"&gt;protein&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;cysteine&lt;/span&gt; &lt;span class="n"&gt;S&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;palmitoleyltransferase&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;      &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0016409&lt;/span&gt;      &lt;span class="mf"&gt;0.640&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;6&lt;/span&gt;       &lt;span class="mi"&gt;25&lt;/span&gt;      &lt;span class="n"&gt;palmitoyltransferase&lt;/span&gt; &lt;span class="n"&gt;activity&lt;/span&gt;   &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0007349&lt;/span&gt;      &lt;span class="mf"&gt;4.461&lt;/span&gt;   &lt;span class="mi"&gt;16&lt;/span&gt;      &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mi"&gt;24&lt;/span&gt;      &lt;span class="mi"&gt;91&lt;/span&gt;      &lt;span class="n"&gt;cellularization&lt;/span&gt; &lt;span class="n"&gt;cg9506&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg13263&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg14472&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg10728&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg8049&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg5020&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg17161&lt;/span&gt;&lt;span class="o"&gt;,&lt;/span&gt;&lt;span class="n"&gt;cg6137&lt;/span&gt;
&lt;span class="n"&gt;GO&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="mi"&gt;0018345&lt;/span&gt;      &lt;span class="mf"&gt;0.464&lt;/span&gt;   &lt;span class="mi"&gt;8&lt;/span&gt;       &lt;span class="mf"&gt;0.0000100000&lt;/span&gt;    &lt;span class="mf"&gt;0.0014018182&lt;/span&gt;    &lt;span class="mi"&gt;1&lt;/span&gt;       &lt;span class="mi"&gt;5&lt;/span&gt;       &lt;span class="mi"&gt;22&lt;/span&gt;      &lt;span class="n"&gt;protein&lt;/span&gt; &lt;span class="n"&gt;palmitoylation&lt;/span&gt;  &lt;span class="n"&gt;cg18810&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Finally, we have found a significantly enriched GO categories with an FDR &amp;lt;0.05 . For example, for the first GO term (GO:0006497) we find on the average only one candidate SNP, but in our analysis we found 8. Note that we have a significant enrichment for this category although candidate SNPs were only found for a single gene. In this mode we assume that selection is acting on all 8 SNPs independently. However this could also be due to selection acting on a single SNP and the others hitch-hicking along. Therefore results of this analysis mode must be interpreted with care. &lt;/p&gt;
&lt;h1 id="convert-a-gff-file-into-a-gtf"&gt;Convert a &lt;code&gt;.gff&lt;/code&gt; file into a &lt;code&gt;.gtf&lt;/code&gt;&lt;/h1&gt;
&lt;p&gt;In this section we demonstrate how to convert a &lt;code&gt;.gff&lt;/code&gt; file into a &lt;code&gt;.gtf&lt;/code&gt; file using the script &lt;code&gt;Gff2Gtf.py&lt;/code&gt;: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="o"&gt;~/&lt;/span&gt;&lt;span class="n"&gt;dev&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Gowinda&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;src&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;scripts&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;Gff2Gtf&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;input&lt;/span&gt; &lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gff&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="n"&gt;converted&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;The output should be similar to the following: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;18442&lt;/span&gt;   &lt;span class="mi"&gt;18629&lt;/span&gt;   &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;+&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0262115"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;18487&lt;/span&gt;   &lt;span class="mi"&gt;18629&lt;/span&gt;   &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;+&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0262115"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;18498&lt;/span&gt;   &lt;span class="mi"&gt;18773&lt;/span&gt;   &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;+&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0262115"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;18681&lt;/span&gt;   &lt;span class="mi"&gt;19484&lt;/span&gt;   &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;+&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0262115"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;18681&lt;/span&gt;   &lt;span class="mi"&gt;18773&lt;/span&gt;   &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;+&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0262115"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;This converted annotation has the &lt;code&gt;FBgn&lt;/code&gt; gene IDs whereas the GO association file has the &lt;code&gt;CG&lt;/code&gt; gene IDs. It is thus necessary to obtain a different GO association file. Most conveniently such a GO association file can be directly obtained from FuncAssociate2 (&lt;a href="http://llama.mshri.on.ca/funcassociate/download_go_associations" rel="nofollow"&gt;http://llama.mshri.on.ca/funcassociate/download_go_associations&lt;/a&gt;). However if many genes in the association file have an unmatched entry in the annotation, or vice versa, we recommend to obtain the GO association file from GoMiner (see following tutorial): &lt;/p&gt;
&lt;h1 id="obtain-a-go-association-file-from-gominer"&gt;Obtain a GO association file from GoMiner&lt;/h1&gt;
&lt;p&gt;Usually we recommend to obtain a GO association file from FuncAssociate2. However in some cases it may be advantageous to obtain this file from GoMiner instead. We recommend to obtain an GO association file from GoMiner when you have many unmatched gene IDs, between the annotation and the GO association file (the log of Gowinda contains information about the number of matching gene IDs). GoMiner uses a database of synonymous gene IDs and this can be utilized to produce a GO association file suitable for Gowinda. However, GoMiner does not directly offer such a download, therefore we have to perform a 'fake' query for GO term enrichment and subsequently extract the GO association file from the results. &lt;/p&gt;
&lt;p&gt;First we need a unique set of the gene IDs (using the &lt;code&gt;converted.gtf&lt;/code&gt; file from the previous tutorial) &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;cat&lt;/span&gt; &lt;span class="n"&gt;converted&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt;&lt;span class="o"&gt;|&lt;/span&gt;&lt;span class="n"&gt;awk&lt;/span&gt; &lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="n"&gt;print&lt;/span&gt; &lt;span class="err"&gt;$&lt;/span&gt;&lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="o"&gt;|&lt;/span&gt;&lt;span class="n"&gt;sed&lt;/span&gt; &lt;span class="err"&gt;'&lt;/span&gt;&lt;span class="n"&gt;s&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="s"&gt;";//'|sed 's/"&lt;/span&gt;&lt;span class="c1"&gt;//'|sort |uniq &amp;gt; total_genes.txt&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Than we start a query at High Throughput GoMiner (&lt;a href="http://discover.nci.nih.gov/gominer/relationship.jsp):" rel="nofollow"&gt;http://discover.nci.nih.gov/gominer/relationship.jsp):&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Step 1: provide the total_genes.txt file shown above &lt;/li&gt;
&lt;li&gt;Step 2: again provide the total_genes.txt file shown above &lt;/li&gt;
&lt;li&gt;Step 3,4,5: choose your species and evidence level &lt;/li&gt;
&lt;li&gt;Step 6: make sure that &lt;code&gt;Cross Reference&lt;/code&gt; and &lt;code&gt;Synonym&lt;/code&gt; are checked &lt;/li&gt;
&lt;li&gt;Step 7,8: leave the devaults &lt;/li&gt;
&lt;li&gt;Step 9: set smalles category to 1 &lt;/li&gt;
&lt;li&gt;Step 10: set largest category to 10.000 &lt;/li&gt;
&lt;li&gt;Step 11: Choose all/gene ontology &lt;/li&gt;
&lt;li&gt;Step 13: Provide email address and submit query &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Your parameters should be similar to the following: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;totalfile&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;total_genes&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;
&lt;span class="n"&gt;changedfile&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;total_genes&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;
&lt;span class="n"&gt;datasource&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;FB&lt;/span&gt;
&lt;span class="n"&gt;organism&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;7227&lt;/span&gt;
&lt;span class="n"&gt;evidencecode&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;all&lt;/span&gt;
&lt;span class="n"&gt;crossref&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="nb"&gt;true&lt;/span&gt;
&lt;span class="n"&gt;synonym&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="nb"&gt;true&lt;/span&gt;
&lt;span class="n"&gt;thresholdtype&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;BOTH&lt;/span&gt;
&lt;span class="n"&gt;timeseriesthreshold&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mf"&gt;0.05&lt;/span&gt;
&lt;span class="n"&gt;randomization&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;100&lt;/span&gt;
&lt;span class="n"&gt;categorymin&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;
&lt;span class="n"&gt;cim&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;
&lt;span class="n"&gt;categorymax&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;10000&lt;/span&gt;
&lt;span class="n"&gt;rootcategory&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;all&lt;/span&gt;
&lt;span class="n"&gt;tf&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;0&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Finally convert the resulting &lt;code&gt;.gce&lt;/code&gt; file into the file format compatible with Gowinda: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt;&lt;span class="n"&gt;path&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;to&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;conversiontool&lt;/span&gt;&lt;span class="o"&gt;&amp;gt;/&lt;/span&gt;&lt;span class="n"&gt;GoMiner2FuncAssociate&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;input&lt;/span&gt; &lt;span class="n"&gt;work1338826265&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;total_genes&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt1338826265&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;dir&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;total_genes&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;dir&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;total_genes&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;change&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gce&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="n"&gt;association_gominer&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;h1 id="dealing-with-synonymous-gene-ids"&gt;Dealing with synonymous gene IDs&lt;/h1&gt;
&lt;p&gt;Many genes have a tremendous amount of aliases i.e.: alternative gene IDs. As a requirement for using Gowinda the gene IDs of the annotation have to match the gene IDs from the gene set file. Due to an extensive amount of synonymous gene IDs this may not always be the case. Therefore we provide a script which allowes to synchronized the gene IDs of the annotation and the gene set file given a list of synonymous gene IDs. The script basically updates the gene IDs of the annotation ('.gtf') with the corresponding gene ID from the gene set. A list of synonymous gene IDs is required. Such a list of synonymous gene IDs may for example be obtained from a .gff file, see next example: &lt;/p&gt;
&lt;h2 id="obtain-a-list-of-synonymous-gene-ids-from-a-gff-file"&gt;Obtain a list of synonymous gene IDs from a 'gff' file&lt;/h2&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="n"&gt;ExtractGeneAliasesFromGff&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;input&lt;/span&gt; &lt;span class="n"&gt;annotation&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gff&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="n"&gt;synonymous&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;genids&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Which should result in a file similar to the following example &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;FBgn0259968&lt;/span&gt; &lt;span class="n"&gt;Sfp60F&lt;/span&gt;  &lt;span class="n"&gt;CG42478&lt;/span&gt; &lt;span class="n"&gt;Seminal&lt;/span&gt; &lt;span class="n"&gt;fluid&lt;/span&gt; &lt;span class="n"&gt;protein&lt;/span&gt; &lt;span class="mf"&gt;60F&lt;/span&gt;   &lt;span class="n"&gt;Seminal&lt;/span&gt; &lt;span class="n"&gt;fluid&lt;/span&gt; &lt;span class="n"&gt;protein&lt;/span&gt; &lt;span class="mf"&gt;60F&lt;/span&gt;
&lt;span class="n"&gt;FBgn0011694&lt;/span&gt; &lt;span class="n"&gt;PebII&lt;/span&gt;   &lt;span class="n"&gt;PEB&lt;/span&gt; &lt;span class="n"&gt;PEBmeII&lt;/span&gt; &lt;span class="n"&gt;CG2665&lt;/span&gt;  &lt;span class="n"&gt;PEBII&lt;/span&gt;   &lt;span class="n"&gt;PEB2&lt;/span&gt;    &lt;span class="n"&gt;Protein&lt;/span&gt; &lt;span class="n"&gt;ejaculatory&lt;/span&gt; &lt;span class="n"&gt;bulb&lt;/span&gt; &lt;span class="n"&gt;II&lt;/span&gt;
&lt;span class="n"&gt;FBgn0004181&lt;/span&gt; &lt;span class="n"&gt;Peb&lt;/span&gt; &lt;span class="n"&gt;FBgn0028493&lt;/span&gt; &lt;span class="n"&gt;FBgn0044740&lt;/span&gt; &lt;span class="n"&gt;CG2668&lt;/span&gt;  &lt;span class="n"&gt;PEB&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;me&lt;/span&gt;  &lt;span class="n"&gt;PEB&lt;/span&gt; &lt;span class="n"&gt;P38&lt;/span&gt; &lt;span class="n"&gt;peb&lt;/span&gt; &lt;span class="n"&gt;cDNA&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="n"&gt;GH06048&lt;/span&gt;    &lt;span class="n"&gt;anon&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;WO0140519&lt;/span&gt;&lt;span class="mf"&gt;.140&lt;/span&gt;  &lt;span class="n"&gt;ejaculatory&lt;/span&gt; &lt;span class="n"&gt;bulb&lt;/span&gt; &lt;span class="n"&gt;protein&lt;/span&gt;    &lt;span class="n"&gt;Protein&lt;/span&gt; &lt;span class="n"&gt;ejaculatory&lt;/span&gt; &lt;span class="n"&gt;bulb&lt;/span&gt;
&lt;span class="n"&gt;FBgn0035094&lt;/span&gt; &lt;span class="n"&gt;CG9380&lt;/span&gt;
&lt;span class="n"&gt;FBgn0001325&lt;/span&gt; &lt;span class="n"&gt;Kr&lt;/span&gt;  &lt;span class="n"&gt;FBgn0001251&lt;/span&gt; &lt;span class="n"&gt;FBgn0015732&lt;/span&gt; &lt;span class="n"&gt;Kuppel&lt;/span&gt;  &lt;span class="n"&gt;kruppel&lt;/span&gt; &lt;span class="n"&gt;kr&lt;/span&gt;  &lt;span class="n"&gt;CG3340&lt;/span&gt;  &lt;span class="n"&gt;Krpple&lt;/span&gt;  &lt;span class="n"&gt;If&lt;/span&gt;  &lt;span class="n"&gt;Irregular&lt;/span&gt; &lt;span class="n"&gt;facets&lt;/span&gt;    &lt;span class="n"&gt;Kruppel&lt;/span&gt; &lt;span class="n"&gt;Dm&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;Kr&lt;/span&gt;   &lt;span class="n"&gt;krueppel&lt;/span&gt;    &lt;span class="n"&gt;Lruppel&lt;/span&gt; &lt;span class="n"&gt;KR&lt;/span&gt;  &lt;span class="n"&gt;KRUPPEL&lt;/span&gt; &lt;span class="n"&gt;Krueppel&lt;/span&gt;    &lt;span class="n"&gt;Kruppel&lt;/span&gt;
&lt;span class="n"&gt;FBgn0050429&lt;/span&gt; &lt;span class="n"&gt;CG30429&lt;/span&gt;
&lt;span class="n"&gt;FBgn0053680&lt;/span&gt; &lt;span class="n"&gt;CG33680&lt;/span&gt;
&lt;span class="n"&gt;FBgn0050428&lt;/span&gt; &lt;span class="n"&gt;CG30428&lt;/span&gt; &lt;span class="n"&gt;FBgn0064772&lt;/span&gt; &lt;span class="n"&gt;BcDNA&lt;/span&gt;&lt;span class="o"&gt;:&lt;/span&gt;&lt;span class="n"&gt;SD21514&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Every line contains all synonymous gene IDs for a single gene &lt;/p&gt;
&lt;h2 id="synchronize-the-gene-ids-of-the-annotation-with-the-gene-sets"&gt;Synchronize the gene IDs of the annotation with the gene sets&lt;/h2&gt;
&lt;p&gt;The following example used the file 'converted.gtf' from a previous tutorial. Converted.gtf contains the gene IDs as FBGN. After updating they will be replaced with the corresponding 'CG' gene IDs. &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="n"&gt;SynchronizeGtfGeneIDs&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="n"&gt;converted&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;set&lt;/span&gt; &lt;span class="n"&gt;goassociations_cg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;gene&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;aliases&lt;/span&gt; &lt;span class="n"&gt;synonymous&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;genids&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;txt&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="n"&gt;converted_cg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;gtf&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Example of the '.gtf' before updating: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;209942&lt;/span&gt;  &lt;span class="mi"&gt;210220&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0260798"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;210281&lt;/span&gt;  &lt;span class="mi"&gt;210443&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0260798"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;223882&lt;/span&gt;  &lt;span class="mi"&gt;224088&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"FBgn0260798"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Example of the '.gtf' after updating: &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;209942&lt;/span&gt;  &lt;span class="mi"&gt;210220&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"CG40129"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;210281&lt;/span&gt;  &lt;span class="mi"&gt;210443&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"CG40129"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;R&lt;/span&gt;      &lt;span class="n"&gt;FlyBase&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt;    &lt;span class="mi"&gt;223882&lt;/span&gt;  &lt;span class="mi"&gt;224088&lt;/span&gt;  &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="o"&gt;-&lt;/span&gt;       &lt;span class="p"&gt;.&lt;/span&gt;       &lt;span class="n"&gt;gene_id&lt;/span&gt; &lt;span class="s"&gt;"CG40129"&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;Note that the IDs of genes not having a synonymous gene ID will not be updated! &lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Wed, 18 Mar 2015 14:56:31 -0000</pubDate><guid>https://sourceforge.net13a690aff96f35b934c1951643f0306e44e37f50</guid></item></channel></rss>