<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Pipeline</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>Recent changes to Pipeline</description><atom:link href="https://sourceforge.net/p/microbegps/wiki/Pipeline/feed" rel="self"/><language>en</language><lastBuildDate>Wed, 03 Dec 2014 16:49:10 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/microbegps/wiki/Pipeline/feed" rel="self" type="application/rss+xml"/><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v6
+++ v7
@@ -10,3 +10,31 @@
 The window is divided into four panels. The upper left panel shows the MicrobeGPS __Pipeline__, where all seven steps are listed. A red light means that the step is not yet completed, a yellow light means that the step is currently running, a green light means that the step is complete. A red X indicates that a step failed. To run a step, its previous step must be completed (green light). Clicking on one step shows its __Parameters__ in the upper right panel. These parameters can be changed as long as the step has a red light. A __Parameter Description__ is shown in the lower left panel. Progress messages, errors and other notifications are shown on the __Console__ in the lower right panel.

 You can eiter preconfigure all steps and then run the complete pipeline from the beginning to the end, or you can each step separately. The former has the advantage that you can start your analysis and leave the program alone while it is running. The latter allows you to check the progress of each step such that you can, for example, rerun a step with a different parameter setting.
+
+Pipeline parameters
+-------------------
+
+### Raw filtering ###
+
+__Minimum Genome Support__ discards all reference sequences that obtained less than the specified number of reads in total (including shared reads). A higher threshold reduces the number of genomes to be analyzed (lower run time) at the risk of discarding genomes of low abundant species.
+
+__Max. Read Matches__ discards all reads having matches to more than the specified number of genomes. These reads are uninformative for the clustering step of MicrobeGPS.
+
+__Max. Read Mapping Error__ filters out all read mappings with an error above the specified value. Here, the error is defined as the fraction of mismatches in the total read length.
+
+
+### Quality Filtering ###
+__Min. Number Unique Reads__ discards references with less than the specified number of unique reads. Here, reads are considered as _unique reads_ when they were mapped uniquely to this reference. Reads are also considered as _unique_ when there are multiple matches to the same reference (i.e. also on different chromosomes). A higher threshold can further reduce the amount of data to be analyzed, especially in datasets with many noisy read mappings (e.g. when a large fraction originates from completely uncharacterized organisms).
+
+__Max. Homogeneity__ puts a threshold on the homogeneity of the distribution of the reads on the genome. The homogeneity of the read distribution over the genome is measured by comparing the read distribution to a uniform distribution using the Kolmogorov-Smirnov test statistic. Organisms with a test statistic higher than the specified value are discarded. Note, that this ist not a p-value, but the raw Kolmogorov-Smirnov test statistic!
+
+
+### Calculate Candidates ###
+
+__Min. Genome Validity__ discards all organisms below the specified validity threshold. The validity is the estimated fraction of the genome that could be covered by reads. This threshold should be kept low (or even zero) when many uncharacterized organisms are expected in the dataset. Higher thresholds may be used to only keep very certain candidates.
+
+__Coverage Similarity__ sets the characteristics of the so-called Core Reads (CR). These are reads mapping to genomes with similar genome coverage depth. This parameter defines the maximum relative coverage differences of all target genomes of a read. A lower coverage similarity parameter requires a narrower range of measured coverages of all genomes a read maps to. This yields to less CR in total. 
+
+__Fraction Shared USR__  defines in the clustering step the minimum required fraction of CR (or Unique Source Reads USR) shared with another reference to be put in the same cluster. This threshold prevents the clustering scheme from accidentally merging two reference in the same cluster that have by chance a similar coverage. Lower thresholds should only be set when really required.
+
+__Fraction Shared Reads__ allows putting references in the same cluster that were not joined via the shared CR. Here, the references are required to have a fraction of reads mapping to both references set by this parameter.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Wed, 03 Dec 2014 16:49:10 -0000</pubDate><guid>https://sourceforge.net54c6ec71762fb673a8cc05b9f848ba109a4d7239</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v5
+++ v6
@@ -5,7 +5,7 @@

 The MicrobeGPS Pipeline window shows the analysis pipeline and allows you to adjust the analysis parameters. You can either run each step separately or run all steps at once.

-[[img src=Pipeline.jpg alt=MicrobeGPS Pipeline]]
+[[img src=Pipeline.jpg]]

 The window is divided into four panels. The upper left panel shows the MicrobeGPS __Pipeline__, where all seven steps are listed. A red light means that the step is not yet completed, a yellow light means that the step is currently running, a green light means that the step is complete. A red X indicates that a step failed. To run a step, its previous step must be completed (green light). Clicking on one step shows its __Parameters__ in the upper right panel. These parameters can be changed as long as the step has a red light. A __Parameter Description__ is shown in the lower left panel. Progress messages, errors and other notifications are shown on the __Console__ in the lower right panel.

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:51:54 -0000</pubDate><guid>https://sourceforge.net5416428386bca015a6944742ac0457b3ba3dfc3a</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:50:37 -0000</pubDate><guid>https://sourceforge.net2afce4de810470982d936931b2c0e0ac7674393b</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v3
+++ v4
@@ -5,7 +5,7 @@

 The MicrobeGPS Pipeline window shows the analysis pipeline and allows you to adjust the analysis parameters. You can either run each step separately or run all steps at once.

-Put image here.
+[[img src=Pipeline.jpg alt=MicrobeGPS Pipeline]]

 The window is divided into four panels. The upper left panel shows the MicrobeGPS __Pipeline__, where all seven steps are listed. A red light means that the step is not yet completed, a yellow light means that the step is currently running, a green light means that the step is complete. A red X indicates that a step failed. To run a step, its previous step must be completed (green light). Clicking on one step shows its __Parameters__ in the upper right panel. These parameters can be changed as long as the step has a red light. A __Parameter Description__ is shown in the lower left panel. Progress messages, errors and other notifications are shown on the __Console__ in the lower right panel.

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:49:57 -0000</pubDate><guid>https://sourceforge.netf555cf4d7380e48f97be56e0b6a3bc852dcb0f9d</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:42:54 -0000</pubDate><guid>https://sourceforge.net2fa2f337cacdefd7f48317e6c9f35d27def98dac</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:42:28 -0000</pubDate><guid>https://sourceforge.net8ef5051cf8ea78fb42088a2fea418388169f582a</guid></item><item><title>Pipeline modified by Martin S. Lindner</title><link>https://sourceforge.net/p/microbegps/wiki/Pipeline/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;&lt;a class="alink" href="/p/microbegps/wiki/Home/"&gt;[Home]&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="microbegps-pipeline"&gt;MicrobeGPS Pipeline&lt;/h1&gt;
&lt;p&gt;The MicrobeGPS Pipeline window shows the analysis pipeline and allows you to adjust the analysis parameters. You can either run each step separately or run all steps at once.&lt;/p&gt;
&lt;p&gt;Put image here.&lt;/p&gt;
&lt;p&gt;The window is divided into four panels. The upper left panel shows the MicrobeGPS &lt;strong&gt;Pipeline&lt;/strong&gt;, where all seven steps are listed. A red light means that the step is not yet completed, a yellow light means that the step is currently running, a green light means that the step is complete. A red X indicates that a step failed. To run a step, its previous step must be completed (green light). Clicking on one step shows its &lt;strong&gt;Parameters&lt;/strong&gt; in the upper right panel. These parameters can be changed as long as the step has a red light. A &lt;strong&gt;Parameter Description&lt;/strong&gt; is shown in the lower left panel. Progress messages, errors and other notifications are shown on the &lt;strong&gt;Console&lt;/strong&gt; in the lower right panel.&lt;/p&gt;
&lt;p&gt;You can eiter preconfigure all steps and then run the complete pipeline from the beginning to the end, or you can each step separately. The former has the advantage that you can start your analysis and leave the program alone while it is running. The latter allows you to check the progress of each step such that you can, for example, rerun a step with a different parameter setting.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Martin S. Lindner</dc:creator><pubDate>Tue, 01 Apr 2014 13:42:16 -0000</pubDate><guid>https://sourceforge.netac33cb69b923d02b08d6ede92624a840a631ef2d</guid></item></channel></rss>