<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to GS</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>Recent changes to GS</description><atom:link href="https://sourceforge.net/p/arden/wiki/GS/feed" rel="self"/><language>en</language><lastBuildDate>Thu, 25 Apr 2013 12:00:51 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/arden/wiki/GS/feed" rel="self" type="application/rss+xml"/><item><title>GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v26
+++ v27
@@ -103,7 +103,7 @@

 To filter alignments according to the features RQS (-r), gaps (-g) and mismatches (-m) just run:

-&lt;code&gt; arden-filter input.sam output.sam -r value1 -m value2  - g value3  &lt;/code&gt;
+&lt;code&gt; arden-filter input.sam output.sam -r value1 -m value2  -g value3  &lt;/code&gt;

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Thu, 25 Apr 2013 12:00:51 -0000</pubDate><guid>https://sourceforge.neta2789f2cf7462d9e9becc9ac95c344b2f91b9c4e</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v25
+++ v26
@@ -62,8 +62,8 @@
   #  | Artificial fasta
   &amp;  | Fastq file
   @  | Mapper ID
-  bamref  | alignment REF
-  bamart  | alignment ART
+  ref  | alignment REF
+  art  | alignment ART
   +| End

 For our example the ControlFile.ini should like this:
@@ -73,12 +73,12 @@
 \#:/data/test/AR_111_21_zymamo.fasta
 &amp;:/data/test/zymamo.fastq
 @Bow2
-bamref:/data/test/Bow2_REF_sorted.sam
-bamarg:/data/test/Bow2_ART_sorted.sam
+ref:/data/test/Bow2_REF_sorted.sam
+art:/data/test/Bow2_ART_sorted.sam
 +
 @RazerS3
-bamref:/data/test/Raz3_REF_sorted.sam
-bamarg:/data/test/Raz3_ART_sorted.sam
+ref:/data/test/Raz3_REF_sorted.sam
+art:/data/test/Raz3_ART_sorted.sam
 +
 &lt;/code&gt; 
 After de
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Wed, 13 Mar 2013 14:42:13 -0000</pubDate><guid>https://sourceforge.netbe19c153d32364eed6eb17fb5f54dfc9649fc1a6</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v24
+++ v25
@@ -93,7 +93,7 @@

-* .esam files: The files contain the alignment information extracted by ARDEN in a simpliefied "sam" format. The columns include (ReadID, Boolean indicating TP(1) or FP(0), substitutions between artificial and reference in the region spanned with the read, edit distance, read quality score, mapping quality, start alignment, end alignment number of gaps, number of mismatches)
+* .esam files: The files contain the alignment information extracted by ARDEN in a simpliefied "sam" format (see &lt;a href="https://sourceforge.net/p/arden/wiki/results/"&gt;here&lt;/a&gt; for details)
 * results.txt: This textfile contains the results table with the overview about numerical results (senstivitiy, specificity, AUC) and all read mappers
 * ROC plots: Plots of the correspoding ROC curve for each mapper.
 * ROC tables: Tables for each mapper, where for each subclass the sensitivity, specificity and M are displayed. This table is usefull to determine an adequate quality threshold.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Wed, 13 Mar 2013 14:41:40 -0000</pubDate><guid>https://sourceforge.net297d95017f4d2a69de5928147dda1d20da62c621</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v23
+++ v24
@@ -6,7 +6,7 @@

 #Introduction
-For this guide we will use a very small example for a walk through. The reference genome is the sequence of Zymomonas mobilis subsp. mobilis ZM4 chromosome written to &lt;b&gt;NC_006526.fna&lt;/b&gt;. For the read mapping step we simply simulate 10k reads of length 100 using mason  \[2] in &lt;b&gt;zymamo_10k_100L.fastq&lt;/b&gt; (simulation just for the sake of this tutorial).
+For this guide we will use a very small example for a walk through. The reference genome is the sequence of Zymomonas mobilis subsp. mobilis ZM4 chromosome written to &lt;b&gt;NC_006526.fna&lt;/b&gt;. For the read mapping step we simply simulate 100k reads of length 100 using mason  \[2] in &lt;b&gt;zymamo_100k_100L.fastq&lt;/b&gt; (simulation just for the sake of this tutorial). An overview for the results of this guide is given &lt;a href="https://sourceforge.net/p/arden/wiki/results/"&gt;HERE&lt;/a&gt;

 #Creating an artificial reference genome#
 The first step will create the artificial reference genome. The options can be seen with: 
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Wed, 13 Mar 2013 13:40:07 -0000</pubDate><guid>https://sourceforge.net67c9d4ed83ff151f69ee236d382f490bb7751d0e</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v22
+++ v23
@@ -6,7 +6,7 @@

 #Introduction
-For this guide we will use a very small example for a walk through. The data is contained in the test folder included in ARDEN download archive. The reference genome is the sequence of Zymomonas mobilis subsp. mobilis ZM4 chromosome written to &lt;b&gt;NC_006526.fna&lt;/b&gt;. For the read mapping step we included 10k simulated reads using mason \[2] in &lt;b&gt;zymamo_10k_100L.fastq&lt;/b&gt;
+For this guide we will use a very small example for a walk through. The reference genome is the sequence of Zymomonas mobilis subsp. mobilis ZM4 chromosome written to &lt;b&gt;NC_006526.fna&lt;/b&gt;. For the read mapping step we simply simulate 10k reads of length 100 using mason  \[2] in &lt;b&gt;zymamo_10k_100L.fastq&lt;/b&gt; (simulation just for the sake of this tutorial).

 #Creating an artificial reference genome#
 The first step will create the artificial reference genome. The options can be seen with: 
@@ -32,7 +32,7 @@
 #Mapping step
 For this step map the simulated reads to the reference and the created arti
 cial
-reference. Use any read mapper supporting SAM format.
+reference. Use any read mapper supporting SAM format. We chose Bowtie2 and RazerS3.

 #Sorting step
 To sort the reads the following combination of SAMtools \[2] commands can be
@@ -77,8 +77,8 @@
 bamarg:/data/test/Bow2_ART_sorted.sam
 +
 @RazerS3
-bamref:/data/test/REF_sorted.sam
-bamarg:/data/test/ART_sorted.sam
+bamref:/data/test/Raz3_REF_sorted.sam
+bamarg:/data/test/Raz3_ART_sorted.sam
 +
 &lt;/code&gt; 
 After de
@@ -91,7 +91,9 @@
 ## Generated Output
  The program call will generate four types of output:

-* .esam files: The files contain the alignment information extracted by ARDEN in a simpliefied "sam" format.
+
+
+* .esam files: The files contain the alignment information extracted by ARDEN in a simpliefied "sam" format. The columns include (ReadID, Boolean indicating TP(1) or FP(0), substitutions between artificial and reference in the region spanned with the read, edit distance, read quality score, mapping quality, start alignment, end alignment number of gaps, number of mismatches)
 * results.txt: This textfile contains the results table with the overview about numerical results (senstivitiy, specificity, AUC) and all read mappers
 * ROC plots: Plots of the correspoding ROC curve for each mapper.
 * ROC tables: Tables for each mapper, where for each subclass the sensitivity, specificity and M are displayed. This table is usefull to determine an adequate quality threshold.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Wed, 13 Mar 2013 13:10:06 -0000</pubDate><guid>https://sourceforge.nete3850ac0ae19859a0b154ba378e466c7190fe9a3</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v21
+++ v22
@@ -99,9 +99,9 @@

 #Filtering (optional)

-To filter alignments according to the features RQS (-r), gaps -g) and mismatches-m) just run:
+To filter alignments according to the features RQS (-r), gaps (-g) and mismatches (-m) just run:

-&lt;code&gt; arden-filter input.sam output.sam -r x -m y - g z &lt;/code&gt;
+&lt;code&gt; arden-filter input.sam output.sam -r value1 -m value2  - g value3  &lt;/code&gt;

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Tue, 12 Mar 2013 14:15:12 -0000</pubDate><guid>https://sourceforge.net36eba544a9679f76113ec799cf27cdf8dea8e803</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v20
+++ v21
@@ -99,7 +99,7 @@

 #Filtering (optional)

-To filter alignments according to the features RSQ, gaps and mismatches just run:
+To filter alignments according to the features RQS (-r), gaps -g) and mismatches-m) just run:

 &lt;code&gt; arden-filter input.sam output.sam -r x -m y - g z &lt;/code&gt;

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Mon, 11 Mar 2013 17:32:30 -0000</pubDate><guid>https://sourceforge.net5bc38a69375f568c9b19e5e8225650ecf567af76</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v19
+++ v20
@@ -40,7 +40,7 @@

 &lt;code&gt; samtools view -F 4 -bS input.sam | samtools sort -no - -| samtools view -h - &gt; output.sam&lt;/code&gt;

-To sort the files in parallel it might be an option to use &lt;b&gt;GNU parallel &lt;/b&gt;. The following command will convert all *.sam files in the current directory with all available cores.  \[3].
+To sort the files in parallel it might be an option to use &lt;b&gt;GNU parallel &lt;/b&gt; \[3]. The following command will convert all *.sam files in the current directory with all available cores.

 &lt;code&gt; ls *.sam | parallel "samtools view -F 4 -bS {} | samtools sort -no - -| samtools view -h - &gt; {.}_sorted.sam"&lt;/code&gt;

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Mon, 11 Mar 2013 16:54:41 -0000</pubDate><guid>https://sourceforge.netd0d19919781dd48205ef1fdaafaaaccf622d7a03</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v18
+++ v19
@@ -17,9 +17,17 @@

 &lt;code&gt; arden-create -e 1&lt;/code&gt;

-The following command line will produce a artificial reference with a substitution on every 21th nucleotide (-d 21). Orfs will not be protected from mutations (-o 0) and the name for the sequence is zymamo (-n). Note that the options for CreateAR are written after the first -o without blanks!
+The following command line will produce an artificial reference with a substitution on every 21th nucleotide (-d 21). Orfs will not be protected from mutations (-o 0) and the name for the sequence is zymamo (-n). It is recommended to set this name to a string without special characters. Otherwise the fasta header will be used which might not look "nice".

-&lt;code&gt; arden-create /data/test/ /data/test/NC_006526.fna -o -d21-o0-nzymamo &lt;/code&gt;
+&lt;code&gt; arden-create /data/test/ /data/test/NC_006526.fna -d 21 -o 0 -n zymamo &lt;/code&gt;
+
+## Generated Output
+
+arden-create will generate three different types of output with the following naming convention. AR_abc_distance_referencename, where a = orf option, b = reverse unbalanced mutations option, c = random option.
+
+* Artificial Reference: A nucleotide sequence generated from the input genome.
+* Distances.txt: A file which summarizes the differences between the artificial reference and the input genome in terms of nucleotide /amino acid distribution.
+* CompleteLog.txt: A log file which traces the nucleotide / amino acid changes between artificial reference and input genome.

 #Mapping step
 For this step map the simulated reads to the reference and the created arti
@@ -32,7 +40,7 @@

 &lt;code&gt; samtools view -F 4 -bS input.sam | samtools sort -no - -| samtools view -h - &gt; output.sam&lt;/code&gt;

-To sort the files in parallel it might be an option to use &lt;b&gt;GNU parallel &lt;/b&gt;  \[3].
+To sort the files in parallel it might be an option to use &lt;b&gt;GNU parallel &lt;/b&gt;. The following command will convert all *.sam files in the current directory with all available cores.  \[3].

 &lt;code&gt; ls *.sam | parallel "samtools view -F 4 -bS {} | samtools sort -no - -| samtools view -h - &gt; {.}_sorted.sam"&lt;/code&gt;

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Mon, 11 Mar 2013 16:53:15 -0000</pubDate><guid>https://sourceforge.net37250b267bf01bc34e571e11b6f6d8666467233a</guid></item><item><title>WikiPage GS modified by Giese, Sven</title><link>https://sourceforge.net/p/arden/wiki/GS/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v17
+++ v18
@@ -11,15 +11,15 @@
 #Creating an artificial reference genome#
 The first step will create the artificial reference genome. The options can be seen with:

-&lt;code&gt; arden CreateAR&lt;/code&gt;
+&lt;code&gt; arden-create&lt;/code&gt;

 Moreover a list of examples can be printed with:

-&lt;code&gt; arden CreateAR -e 1&lt;/code&gt;
+&lt;code&gt; arden-create -e 1&lt;/code&gt;

 The following command line will produce a artificial reference with a substitution on every 21th nucleotide (-d 21). Orfs will not be protected from mutations (-o 0) and the name for the sequence is zymamo (-n). Note that the options for CreateAR are written after the first -o without blanks!

-&lt;code&gt; arden CreateAR /data/test/ /data/test/NC_006526.fna -o -d21-o0-nzymamo &lt;/code&gt;
+&lt;code&gt; arden-create /data/test/ /data/test/NC_006526.fna -o -d21-o0-nzymamo &lt;/code&gt;

 #Mapping step
 For this step map the simulated reads to the reference and the created arti
@@ -78,9 +78,25 @@
 the programm call to compare the alignments can
 be done with:

-&lt;code&gt; arden analyse /data/test/ControlFile.ini /data/test/results/ &lt;/code&gt; 
+&lt;code&gt; arden-analyze /data/test/ControlFile.ini /data/test/results/ &lt;/code&gt; 
+
+## Generated Output
+ The program call will generate four types of output:
+
+* .esam files: The files contain the alignment information extracted by ARDEN in a simpliefied "sam" format.
+* results.txt: This textfile contains the results table with the overview about numerical results (senstivitiy, specificity, AUC) and all read mappers
+* ROC plots: Plots of the correspoding ROC curve for each mapper.
+* ROC tables: Tables for each mapper, where for each subclass the sensitivity, specificity and M are displayed. This table is usefull to determine an adequate quality threshold.
+

 #Filtering (optional)
+
+To filter alignments according to the features RSQ, gaps and mismatches just run:
+
+&lt;code&gt; arden-filter input.sam output.sam -r x -m y - g z &lt;/code&gt;
+
+
+
 #References
 \[1] Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer,
 N., Marth, G., et al. (2009). The Sequence Alignment/Map for-mat and SAMtools. Bioinformatics (Oxford, England), 25(16), 2078-9. doi:10.1093/bioinformatics/btp352
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Giese, Sven</dc:creator><pubDate>Thu, 07 Mar 2013 14:18:11 -0000</pubDate><guid>https://sourceforge.net49ed92a5e6c539ee092ec32e868624c4ee2d2ef3</guid></item></channel></rss>