<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Home</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>Recent changes to Home</description><atom:link href="https://sourceforge.net/p/crisprtrack/wiki/Home/feed" rel="self"/><language>en</language><lastBuildDate>Mon, 23 Apr 2018 03:14:13 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/crisprtrack/wiki/Home/feed" rel="self" type="application/rss+xml"/><item><title>Home modified by Tony J. Lam</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v4
+++ v5
@@ -80,7 +80,9 @@
 Runs CRISPRtrack, sets directory containing contigs as 'test/', output directory set as current working directory, utilizes both denovo-based and reference-based methods for CRISPR prediction, output similarity table for between donor and recipient samples based on spacer content.

 ## Example Usage 2 -- estimate microbiome similarity based on the sharing of CRISPR spacers in general
+```
 python CRISPRtrack.py -i test -o test -m test/FMT_metadata_example2.txt -p
+```
 Runs CRISPRtrack, using test directory as output directory, denovo-based method for CRISPR prediction, outputs pairwise similarity table based on spacer content.

 ## Outputs
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tony J. Lam</dc:creator><pubDate>Mon, 23 Apr 2018 03:14:13 -0000</pubDate><guid>https://sourceforge.net094401e817d05a90406c30a1928abafb1c169458</guid></item><item><title>Home modified by Tony J. Lam</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v3
+++ v4
@@ -1,6 +1,6 @@
 =========================================
-CRISPRtrack
-Version: v1.0.0 (April 20, 2018)
+# CRISPRtrack
+Version: v1.0.1 (April 22, 2018)

 Developers: Yuzhen Ye (yye@indiana.edu) and Tony J. Lam (tjlam@indiana.edu)

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tony J. Lam</dc:creator><pubDate>Mon, 23 Apr 2018 03:08:06 -0000</pubDate><guid>https://sourceforge.net71454b7c8c48f72b526e646238e0aef2ac575cd7</guid></item><item><title>Home modified by Tony J. Lam</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v2
+++ v3
@@ -1,66 +1,117 @@
+=========================================
 CRISPRtrack
-=====
-Version: v1.0.0 (April 19, 2018)
+Version: v1.0.0 (April 20, 2018)

-Yuzhen Ye (yye@indiana.edu, yuzhenye@gmail.com)
+Developers: Yuzhen Ye (yye@indiana.edu) and Tony J. Lam (tjlam@indiana.edu)

-Tony J. Lam (tjlam@indiana.edu)
+School of Informatics, Computing and Engineering, Indiana University, Bloomington

-School of Informatics, Computing, and Engeneering, IUB
+This work was supported by NIH grant 1R01AI108888 to YY
+CRISPRtrack is free software under the terms of the GNU General Public License as published by
+the Free Software Foundation.
+==========================================

-CRISPRtrack is used for strain based tracking via microbiome CRISPR arrays.  
-Utilizes two approaches for prediction of CRISPR arrays: de novo based (default) and reference based (optional).  
-de novo based prediction utilizes CRISPRone,  
-Reference based prediction utilizes CRISPRAlign.  
+## Introduction
+CRISPRtrack uses CRISPR spacers as molecular markers to track bacterial strains.
+It can be used to estimate the microbiome similarity based on the sharing of CRISPR spacer contents
+between the microbiomes. It can also be used to quantify the retention of donor strains in recipients
+that receive microbiota transfer treatment (such as fecal microbiota transfer, FMT) using microbiome data.
+
+CRISPRtrack Utilizes two approaches for prediction of CRISPR arrays: de novo approach (default) and reference based (optional).
+- de novo prediction utilizes CRISPRone,
+- reference based prediction utilizes CRISPRAlign.
+
+The reference based approach relies on reference CRISPR repeats to identify CRISPR arrays that contain repeats similar to the reference repeats.
+We include in the package a set of reference repeats for characterizing human gut microbiomes (gutref-expanded.fna, see below); these repeats were extracted from human gut-associated bacterial genomes.

 ## Dependencies
-Python 2.7+, Java, R 
-Matplotlib v2.2.0+  
-sklearn v0.19.0+  
-Pandas v0.21.0+  
-
-## Installation 
-tar -xvf &amp;lt;filename&amp;gt;  
+Python 2.7+, Java

 ## Usage 
 ```
-CRISPRtrack.py [-h] [-i INPUT_DIRECTORY] [-o OUTPUT_DIR] -m METADATA [-r, --reference] 
-               [--ref_fast REF_FAST] [--reference REFERENCE] [--CRISPRAlign CRISPRALIGN] 
-               [--cdhit CDHIT] [--CRISPRone CRISPRONE]  
+usage: CRISPRtrack.py [-h] [-i INPUT_DIRECTORY] [-o OUTPUT_DIR] -m METADATA
+                      [-r] [--ref_fast REF_FAST] [--CRISPRAlign CRISPRALIGN]
+                      [--cdhit CDHIT] [--CRISPRone CRISPRONE]

-optional arguments:  
-  -h, --help            show this help message and exit  
-  -i INPUT_DIRECTORY, --input_directory INPUT_DIRECTORY  
-                        Input directory of genome assembly files. (default:  
-                        current working directory).  
-  -o OUTPUT_DIR, --output_dir OUTPUT_DIR  
-                        Output directory to create output files in (default:  
-                        current working directory).  
-  -m METADATA, --metadata METADATA  
-                        Input metadata file. (required).  
-  -r, --reference       Run reference based (CRISPRAlign), default=FALSE  
-  --ref_fast REF_FAST   Input CRISPR repeat for reference based search in  
-                        fasta foremat, default = /bin/gutref-expanded.fna  
-  --reference REFERENCE  
-                        Run reference based (CRISPRAlign), default off, uses  
-                        reference crispr repeats if provided, else default:  
-                        /bin/gutref-expanded.fna  
-  --CRISPRAlign CRISPRALIGN  
-                        Path to CRISPRAlign. (default =  
-                        ./bin/CRISPRAlign/CrisprAlign)  
-  --cdhit CDHIT         Path to CD-HIT. (default = ./bin/cdhit/)  
-  --CRISPRone CRISPRONE  
-                        Path to CRISPRone. (default  
-                        =./bin/CRISPRone/crisprone-local-nocas.php)  
+CRISPRtrack, CRISPR based strain tracking
+
+optional arguments:
+  -h, --help            show this help message and exit
+  -i INPUT_DIRECTORY, --input_directory INPUT_DIRECTORY
+                        Input directory of genome assembly files. (default:
+                        current working directory).
+  -o OUTPUT_DIR, --output_dir OUTPUT_DIR
+                        Output directory to create output files in (default:
+                        current working directory).
+  -m METADATA, --metadata METADATA
+                        Input metadata file. (required).
+  -r, --reference       Run reference based (CRISPRAlign), default=FALSE
+  --ref_fast REF_FAST   Input CRISPR repeat for reference based search in
+                        fasta foremat, default = CRISPRtrack/bin/gutref-
+                        expanded.fna
+  --CRISPRAlign CRISPRALIGN
+                        Path to CRISPRAlign. (default =
+                        CRISPRtrack/bin/CRISPRAlign/CrisprAlign)
+  --cdhit CDHIT         Path to CD-HIT. (default =
+                        CRISPRtrack/bin/CRISPRone/bin/cd-
+                        hit-v4.6.1-2012-08-27/)
+  --CRISPRone CRISPRONE
+                        Path to CRISPRone. (default =
+                        CRISPRtrack/bin/CRISPRone/crisprone-local-nocas.php)
+```
+## Metadata Format 
+txt file in spacer or tab delimited format
+```
+&amp;lt;sample-name&amp;gt; &amp;lt;subject&amp;gt; &amp;lt;assembly-file&amp;gt; &amp;lt;donor&amp;gt; &amp;lt;donor recipient=""&amp;gt; &amp;lt;date&amp;gt;
+```
+CRISPRtrack evaluate all files included in metadata file.
+
+Note that CRISPRtrack can be used to estimate the similarity between any two microbiomes (not necessarily from donor and/or recipient).
+For this case, CRISPRtrack will utilize the first three columns of the metafile, while also invoking the -p flag.
+```
+&amp;lt;sample-name&amp;gt; &amp;lt;subject&amp;gt; &amp;lt;assembly-file&amp;gt;
+```
+Utilization of the -p flag will override the standard similarity output, and output pairwise similarities between all samples in metadata file.
+
+## Example Usage 1 -- FMT data analysis (with donor &amp;amp; recipient information)  
+```
+python CRISPRtrack.py -i test/ -m test/FMT_metadata_example1.txt -r
+```
+Runs CRISPRtrack, sets directory containing contigs as 'test/', output directory set as current working directory, utilizes both denovo-based and reference-based methods for CRISPR prediction, output similarity table for between donor and recipient samples based on spacer content.
+
+## Example Usage 2 -- estimate microbiome similarity based on the sharing of CRISPR spacers in general
+python CRISPRtrack.py -i test -o test -m test/FMT_metadata_example2.txt -p
+Runs CRISPRtrack, using test directory as output directory, denovo-based method for CRISPR prediction, outputs pairwise similarity table based on spacer content.
+
+## Outputs
+There are two main outputs from CRISPRtrack.py:
+1. spacer-subject table (spacertable.\&amp;lt;prediction type=""&amp;gt;.txt)
+2. sample similarity based on spacer content sharing (sample_similarity_table.\&amp;lt;prediction type=""&amp;gt;.txt)
+- (alternatively) pairwise sample similarity, based on spacer content sharing. (pairwise_similarity_table.\&amp;lt;prediction type=""&amp;gt;.txt)
+
+The spacer-subject table: lists the spacers for each subject (sample): the rows are the samples, and the columns are the spacers.
+The sample similarity table: shows the similarity between the subject samples and their donors based on their sharing of the CRISPR spacers.
+The pairwise similarity table: compares sample subjects show the similarity between all permutations of samples listed in the metadata file
+
+Example usage of the spacer-subject table:
+- PCA plot showing the clustering of the samples based on their spacer profiles
+
+Example usage of the sample similarity table:
+- donor strain tracking plot (the recipient-donor microbiome similarity plot)
+
+## Visualization
+
+Dependencies for visualization scripts:
+- R
+
+Users can use their favorite tools to visualize the spacer sharing between microbiomes based on the outputs from CRISPRtrack. We include in this package some scripts for visualization for your reference.
+
+Tracking plot visualization example:
+```
+Rscript ./CRISPRtrack/scripts/tracking-plots.R sample_similarity_table.&amp;lt;prediction type=""&amp;gt;.txt
 ```

-## Outputs
--spacer csv file  
--grouped boxplot  
--PCA plot  
--similarity tables   
--CRISPR tracking tables
-
-## Example Usage  
-python CRISPRtrack.py -i ../assembly_klist_contigs/ -m subset_FMT2.txt   
-
+PCA of spacer clusters:
+```
+Rscript ./CRISPRtrack/scripts/tracking-plots.R sample_similarity_table.&amp;lt;prediction type=""&amp;gt;.txt
+```
&lt;/pre&gt;


&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tony J. Lam</dc:creator><pubDate>Mon, 23 Apr 2018 03:06:33 -0000</pubDate><guid>https://sourceforge.net235ae831ec3588dac1eafa5bfaf9b862c0b38b69</guid></item><item><title>Home modified by Tony J. Lam</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v1
+++ v2
@@ -1,8 +1,66 @@
-Welcome to your wiki!
+CRISPRtrack
+=====
+Version: v1.0.0 (April 19, 2018)

-This is the default page, edit it as you see fit. To add a new page simply reference it within brackets, e.g.: [SamplePage].
+Yuzhen Ye (yye@indiana.edu, yuzhenye@gmail.com)

-The wiki uses [Markdown](/p/crisprtrack/wiki/markdown_syntax/) syntax.
+Tony J. Lam (tjlam@indiana.edu)

-[[members limit=20]]
-[[download_button]]
+School of Informatics, Computing, and Engeneering, IUB
+
+CRISPRtrack is used for strain based tracking via microbiome CRISPR arrays.  
+Utilizes two approaches for prediction of CRISPR arrays: de novo based (default) and reference based (optional).  
+de novo based prediction utilizes CRISPRone,  
+Reference based prediction utilizes CRISPRAlign.  
+
+## Dependencies
+Python 2.7+, Java, R 
+Matplotlib v2.2.0+  
+sklearn v0.19.0+  
+Pandas v0.21.0+  
+
+## Installation 
+tar -xvf &amp;lt;filename&amp;gt;  
+
+## Usage 
+```
+CRISPRtrack.py [-h] [-i INPUT_DIRECTORY] [-o OUTPUT_DIR] -m METADATA [-r, --reference] 
+               [--ref_fast REF_FAST] [--reference REFERENCE] [--CRISPRAlign CRISPRALIGN] 
+               [--cdhit CDHIT] [--CRISPRone CRISPRONE]  
+
+optional arguments:  
+  -h, --help            show this help message and exit  
+  -i INPUT_DIRECTORY, --input_directory INPUT_DIRECTORY  
+                        Input directory of genome assembly files. (default:  
+                        current working directory).  
+  -o OUTPUT_DIR, --output_dir OUTPUT_DIR  
+                        Output directory to create output files in (default:  
+                        current working directory).  
+  -m METADATA, --metadata METADATA  
+                        Input metadata file. (required).  
+  -r, --reference       Run reference based (CRISPRAlign), default=FALSE  
+  --ref_fast REF_FAST   Input CRISPR repeat for reference based search in  
+                        fasta foremat, default = /bin/gutref-expanded.fna  
+  --reference REFERENCE  
+                        Run reference based (CRISPRAlign), default off, uses  
+                        reference crispr repeats if provided, else default:  
+                        /bin/gutref-expanded.fna  
+  --CRISPRAlign CRISPRALIGN  
+                        Path to CRISPRAlign. (default =  
+                        ./bin/CRISPRAlign/CrisprAlign)  
+  --cdhit CDHIT         Path to CD-HIT. (default = ./bin/cdhit/)  
+  --CRISPRone CRISPRONE  
+                        Path to CRISPRone. (default  
+                        =./bin/CRISPRone/crisprone-local-nocas.php)  
+```
+
+## Outputs
+-spacer csv file  
+-grouped boxplot  
+-PCA plot  
+-similarity tables   
+-CRISPR tracking tables
+
+## Example Usage  
+python CRISPRtrack.py -i ../assembly_klist_contigs/ -m subset_FMT2.txt   
+
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tony J. Lam</dc:creator><pubDate>Fri, 20 Apr 2018 20:41:32 -0000</pubDate><guid>https://sourceforge.net6bc6afb36cd9cf9d2a574d1ea65cc02a21dd535b</guid></item><item><title>Home modified by Yuzhen Ye</title><link>https://sourceforge.net/p/crisprtrack/wiki/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;Welcome to your wiki!&lt;/p&gt;
&lt;p&gt;This is the default page, edit it as you see fit. To add a new page simply reference it within brackets, e.g.: &lt;span&gt;[SamplePage]&lt;/span&gt;.&lt;/p&gt;
&lt;p&gt;The wiki uses &lt;a class="" href="/p/crisprtrack/wiki/markdown_syntax/"&gt;Markdown&lt;/a&gt; syntax.&lt;/p&gt;
&lt;p&gt;&lt;/p&gt;&lt;h6&gt;Project Members:&lt;/h6&gt;
	&lt;ul class="md-users-list"&gt;
		&lt;li&gt;&lt;a href="/u/yuzhenye/"&gt;Yuzhen Ye&lt;/a&gt; (admin)&lt;/li&gt;
		
	&lt;/ul&gt;&lt;br/&gt;
&lt;p&gt;&lt;span class="download-button-5accbbe217508574d345b11b" style="margin-bottom: 1em; display: block;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yuzhen Ye</dc:creator><pubDate>Tue, 10 Apr 2018 13:28:02 -0000</pubDate><guid>https://sourceforge.nete833a14e8d31f498d9b953d7247bc46985bc29a7</guid></item></channel></rss>