<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to BuildingDatabases</title><link>https://sourceforge.net/p/altanalyze/wiki/BuildingDatabases/</link><description>Recent changes to BuildingDatabases</description><atom:link href="https://sourceforge.net/p/altanalyze/wiki/BuildingDatabases/feed" rel="self"/><language>en</language><lastBuildDate>Sat, 01 Feb 2014 01:07:02 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/altanalyze/wiki/BuildingDatabases/feed" rel="self" type="application/rss+xml"/><item><title>BuildingDatabases modified by Anonymous</title><link>https://sourceforge.net/p/altanalyze/wiki/BuildingDatabases/</link><description>&lt;div class="markdown_content"&gt;&lt;h2 id="creating-an-altanalyze-database-from-scratch"&gt;Creating an &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; Database From Scratch&lt;/h2&gt;
&lt;p&gt;Although most users will wish to wait for the official &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; database release, users and developers can also build and customize these databases themselves. &lt;/p&gt;
&lt;h3 id="adding-support-for-a-new-ensembl-build"&gt;Adding Support for a New Ensembl Build&lt;/h3&gt;
&lt;p&gt;New &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; databases can be built in a semi-automated fashion using builtin &lt;a class="" href="/p/altanalyze/wiki/CommandLineMode/"&gt;command-line&lt;/a&gt; arguments. Below is the exact method that &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; developers use to build each new version of the &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; database (excluding code updates). For microarray specific databases (e.g., Affymetrix Exon 1.0, Gene 1.0 and Junction JAY arrays), users will need to download two files for each array type and species before running the update. Files can be found &lt;a class="" href="http://www.affymetrix.com/support/technical/byproduct.affx?" rel="nofollow"&gt;here&lt;/a&gt;. &lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Probeset annotation file &lt;/li&gt;
&lt;li&gt;Probeset sequence file (Affymetrix Exon arrays only) &lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;An example probeset annotation file is the "MoEx-1_0-st-v1.na31.mm9.probeset.csv.zip". These files will need to be saved to the appropriate species and array type destination directory for that species and extracted to that directory. For example, if downloading the human exon array annotations for build 61 of Ensembl, save these to the folder &lt;a class="" href="../AltDatabase"&gt;AltDatabase&lt;/a&gt;/EnsMart61/Hs/exon. &lt;/p&gt;
&lt;p&gt;Once installation is complete, additional files will have to be added to the final database installation package, however, these can be found with existing database installations (probes_to_remove and probeset-probes association - e.g., &lt;a class="" href="../AltDatabase"&gt;AltDatabase&lt;/a&gt;/&lt;a class="" href="../EnsMart55"&gt;EnsMart55&lt;/a&gt;/Hs). &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Updating Configuration File&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Default files to download are specified for &lt;a class="" href="/p/altanalyze/wiki/UniProt/"&gt;UniProt&lt;/a&gt;, Ensembl, miRbase, TargetScan and the UCSC Genome Browser. For the custom &lt;a class="" href="../AltMouse"&gt;AltMouse&lt;/a&gt; array, additional files are downloaded during the build process from our server if not locally found. For &lt;a class="" href="/p/altanalyze/wiki/UniProt/"&gt;UniProt&lt;/a&gt;, UCSC, Ensembl databases, the user will not need to update the download path specified in the configuration file. When miRanda and TargetScan predictions are updated, you can change the path to the most recent version indicated on their websites (see the listed target URL) by opening the file Config/default-files.csv and changing the paths. Otherwise, no changes need to be made by the user. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Build Commands&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Build all databases for all platforms (RNASeq and arrays) and all species&lt;/em&gt;&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="n"&gt;AltAnalyze&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;species&lt;/span&gt; &lt;span class="n"&gt;all&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;platform&lt;/span&gt; &lt;span class="n"&gt;all&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;all&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;version&lt;/span&gt; &lt;span class="mi"&gt;61&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;em&gt;Build specific databases for select arrays and species&lt;/em&gt;&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;AltAnalyze&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;Ensembl&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;UniProt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;Probeset&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;Domain&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;miRBs&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;species&lt;/span&gt; &lt;span class="n"&gt;Mm&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;arraytype&lt;/span&gt; &lt;span class="n"&gt;exon&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;arraytype&lt;/span&gt; &lt;span class="n"&gt;gene&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;arraytype&lt;/span&gt; &lt;span class="n"&gt;junction&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;version&lt;/span&gt; &lt;span class="mi"&gt;61&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;em&gt;Build specific databases for RNASeq and species&lt;/em&gt;&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;AltAnalyze&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;Ensembl&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;UniProt&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;ExonAnnotations&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;Domain&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;miRBs&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;species&lt;/span&gt; &lt;span class="n"&gt;Mm&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;platform&lt;/span&gt; &lt;span class="n"&gt;RNASeq&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;version&lt;/span&gt; &lt;span class="mi"&gt;61&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;em&gt;Note&lt;/em&gt;: --arraytype and --platform are interchangeable as is Probeset and ExonAnnotations. &lt;/p&gt;
&lt;p&gt;The machine these arguments are run on must have Python installed (recommended is version 2.7.1) and &lt;a class="" href="../BioPython"&gt;BioPython&lt;/a&gt;. To ensure &lt;a class="" href="../BioPython"&gt;BioPython&lt;/a&gt; is installed correctly, &lt;a class="" href="/p/altanalyze/wiki/setBioPython/"&gt;follow these steps&lt;/a&gt;. These build commands, when run on a server will produce a complete &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; database with additional build files. To create a downloadable distribution archive for users, you can enter the below command to write species database zip files to the folder 'ArchiveDBs' in the &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; program directory. &lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="n"&gt;AltAnalyze&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;update&lt;/span&gt; &lt;span class="n"&gt;package&lt;/span&gt; &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;version&lt;/span&gt; &lt;span class="mi"&gt;61&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;
&lt;h3 id="adding-support-for-a-new-speciesarraydata-type"&gt;Adding Support for a New Species/Array/Data Type&lt;/h3&gt;
&lt;p&gt;Explicitly adding support for new species, array or data type (e.g., RNA-sequencing exon and junction data), requires either modification of the &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; source code or direct modification of the &lt;a class="" href="/p/altanalyze/wiki/AltAnalyze/"&gt;AltAnalyze&lt;/a&gt; database files. &lt;/p&gt;
&lt;h2 id="under-construction"&gt;Under Construction&lt;/h2&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Sat, 01 Feb 2014 01:07:02 -0000</pubDate><guid>https://sourceforge.net87c43330f01fee1a1bd5932a6b3a45f13f377dd5</guid></item></channel></rss>