<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Data loading</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>Recent changes to Data loading</description><atom:link href="https://sourceforge.net/p/cellx/wiki/Data%20loading/feed" rel="self"/><language>en</language><lastBuildDate>Tue, 01 Mar 2016 17:48:53 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/cellx/wiki/Data%20loading/feed" rel="self" type="application/rss+xml"/><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v5
+++ v6
@@ -23,5 +23,5 @@
 Mutation formats varied between TCGA releases such that identical, yet unique mutations may be entered. eg. KRAS G12D, p.G12D.  Try to catch these and remove. fixMutations.pl

 EXPRESSION
-Normalized RSEM values from TCGA are log2.  Any zero or negative values are set to 0.01
+Normalized RSEM values from TCGA are converted to log2.  Any zero or negative values are set to 0.01
 Values are stored in the database as integers. ( expression x 1000).  They need to be divided by 1000 after retrieval.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Tue, 01 Mar 2016 17:48:53 -0000</pubDate><guid>https://sourceforge.net3bb070e5c00576d7da99e49ed08422f6aa676b5b</guid></item><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v4
+++ v5
@@ -13,3 +13,15 @@
 CNV
 too many fragments is an indication that the SNP6 chip failed.
 remove samples &amp;gt;=3500 segments from the database.
+
+MYSQL
+Data cleaned with cleanDatabase.pl and cleanDatabaseMETA.pl
+Optimized tables to shrink database size after cleaning.
+mysqlcheck -o &amp;lt;db_name&amp;gt; -u &amp;lt;username&amp;gt; -p
+
+MUTATIONS
+Mutation formats varied between TCGA releases such that identical, yet unique mutations may be entered. eg. KRAS G12D, p.G12D.  Try to catch these and remove. fixMutations.pl
+
+EXPRESSION
+Normalized RSEM values from TCGA are log2.  Any zero or negative values are set to 0.01
+Values are stored in the database as integers. ( expression x 1000).  They need to be divided by 1000 after retrieval.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Tue, 01 Mar 2016 17:48:25 -0000</pubDate><guid>https://sourceforge.netb4c85f103bb381654eee89170ce1c461c127e719</guid></item><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v3
+++ v4
@@ -9,3 +9,7 @@

 [RNA-Seq]
+
+CNV
+too many fragments is an indication that the SNP6 chip failed.
+remove samples &amp;gt;=3500 segments from the database.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Mon, 30 Nov 2015 21:23:09 -0000</pubDate><guid>https://sourceforge.net5340b81d21a209f4e39a4b318e4b9a7d5f0141aa</guid></item><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v2
+++ v3
@@ -6,3 +6,6 @@
   2. process files
   3. parse files
   4. load files
+
+
+[RNA-Seq]
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Mon, 13 Apr 2015 17:47:47 -0000</pubDate><guid>https://sourceforge.net1499e6d5beb9fdadcda0d2cf37a904e930e628a5</guid></item><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v1
+++ v2
@@ -0,0 +1,8 @@
+Loading data into CELLX
+
+
+TCGA
+  1. fetch files
+  2. process files
+  3. parse files
+  4. load files
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Fri, 10 Oct 2014 22:38:22 -0000</pubDate><guid>https://sourceforge.net5710f74ba34e7c139029c1a1aad80d5c02579079</guid></item><item><title>Data loading modified by Keith Ching</title><link>https://sourceforge.net/p/cellx/wiki/Data%2520loading/</link><description/><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Keith Ching</dc:creator><pubDate>Fri, 10 Oct 2014 22:36:46 -0000</pubDate><guid>https://sourceforge.net642e139af9472f5ff178d6a8c688f7668c60b776</guid></item></channel></rss>