| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| csharp | 2013-01-10 | ||
| java | 2011-04-22 | ||
| README.txt | 2013-02-03 | 2.6 kB | |
| SeqEntropy-demo-20130203.zip | 2013-02-03 | 4.1 MB | |
| Totals: 4 Items | 4.1 MB | 0 |
2.0.1 README - 20130203
======================================================================================
* Sequence Read Entropy
*
* Copyright (c) Hsueh-Ting Chu
* 4F., No.286, Defu Rd., South Dist., Taichung City 402, Taiwan.
* All rights reserved.
*
* This file is a part of the Sequence Read Entropy Program.
* The use and distribution terms for this software are covered by the
* Common Public License 1.0 (http://opensource.org/licenses/cpl1.0.php).
* By using this software in any fashion, you are agreeing to be bound by
* the terms of this license. You must not remove this notice, or
* any other, from this software.
* 2.0.1 Beta 2013/01/11 HTChu
*
======================================================================================
(1) Quick usage
Using E. coli str. K-12 substr. MG1655 chromosome NC_000913.fa:
(1.a) Demo command:
(Windows) SeqReadEntropy SeqReadEntropy NC_000913.fa 35
(1.b) parameters
SeqReadEntropy sequecne_file(.fa) read_Length
(1.c) Output:
Entropy (9114118/9279282)of Length= 35 = 6.95184240948066 = 6.96751437325532 = 0.224929048367957%
The result indicates the genome of the E. coli strain will lose 0.22% entropy if it is sequenced with reads of length=35bp.
-----------------------------------------------------------------------
(2) Multiple sequence files for big genomes such as human genome
using the Feb. 2009 assembly of the human genome (hg19, GRCh37 Genome)
avaialable at http://hgdownload-test.cse.ucsc.edu/goldenPath/hg19/bigZips/chromFa.tar.gz
-----------------------------------------------------------------------
(2.a) Demo command:
(Windows) SeqReadEntropy *.fa 100 5
(2.b) parameters
EBARDenovo multiple_sequecne_file(*.fa) read_Length clusters(N)
Divide the computation into 4^N times.
(2.c) Output:
Entropy (5591277928/5722624360)of Length= 100 = 9.73162717417607 = 9.75759523922774 = 0.266131812347326%
The result indicates the genome of the human genome will lose 0.26% entropy if it is sequenced with reads of length=100bp.
======================================================================================
(3) Sequence Read Repeat
======================================================================================
(2.a) Demo command:
(Windows) SeqReadRepeat *.fa 100 5
(2.b) parameters
EBARDenovo multiple_sequecne_file(*.fa) read_Length clusters(N)
Divide the computation into 4^N times.
(2.c) Output:
(Repeats=118 AllReads=252222)of Length= 30 = 0.0467841821887068%
The result indicates the genome of the genome will have 0.047% repeats if it is sequenced with reads of length=100bp.