Output

Robert Forder
There is a newer version of this page. You can find it here.

A brief description of the output format for ESTReMo. Each generation, statistics on the fittest organism in the population are written to the output file. They have the following form:

Gen.  Adj. Rseq  Rfreq     Rseq      MI        BG-Sum    NCR-Sum   TFs        BG-Size    Fitness
772,  5.046941,  5.965784, 6.128962, 5.047216, 4.476329, 7.538286, 34.000000, 1000,      0.000000

Site       SitePos SiteScore  AvgScore   Act.    Target  Score(1) Score(2) ...
ACGCTACT,  20,     0.4667,    0.2801,    0.0082, 1.0000, 0.4667,  0.1332
CGCCTGCA,  25,     0.4719,    0.2763,    0.0083, 1.0000, 0.0021,  0.4719
ACTTGGCT,  40,     0.4859,    0.2692,    0.0085, 1.0000, 0.1332,  0.4859
AGTCAGCG,  45,     0.4704,    0.2723,    0.0083, 1.0000, 0.4704,  0.1231
AGCTTGAA,  31,     0.4236,    0.2637,    0.0074, 1.0000, 0.4236,  0.0001
AGTTAGGT,  43,     0.4719,    0.2680,    0.0083, 1.0000, 0.4719,  0.0022
CTTTGGCT,  24,     0.4657,    0.2659,    0.0082, 1.0000, 0.1192,  0.4657
CCTCAATT,  39,     0.4194,    0.2730,    0.0074, 1.0000, 0.3321,  0.4194
Field Explanation
Gen. Number of generations which have elapsed (number of iterations of genetic algorithm).
Adj. Rseq Corrected Rsequence (corrects for small sample bias). Rsequence is a measure of the column wise information content in the motif.
Rfreq This is the expected value of Rsequence (a measure of the minimum information content required to identify each of the binding sites.
Rseq Rsequence value prioer to correction.
MI Mutual information. A measure of the dependency between positions in sites.
BG-Sum Sum of scores which the recognizer assigns to sites in the background (non-binding sites).
NCR-Sum Sum of scores assigned to binding sites.
TFs Quantity of transcription factor molecules in organism.
BG-Size Number of times the background is sampled (analgous to the size of the genome).
Fitness How fit is the organism (lower is better, zero is perfect).
Site The binding site.
SitePos The position in the promoter region of the best site.
SiteScore TF affinity for the best site in this NCR.
AvgScore. The average of the occupancies of all sites in the NCR
Act. Activation level. How "turned on" the gene associated with this binding site is.
Target Minimum activation level to be achieved for perfect fitness
Score(n). Score. TF affinity for the n-th position in the NCR.

Target | Minimum activation level required for perfect fitness.


MongoDB Logo MongoDB