SimulaTE allows users to simulate arbitrarily complex landscapes of transposable elements (TEs). Additionally, reads may be simulated using the genomes of the individuals in the population as templates. Reads may be simulated for different sequencing technologies (PacBio, Illumina paired-end) and strategies (sequencing of individuals or of pooled populations). SimulaTE will greatly aid in evaluating the suitability of different approaches for estimating TE abundance within populations and in testing whether given genomic resources, such as a reference genome or a TE database (a FASTA file containing consensus sequences of TEs), are suitable for TE identification.
As a major innovation, we developed a Domain Specific Language (DSL) that allows users to specify arbitrarily complex TE landscapes using a simple syntax. A DSL is a custom-tailored programming language optimized for a specific purpose; in the case of SimulaTE, that purpose is describing TE landscapes.
The DSL implemented in SimulaTE allows users to specify the following properties of TE landscapes:
Wiki: Manual
Wiki: TheClassic_SanMiguel_TELandscape
Wiki: Validate_reads
Wiki: Validate_unit_tests
Wiki: Validation_Pop2
Wiki: Walkthrough
Wiki: Walkthrough_species_tool_compatibility
Wiki: Walkthrough_toy
Wiki: describing_TE_landscapes
Wiki: describing_TE_sequences
Wiki: special_use_cases