cMESSi - Browse Files at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size
simulatorForSolexaReads_1.2_exact_cp.pl	2014-07-30	20.1 kB
simulatorForSolexaReads_1.2.pl	2014-07-30	19.9 kB
simulatorForSolexaReads_1.2_exact.pl	2014-07-30	20.0 kB
simulatorForSolexaReads.pl	2014-07-30	14.6 kB
simulatorForSolexaReads_1.1.pl	2014-07-30	18.0 kB
simulatorCparts.tar.gz	2014-07-30	2.3 MB
sample_genomeInfo.txt	2014-07-30	546 Bytes
sampleAbundanceFile.txt	2014-07-30	364 Bytes
README.txt	2014-07-30	3.7 kB
sample_command.txt	2014-07-30	370 Bytes
fakeQualFileNoError_75bps.txt	2014-07-30	228 Bytes
cMessi.tar.gz	2014-07-30	2.3 MB
Totals: 12 Items		4.7 MB

README

REQUIREMENTS

Perl (with standard packages: GetOpt, File and Math installed)
BioPerl (with SeqIO and Seq installed)

if you cannot used precompiled executables:
GCC
as well as standard bash utils ar and ranlib

argtable2 library (is part of the sourcecode zip file)

INSTALLATION

1. download files

2a. if you need to compile the executables:
#extract sourcecode
tar xvfz simulatorCparts.tar.gz
#change directory
cd simulatorCparts/
#compile
make clean
make

2a. copy the executables to a path of your choice and set this path as "$binfolder" in the the main simulator perl file.

3. run program
perldoc simulatorForSolexaReads.pl
for infos and you will see following:

==> Different version of the program:
==> simulatorForSolexaReads_1.2.pl: updated version of the original
==> simulatorForSolexaReads_1.2_exact.pl: version without stochastic abundance estimation; abundances are directly transformed into readcounts
==> simulatorForSolexaReads_1.2_exact_cp.pl: same as simulatorForSolexaReads_1.2_exact.pl but using convert_project from the mira assembler (needs to be edited for the location of your local convert project_copy, but is much faster then BioPerl)

NAME
iMESSi - Simulator for Illumina metagenomic sequences
SYNOPSIS
simulatorForSolexaReads.pl [options]

OPTIONS
--help Prints this help message
--outfolder Folder for output
--outprefix Prefix of output filenames (default: SolexaSimReads)
--abundanceFile input file with genomes and their organism number
--genomeInfo file with information about the genomes: genome length, copy number in an organism.
--genomeFolder folder the genomes mentioned in abundanceFile are at.
--readlength Read length of simulated reads.
--insertNumber Number of inserts to simulate, from each insert 2 paired-end reads are generated.
--insertSize Mean insert size of simulated reads.
--insertSD Standard deviation from the mean insert size.

--qualityfiles location of quality files for sequencing error and quality generation.
Note that the first file can be given by just specifying it's location and multiple files can be given using folling scheme --qualityfiles="file1.qual -q file2.qual -q file3.qual". [Sample files can be found at http://www.bork.embl.de/~mende/simulated_data/*.qual.gz]

--binfolder Location of generateReads and simulateSequencingErrors, has to be set here or inside the script

Options to add more data to old simulations
--multiplexIdentifier Identifier for multiplex Libraries (default is 0)
--laneNumber Starting lane number
--tileNumber Starting tile number
--xCoord Starting xCoord number
--yCoord Starting yCoord number

DESCRIPTION
This program is used to simulate Illumina reads from a specified genome or metagenome

AUTHOR
Daniel Mende <mende@embl.de>, Alison Waller, et al.
Please cite us:
Mende DR, Waller AS, Sunagawa S, Järvelin AI, Chan MM, et al. (2012) Assessment of Metagenomic Assembly Using Simulated Next Generation Sequencing Data. PLoS ONE 7(2): e31386. doi:10.1371/journal.pone.0031386

We like to thank Falk Hildebrand for implementing the automatic genome info file generation part and Paul Igor Costea for improving the C-code
This program is open source, feel free to send in improved versions.

VERSIONS
Version 1.0 - First release
Version 1.1 - Fixed some bugs and added automatic genome info file generation
Version 1.2 - Added file that gives the exact location every read originates from
Version 1.3 - Improved C-code with high speed-up.

Source: README.txt, updated 2014-07-30

cMESSi Files

Get an email when there's a new version of cMESSi