Hi Muhammet, First, thing, the default -abundance_model is uniform, so if your goal is to produce an uniform distribution of all reference sequences, you can omit the -abundance_file parameter. Second, the default -total_reads is 100, so with such as small number, you may not have exactly 10 reads from each of your 10 references, but close to that (since this process is based on probabilities). Hope this helps! Best, Florent
Hi Laetitia, I finally had the time to have a look at your example. The problem is that you added several sources of errors in your command. Specifically, the Balzer homopolymer model is the one that causes the extra indels you observe. Since you want to model Illumina reads anyway, omit any homopolymer model (these models are more suited for 454 reads). And let me re-iterate what I said in a previous email. If you run into issues, you need to find the smallest reproducible example (few reference...
Hi Guilherme, Grinder is not multi-threaded. However, you can run several Grinder processes simultaneously to take advantage of multiple processing cores. For example, you create a small Bash script that runs 1 Grinder process per chromosome. A note about Grinder speed. There can be a specific performance decrease when you use very large genomes (e.g. human chromosomes) or a very large number of smaller genomes, since all reference sequences are read and kept in memory simultaneously by Grinder....
Hi Aditi, Grinder was written to simulate sequencing errors, but you could likely indirectly use it to include genomic variant "errors". This is an approach I have not tested: 1/ Input your genomes one by one in Grinder and introduce some errors using the error model desired. For the read length parameter, you will need to provide the exact length of your genome to avoid it being truncated or excluded. 2/ After your variants are ready, concatenate them into a single FASTA/FASTQ file. You may need...
Mutations when no error model specified
Hi Chad, Have a look at the description of the reads: 1 reference=test1 position=1..99 2 reference=test1 position=complement(1..99) No errors were introduced. However, as you see, some of the reads were generated from the reference sequence itself and some others from its reverse-complement, as you would expect from random shotgun sequencing. If your use-case calls for unidirectional reads, have a look at the -unidirectional option. Cheers, Florent
Hi Mao, The Illumina model from Korbel assumes that the reads are 100bp. You will need to adjust the parameters if your reads are longer. I suggest you experiment in Excel with this equation (Korbel et al 2009) to keep the same error rate at the last position as you find for reads that are 100 bp: 3e-3 + 3.3e-8 * i^4 Best, Florent
Hi Mao, That's an interesting idea, but it is not supported by Grinder. Best, Florent On Wed, Jun 20, 2018, 20:29 Maoxuan Lin mlin18@users.sourceforge.net wrote: Hi Ginder, I am using Grinder to simulate paired-end reads using a set of reference sequences. My goal is to guarantee every reference sequence has been selected to simulate reads. When I increased the total number of reads to large enough, I achieved this goal. However, I could not find a general rule or specific parameters to achieve this;...
Thank you for confirming this, Ted. As far as I can tell, I have toggled all BIOS settings that seemed relevant. So, I will contact Dell and see if they can patch the BIOS.
Cannot wake up from suspend using USB mouse/keyboard
Sorry, the image above is best opened in a new tab: https://medias.audiofanzine.com/images/normal/tuxguitar-1-2-477626.jpg The track displayed in black is the Percussion track.
Position marker is not visible on tracks that are colored in black
It works for me in TuxGuitar 1.4 on Linux. The documentation path is then: file:///opt/tuxguitar/share/help/about.html
Playback of current tab stops when closing a different tab
Opening the same file again opens in a new tab
Tabs opened from the TuxGuitar Browser shown as "Untitled.tg"
Trying again with TuxGuitar 1.4 and it seems to work fine. Cheers and feel free to close this issue.
Trying again with TuxGuitar 1.4 and it seems to work fine. Cheers!
Too many repeats in GPX files
Abundance file bug
Hi, The sofware is correct. It does not find reference sequence AE000666 because it is not amplified by the primers you provided. You need to not request this sequence or modified your primers so that they amplify it. Best, Florent
How to generate Illumina amplicon reads with pairwise overlap?
Use the -id <insert_dist> option. From the manpage: "The insert is defined in the...
Improved interface for the poly4 error model, i...
Small internal cleanup
Ok, good, closing.
06-seed.t failing test
Bug in read generation with paired-end and coverage.
Hi there! Thanks a lot, I did not think about this potential issue before. This is...
Using Test::Warn to test and silence warnings
Silenced an expected warning during testing
Grinder version 0.5.4
Test update
Fix for the last mate pair sometimes missing th...
Moved exec from 'script/' to conventional 'bin/...
Seems to work fine on my side with perl 5.20.2 Could you try to individually upgrade...
Error when generating chimeras
read_dist parsing problem
Hi Francisco, I have added documentation to clarify what the intended usage is when...
More dev doc updates
Added some docs about the dev version
Fix for Perl 5.18's hash_randomization (reporte...
Updated test on forbidden chars
Errors and warnings in linux Ubuntu
Hi Francisco, Regarding the Statistics::R, R, distrplus warning, I suspect that you...
Hi Noemi, You need to set the parameter -forward_reverse to produce amplicon datasets...
Hi Piotr, The problem is that using kmer-based chimeras requires reference sequences...
Tests failure
Version 0.4.5
Fixed bug when parsing arguments with missing v...
Thanks for checking this harborsiem. This means that the bug report can be close...
I guess this bug report is invalid, then. Unfortunately, I cannot close it myself....
StringIndexOutOfBoundsException when opening a GP5 file
Unable to open GP5 files with UTF8 BOM
That sounds great! Hopefullly, you can make a release soon? I believe there is initial...
Trivial code reduction
More concise test messages
POD update regarding dependencies
Consistent indentation
Revised dependencies... No need to bundle Biope...
Database test improvements (patch by Francisco ...
Fix hash randomization in K-mer tests
Clean temporary DB files
Sequencing error tests more deterministic (patc...
Deterministic tests of community structure (rep...
FASTQ quality value problem
status: open --> closed
ArrayIndexOutOfBoundsException when opening a GP5 tab